They are some of the unsung “backroom heroes” of our time fighting the current pandemic
Highlights
Dubai: As SARS-CoV-2 continues to wreak havoc, researchers do painstaking surveillance work to make sense of the thousands of viral genome sequences. The sequences come from different corners of the planet, at a rate of about 5,000 per day. Trying to make sense of the genetic data of a mutating virus is a bit like shooting at a constantly moving target. But their work is important and must continue: the coronavirus pandemic has left 2.6 million people dead (as of Tuesday, March 9, 2021) and 117 million infected.
Genome:
A genome is the complete genetic material of an organism. It is the set of a living being’s genetic instructions. Each genome contains all of the information needed to build that organism and allow it to grow, develop and multiply.
Scientists are fighting back, with knowledge. So far, this diverse group of experts agree on one thing: the virus has shown mutations that confer it higher infectiousness.
The researchers do meticulous work, combining information from the so-called “wet biology” (swab tests) and sequence analyses. They update the data sets daily. The bulk of analyses of the genome sequences of SARS-CoV-2 rests on the initiative of academic researchers — phylogeneticists — who put together software and analytical tools to establish patterns, find essential answers and increase the scientific community’s understanding of the virus behind the pandemic, Nature explains.
The experts track and share real-time data to produce joined-up images of mutations as they emerge and spread. Such collaboration is invaluable (see tweets below). One result of their work: Systematic tracking of the SARS-CoV-2 viral genome. Here’s what we know about the people behind this effort, how they’re able to confirm the existence of new variants, what we know — and don't know:
Phylogenetics:
Phylogenetics is the area of research concerned with finding the genetic connections and relationships between species. This area greatly enhances our understanding of SARS-CoV-2 outbreaks and variants.
Since January 2020, researchers around the world have posted huge numbers of SARS-CoV-2 genome sequences online, through scientific data-sharing platforms. There are different clusters or online databases involved in tracking its mutations around the world.
Armed with such data, they help track and share real-time open-source data to produce joined-up images of mutations as they emerge and spread. Such collaboration is invaluable (see tweets below). One result of their work: Systematic tracking of the SARS-CoV-2 viral genome, and their mutations.
GISAID, EBI, Nextstrain and GenBank are some of these public databases tracking the COVID-19 viral mutations.
GISAID, a scientific data sharing site hosted by Germany, is one of the more popular platforms. As of March 1, 2021, it already holds more than 610,000 viral genomes, the journal Nature reported. That number is estimated to well exceed 1 million by the end of the pandemic. Soumya Swaminathan, the chief scientist at the WHO, has called GISAID a “game-changer” in the pandemic.
On this platform, researchers upload their sequences, and also download other viral genome sequences from counterparts other countries. “There’s quite a good international effort where that sharing happens that helps us in this type of investigation,” Dr Jeop de Ligt, of the New Zealand’s Institute of Environmental and Science Research (ESR), told RNZ channel.
Another database is the European Bioinformatics Institute (EBI) near Cambridge, UK, which hosts its own large genome database that includes SARS-CoV-2 sequences.
is an open-source project involving academic researchers from Switzerland and the US, helping to coordinate analyses of the SARS-CoV-2 genome sequences based on data from GISAID.
GenBank is an open-access genetic sequence database maintained by the US National Institutes of Health (NIH). It is an annotated collection of all publicly available nucleotide sequences and their protein translations. It is produced and maintained by the National Center for Biotechnology Information as part of the International Nucleotide Sequence Database Collaboration.
In theory, these genomes could help us better understand the transmission of the virus, through communities and between countries. That understanding would allow us to curb infections.
The complete genome sequence of SARS-CoV-2 was first released on the GenBank on January 5, 2020. Since then, there has been a rapid accumulation of SARS-CoV-2 genome sequences.
Early sharing of SARS-CoV-2’s genetic data, in January 2020, enabled the rapid development of diagnostics (ways to detect it). The knowledge of the virus’ RNA sequence was key in characterising it. This made possible an increased surveillance of SARS-CoV-2 viral genome, which enables the research community to identify and track new “mutations of concern”, to do the following:
Scientists and clinicians working in different parts of the world gather data, upload them and download data from others to compare the sequences and analyse them. The researchers include people from Argentina to Zimbabwe.
Phylogeneticists analyse, compare and contrast genetic sequences of SARS-CoV-2 viral mutations.
More than 5,000 (and rising) sequences, according to a Nature report.
Yes. Mathematically, the number of possible genetic mutations is greater than all the atoms in the visible universe, Vincent Racaniello, a professor of microbiology and immunology at Columbia University told Live Science. "A good fraction of the genome can be replaced."
Yes, there may be limit, but that limit is currently unknown. It's possible the virus maymutate and still make people sick — or a dominant strain may arise that simply reduces it to a common cold, instead of a deadly pneumonia-like disease. Scientists say they can only track the mutations, but not predict all of the possible mutations the virus could undergo,
There are at least five emerging coronavirus variants with the “most worrying mutations,” with genetic changes that can make them more contagious and “evasive”, reports Scientific American.
Mutation
Mutation is a change in a DNA sequence. It is usually the result of a phenomenon in which nucleotide sequences found within a DNA of any life form are subjected to change. This then leads to a modification in an organism’s genetic makeup. Depending on the mutation, it can prove harmless, helpful, or even hurtful to the organism. SARS-CoV-2, like any virus, it has been mutating since it was first sequenced in early January 2020.
There are two kinds of mutations:
They take place in non-reproductive cells. Certain other mutations can greatly impact the life and function of an organism. For example, somatic mutations that affect cell division (particularly those that allow cells to divide uncontrollably) are the basis for many forms of cancer.
They occur in gametes (an organism's reproductive cells) or in cells that eventually produce gametes. In contrast with somatic mutations, germ-line mutations are passed on to an organism's progeny. As a result, future generations of organisms will carry the mutation in all of their cells (both somatic and germ-line).
They are identified by analysing thousands of samples taken from patients. When a patient tests positive, and a mutation is suspected, researchers use the new sample to run background observations of their genetic code, against an existing database of known mutants. If it's a new strain, it's plotted as a new "leaf" in a "tree" that tracks mutations.
In the process of duplicating the SARS-CoV-2 viral RNA within an infected human cell, "errors" may occur. This results in viruses that are similar, but not exact, copies of the original. These errors in the viral RNA are called "mutations". Viruses with these mutations are called "variants". Variants could differ by a single or multiple mutations.
Samples, usually taken from nasal swabs, are subjected to polymerase chain reaction (RT-PCR) test. Reverse transcriptase and DNA polymerase enzymes are added to the sample. Multiple copies of any viral RNA present in the sample are made. Primers and probes are deployed aimed to target specific segments of the virus’s genome that are unlikely to change over time. This “chain reaction” generates enough copies — allowing for even a small presence of the virus to be detected when the sample is tested. These primers and probes then attach themselves to specific sequences in the virus’s genetic code — the signals are captured, which confirm whether a sample is positive or negative.
PCR tests are performed to actually detect genetic material specific to SARS-CoV-2 or any organism. As such, the detection of new strains can happen almost in real-time. Moreover, the test is highly accurate (compared to other tests) and picks up any virus strains.
Sequencing DNA means determining the order of the four chemical building blocks — "bases" — that make up the DNA molecule. The sequence tells scientists the kind of genetic information that is carried in a particular DNA segment.
A gene is a segment of DNA, and contains the instructions for the production of biological molecules — usually proteins. The DNA contains the instructions found in the genome of any living thing. Within DNA is a unique chemical code that guides growth and development.
The chance of an emergent COVID-19 variant evading testing is highly unlikely. This is due to the unprecedented ability to quickly compare genetic strains of the virus through shared databases, which highlights utility of scientific collaboration on a global scale.
More than 610,000, according to Nature (March 1, 2021). Laboratories around the world are sequencing more SARS-CoV-2 samples; the number is expected to exceed 1 million by the end of the pandemic.
In theory, these genomes could help the scientific community better understand the spread of the virus or any pathogen (disease-causing organism, bacterium, virus) across the globe.
GENOMICS:
The branch of molecular biology concerned with the structure, function, evolution, and mapping of genomes.
The bulk of analyses of the genome sequences of SARS-CoV-2 rests on the initiative of academic researchers — phylogeneticists — who put together software and analytical tools to establish patterns, find essential answers and increase the scientific community’s understanding of the virus behind the current pandemic, according to Nature.
Pretty effective and pretty quick. For example, less than two hours after the spread of a new variant (now called 501Y.V1, or B.1.1.7, first detected in the UK) was announced by the UK health minister in December 2020, Nextstrain researchers had provided context for its key mutations (via Twitter), and showed its progression in the UK and across Europe in the months prior. The Twitter thread became a key source of information on the new variant. Over the Christmas 2020 break, Nextstrain researchers crunched further sequences and briefed journalists.
Sequencing, combined with data from epidemiologists (taken from hospitals and PCR tests), provide near real-time knowledge of the emergence of new variants and their biology, explained Nature.
Genetic sequencing costs
Genetic sequencing costs have come down significantly. * In mid-2015, the cost to generate a high-quality “draft" whole human genome sequence was just above $4,000; by late in 2015, that figure had fallen below $1,500. * In 2018, the cost to sequence your entire genome has gone further down to $200, according to Wired. * PCR tests, which accurately test viral signature, have come down in price and has become routine.
With SARS-CoV-2, many high-income countries (such as Iceland, Luxembourg, and Japan) have sequenced the most viral genomes per 1,000 cases. Many countries, especially in Africa, have no sequencing data at all. However, Gambia, Equatorial Guinea, and Sierra Leone have a higher rate of sequencing than France, Italy, or the USA, according to GISAID.
No. But there should be a sufficient level to detect and track mutations and their effects — both for COVID-19 and for future emerging and re-emerging infections, say scientists.
Phylogeneticists had been trained to use certain tools to track transmission, flag key mutations, calculate metrics and inform public health officials. But they’re also humans, subject to certain limitations. Some of them are buckling under pressure due to the technical challenges they face in automating their work of keeping track of on-going viral mutations of SARS-CoV-2. They bare their angsts, discoveries, challenges — and their minds — on Twitter, as well as on journals such as Nature, or Lancet.
It’s a result of disparate systems used. Sequences are being made available on several different databases, websites, and platforms. At present there is an absence of standardised nomenclature for variants, which contributes to a lack of clarity, according to experts.
Sequencing viral genomes is important; what’s more important, however, is to have enough data to help researchers understand the effects of mutations on the virus's biology and cross-refer it with clinical data. Researchers — especially “phylogeneticists” — also have their limits. For example, Nextstrain was previously used to track influenza and Ebola outbreaks, through small updates every week or month. Now, researchers need to update their analyses daily.
This presents enormous challenges for people running the system — those who run data analytics, bioinformatics and viral evolution research — whose tools are now being stretched to their utmost limit when they are most needed.
“We must move beyond the limitations of existing tools and improve processes, so that they are fit to handle a pandemic,” wrote Dr Emma Hodcroft, co-developer of Nextstrain.
There's hope that vaccines would help resolve this pandemic, but some experts now say this may not be necessarily true. Real-time global genomic surveillance of pathogens is a key weapon in the world's arsenal against the outbreak. By its very nature, effective viral genomic surveillance needs to be a global concern, the Lancet asserts. It must be widely adopted, powered by seamless open data sharing.
There's a lot more research that's going to be needed to know how the dynamics of this virus will continue and change. A lot more work needs to be done for a scientifically-informed response that would put the pandemic, the mask mandates, social distancing, infections and untimely deaths behind us. In these COVID-19 times, "nobody is safe until everyone is safe," says the UN Deveopment Programme.
Sign up for the Daily Briefing
Get the latest news and updates straight to your inbox
Network Links
GN StoreDownload our app
© Al Nisr Publishing LLC 2025. All rights reserved.