The genomic variation landscape of globally-circulating clades of SARS-CoV-2 defines a genetic barcoding scheme
dc.contributor.author | Guan, Qingtian | |
dc.contributor.author | Sadykov, Mukhtar | |
dc.contributor.author | Nugmanova, Raushan | |
dc.contributor.author | Carr, Michael J | |
dc.contributor.author | Arold, Stefan T. | |
dc.contributor.author | Pain, Arnab | |
dc.date.accessioned | 2020-04-26T09:54:25Z | |
dc.date.available | 2020-04-26T09:54:25Z | |
dc.date.issued | 2020-04-23 | |
dc.identifier.citation | Guan, Q., Sadykov, M., Nugmanova, R., Carr, M. J., Arold, S. T., & Pain, A. (2020). The genomic variation landscape of globally-circulating clades of SARS-CoV-2 defines a genetic barcoding scheme. doi:10.1101/2020.04.21.054221 | |
dc.identifier.doi | 10.1101/2020.04.21.054221 | |
dc.identifier.uri | http://hdl.handle.net/10754/662635 | |
dc.description.abstract | We describe fifteen major mutation events from 2,058 high-quality SARS-CoV-2 genomes deposited up to March 31st, 2020. These events define five major clades (G, I, S, D and V) of globally-circulating viral populations, representing 85.7% of all sequenced cases, which we can identify using a 10 nucleotide genetic classifier or barcode. We applied this barcode to 4,000 additional genomes deposited between March 31st and April 15th and classified successfully 95.6% of the clades demonstrating the utility of this approach. An analysis of amino acid variation in SARS-CoV-2 ORFs provided evidence of substitution events in the viral proteins involved in both host-entry and genome replication. The systematic monitoring of dynamic changes in the SARS-CoV-2 genomes of circulating virus populations over time can guide therapeutic and prophylactic strategies to manage and contain the virus and, also, with available efficacious antivirals and vaccines, aid in the monitoring of circulating genetic diversity as we proceed towards elimination of the agent. The barcode will add the necessary genetic resolution to facilitate tracking and monitoring of infection clusters to distinguish imported and indigenous cases and thereby aid public health measures seeking to interrupt transmission chains without the requirement for real-time complete genomes sequencing. | |
dc.description.sponsorship | This work was supported by funding from King Abdullah University of Science and Technology (KAUST), Office of Sponsored Research (OSR), under award number FCC/1/1976-25-01. Work in AP’s laboratory is supported by the KAUST faculty baseline fund (BAS/1/1020-01- 01) and research grants from the Office for Sponsored Research (OSR-2015-CRG4-2610, OCRF-2014-CRG3-2267). We thank all laboratories which have contributed sequences to the GISAID database. We thank Olga Douvropoulou, Raeece Naeem Mohamed Ghazzali and Sharif Hala for their support during the work. We also thank Richard Culleton (Nagasaki University, Japan) and Gabo Gonzalez (UCD, Ireland) for their critical comments on the manuscript draft. | |
dc.publisher | Cold Spring Harbor Laboratory | |
dc.relation.url | http://biorxiv.org/lookup/doi/10.1101/2020.04.21.054221 | |
dc.relation.url | https://www.biorxiv.org/content/biorxiv/early/2020/04/23/2020.04.21.054221.full.pdf | |
dc.rights | Archived with thanks to Cold Spring Harbor Laboratory | |
dc.title | The genomic variation landscape of globally-circulating clades of SARS-CoV-2 defines a genetic barcoding scheme | |
dc.type | Preprint | |
dc.contributor.department | Biological and Environmental Sciences and Engineering (BESE) Division | |
dc.contributor.department | Bioscience | |
dc.contributor.department | Bioscience Program | |
dc.contributor.department | Computational Bioscience Research Center (CBRC) | |
dc.contributor.department | Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division | |
dc.contributor.department | Environmental Science and Engineering Program | |
dc.contributor.department | King Abdullah University of Science and Technology (KAUST), Pathogen Genomics Laboratory, Biological and Environmental Science and Engineering (BESE), Thuwal8 Jeddah, 23955-6900, Saudi Arabia. | |
dc.contributor.department | Pathogen Genomics Laboratory | |
dc.contributor.department | Structural Biology and Engineering | |
dc.eprint.version | Pre-print | |
dc.contributor.institution | National Virus Reference Laboratory (NVRL), School of Medicine, University College Dublin, Belfield, Dublin 4, Ireland. | |
dc.contributor.institution | Research Center for Zoonosis Control, Global Institution for Collaborative Research and Education (GI-CoRE); Hokkaido University, N20 W10 Kita-ku, Sapporo, 001-0020 Japan. | |
dc.contributor.institution | Centre de Biochimie Structurale, CNRS, INSERM, Université de Montpellier, 34090 Montpellier, France. | |
dc.contributor.institution | Nuffield Division of Clinical Laboratory Sciences (NDCLS), The John Radcliffe Hospital, University of Oxford, Headington, Oxford, OX3 9DU, United Kingdom. | |
kaust.person | Guan, Qingtian | |
kaust.person | Sadykov, Mukhtar | |
kaust.person | Nugmanova, Raushan | |
kaust.person | Arold, Stefan T. | |
kaust.person | Pain, Arnab | |
kaust.grant.number | FCC/1/1976-25-01 | |
kaust.grant.number | OCRF-2014-CRG3-2267 | |
kaust.grant.number | OSR-2015-CRG4-2610 | |
refterms.dateFOA | 2020-04-26T09:55:39Z | |
kaust.acknowledged.supportUnit | Office of Sponsored Research (OSR) |
Files in this item
This item appears in the following Collection(s)
-
Biological and Environmental Sciences and Engineering (BESE) Division
For more information visit: https://bese.kaust.edu.sa/ -
Preprints
-
Bioscience Program
For more information visit: https://bese.kaust.edu.sa/study/Pages/Bioscience.aspx -
Environmental Science and Engineering Program
For more information visit: https://bese.kaust.edu.sa/study/Pages/EnSE.aspx -
Computational Bioscience Research Center (CBRC)
-
Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
For more information visit: https://cemse.kaust.edu.sa/