Articles Service
Minireview
An Overview of the Genetic Variations of the SARS-CoV-2 Genomes Isolated in Southeast Asian Countries
1Department of Medical Microbiology, Faculty of Medicine, University of Malaya, 50603 Kuala Lumpur, Malaysia 2Department of Anaesthesiology and Intensive Care, UKM Medical Centre, 56000 Kuala Lumpur, Malaysia 3Department of Medicine, Faculty of Medicine, University of Malaya, 50603 Kuala Lumpur, Malaysia
Correspondence to:J. Microbiol. Biotechnol. 2020; 30(7): 962-966
Published July 28, 2020 https://doi.org/10.4014/jmb.2006.06009
Copyright © The Korean Society for Microbiology and Biotechnology.
Abstract
Keywords
Graphical Abstract
The Study
Human mobility (in parentheses) is one of the main factors that contribute to the worldwide dissemination of microorganisms. The spread of coronavirus disease 2019 (COVID-19) was recently reported to transmit to neighboring countries with relocation diffusion [1]. With most of the studies focusing on China, Western Europe and the USA, little is known about its evolution and genome variability in Southeast Asian (SEA) countries. SEA is home to more than half a billion or 9% of the world’s population. As the region grapples with a surge in infection cases since March 2020 [2], it is important to investigate purported mutations and the role of geographical proximity in shaping the genetic structure of the SARS-CoV-2 in SEA countries. On March 4, 2020, the World Health Organisation (WHO) outlined that only nine of the eleven countries have the capacity to test for COVID- 19 [3], suggesting that the lack of testing facilities could hinder the preparedness and response planning of these countries towards COVID-19.
Among the SEA countries, Malaysia, Thailand and Singapore employ a large number of migrant workers, with Malaysia being the top importer with approximately 2.23 million people [4]. Concurrently, there has been a mass exodus of Malaysians seeking greater economic security in Singapore, with approximately 450,000 people crossing the Malaysia-Singapore border daily [5]. The Indonesia authorities reported that more than 64,000 Indonesian migrant workers had returned from Malaysia amid the country’s ongoing lockdown [6]. Another type of human mobility is refugees. The political instability which holds sway in Myanmar has forced 10% of the population to emigrate in search of refuge [7]. More boats carrying Rohingya refugees were spotted off the coasts of Malaysia and southern Thailand, adding to the challenges faced by these countries fighting the pandemic outbreak [8]. This trend of massive internal mobility is expected to continue as the countries ease the lockdown in the foreseeable future, continuously shifting the genetic drift of the viral population. Studies have shown that human migration (gene flow) is a remarkable factor to consider in virus evolution [9]. Hence, characterization of the genetic variability of viral populations provides important insights in virus evolution and epidemiology for devising efficient and reliable infection control strategies. As of April 30, 2020, only 142 complete sequences (plural) of SARS-CoV-2 from six of the SEA countries, including Cambodia (
Overall, all strains isolated before national implementation of border control were largely invariant (
-
Fig. 1.
Phylogenetic tree inferred by using neighbor-joining method and Tamura-Nei model with 1,000 bootstrap replicates, representing complete SARS-CoV-2 genomes from SEA countries against the reference genome Wuhan-Hu-1 (*). Clusters identified were colored accordingly: Cluster I (blue), Cluster II (red) and Cluster III (green). Isolates with symbol (▲) denotes samples collected after country border control implementation whereas isolates without symbol denotes samples collected before country border control implementation.
-
Fig. 2.
Heatmap showing the presence of marker variants of Clades S, L, V, G, GH and GR named by GISAID [12] in SEA SARS-CoV-2 genomes. Grey denotes the presence and empty space denotes the absence of the genes listed. Isolates with symbol ( ▲ ) denotes samples collected after country border control implementation whereas isolates without symbol denotes samples collected before country border control implementation.
The current studied genomes showed phylogenetic relation with common recurrent mutations. Cluster I exhibited common recurrent mutation at 8782C>T in ORF1ab (
Cluster II was distinguished by not only the spike mutation D614G (23403A>G; G clade), but also at 241C>T, 3037C>T, and 14408C>T (
The evolution and transmission of the SARS-CoV-2 is potentially affected by distinctive travel histories, founder events, host characteristics as well as geographical and climate factors. It is prudent to consider the possibility that mutational variants might influence the virus spread and subsequently the clinical presentation and outcome. Therefore, the described core mutations and phylogenetic classification in this work may provide information regarding outbreak control as well as evaluating the clinical and epidemiological outcomes of SARS- CoV-2 infection.
Methodology
A phylogenetic tree was constructed using 142 complete genome sequences from the South East Asia countries plus the reference genome Wuhan-Hu-1 (MN908947.3). The SARS-CoV-2 genomes were obtained from the GISAID (https://www.gisaid.org/), as of April 30, 2020. Genome sequence alignment was performed using Multiple Sequence Comparison by Log-Expectation (MUSCLE) and the phylogenetic tree was inferred by using the neighbor-joining method and Tamura-Nei model with 1,000 bootstrap replicates in Geneious software (version 2020.1; Biomatters Ltd.). The sequences were also aligned to the reference genome. Sites were masked in the first 130bp and last 50 bp, as were other ambiguous positions following the protocol advocated by van Dorp
Data Availability
The genome sequences of the SARS-CoV-2 used in this study are available from the GISAID database (https://www.gisaid.org/), upon registration.
Supplemental Material
Acknowledgment
We gratefully acknowledge the authors from originating and submitting laboratories of sequence data on which the analysis is based.
Conflict of Interest
The authors have no financial conflicts of interest to declare.
References
- Sirkeci I, Yüceşahin MM. 2020. Coronavirus and migration: Analysis of human mobility and the spread of COVID-19.
Migrat. Lett. 17 : 379-398. - Southeast Asia Covid-19 Tracker, 2020. Available from: https://www.csis.org/programs/southeast-asia-program/southeast-asia-covid-19-tracker-0. Accesed May 12, 2020.
- World Health Organisation. WHO emphasizes on agile response capacities, as South-East Asia Region confirms more COVID-19 cases, 2020. Available from: https://www.who.int/southeastasia/news/detail/04-03-2020-who-emphasizes-on-agile-response- capacities-as-south-east-asia-region-confirms-more-covid-19-cases. Accessed May 18, 2020.
- TRIANGLE in ASEAN International Labour Organisation. Available from: https://www.ilo.org/asia/projects/WCMS_428584/lang--en/index.htm. Accessed May 20, 2020.
- Today Online. Amid cross border tensions, Malaysians working in Singapore hope rice bowls unaffected, 2018. Available from: https://www.todayonline.com/world/amid-cross-border-tensions-malaysians-working-singapore-hope-rice-bowls-unaffected. Accessed May 20, 2020.
- The Jakarta Post. High influx of Indonesian migrant workers from Malaysia, despite call to stay amid outbreak, 2020. Available from: https://www.thejakartapost.com/news/2020/04/23/high-influx-of-indonesian-migrant-workers-from-malaysia-despite-call-to- stay-amid-outbreak.html. Accessed 23 April, 2020.
- The Center for Migration Studies. The Stateless Rohingya in Thailand. Available from: https://cmsny.org/the-stateless-rohingya-in-thailand/. Accessed May 20, 2020.
- The Guardian. Malaysia cites Covid-19 for rounding up hundreds of migrants, 2020. Available from: https://www.theguardian.com/global-development/2020/may/02/malaysia-cites-covid-19-for-rounding-up-hundreds-of-migrants. Accessed May 2, 2020.
- Moya A, Holmes EC, Gonzalez-Candelas F. 2004. The population genetics and evolutionary epidemiology of RNA viruses.
Nat. Rev. Microbiol. 2 : 279-288. - Chong YM, Sam IC, Ponnampalavanar S, Syed Omar SF, Kamarulzaman A, Munusamy V,
et al . 2020. Complete Genome Sequences of SARS-CoV-2 Strains Detected in Malaysia.Microbiol. Resour. Announc. 9 : e00383-20. - van Dorp L, Acman M, Richard D, Shaw LP, Ford CE, Ormond L,
et al . 2020. Emergence of genomic diversity and recurrent mutations in SARS-CoV-2.Infect. Genet. Ecol. 83 : 104351. - GISAID. GISAID_hCoV-19_Analysis_Update_2020-06-12_1900UTC, 2020. Available from: https://www.gisaid.org/. Accessed June 16, 2020.
- Khailany RA, Safdar M, Ozaslan M. 2020. Genomic characterization of a novel SARS-CoV-2.
Gene Rep. 19 : 100682. - Wang C, Liu Z, Chen Z, Huang X, Xu M, He T,
et al . 2020. The establishment of reference sequence for SARS-CoV-2 and variation analysis.J. Med. Virol. 92 : 667-674. - Lau SK, Woo PC, Li KS, Huang Y, Tsoi HW, Wong BH,
et al . 2005. Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats.Proc. Natl. Acad. Sci. USA 102 : 14040-14045. - Kim JI, Kim YJ, Lemey P, Lee I, Park S, Bae JY,
et al . 2016. The recent ancestry of Middle East respiratory syndrome coronavirus in Korea has been shaped by recombination.Sci. Rep. 6 : 18825. - Forster P, Forster L, Renfrew C, Forster M. 2020. Phylogenetic network analysis of SARS-CoV-2 genomes.
Proc. Natl. Acad. Sci. USA 117 : 9241-9243. - Muth D, Corman VM, Roth H, Binger T, Dijkman R, Gottula LT,
et al . 2018. Attenuation of replication by a 29 nucleotide deletion in SARS-coronavirus acquired during the early stages of human-to-human transmission.Sci. Rep. 8 : 15177. - Lau SK, Feng Y, Chen H, Luk HK, Yang WH, Li KS,
et al . 2015. Severe Acute Respiratory Syndrome (SARS) coronavirus ORF8 protein is acquired from SARS-Related coronavirus from greater horseshoe bats through recombination.J. Virol. 9 : 10532-10547. - The Star. New Covid-19 cluster emerges from students returning from Indonesia, says Health DG, 2020. Available from: https://www.thestar.com.my/news/nation/2020/04/19/new-covid-19-cluster-emerges-from-students-returning-from-indonesia-says-health-dg. Accessed April 19, 2020.
- Du L, He Y, Zhou Y, Liu S, Zheng BJ, Jiang S. 2009. The spike protein of SARS-CoV--a target for vaccine and therapeutic development.
Nat. Rev. Microbiol. 7 : 226-236. - Choi Y, Chan AP. 2015. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels.
Bioinformatics 31 : 2745-2747. - Jin X, Lian JS, Hu JH, Gao J, Zheng L, Zhang YM,
et al . 2020. Epidemiological, clinical and virological characteristics of 74 cases of coronavirus-infected disease 2019 (COVID-19) with gastrointestinal symptoms.Gut 69 : 1002-1009. - Gire SK, Goba A, Andersen KG, Sealfon RS, Park DJ, Kanneh L,
et al . 2014. Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak.Science 345 : 1369-1372. - Holmes EC, Dudas G, Rambaut A, Andersen KG. 2016. The evolution of Ebola virus: Insights from the 2013-2016 epidemic.
Nature 538 : 193-200.
Related articles in JMB
Article
Minireview
J. Microbiol. Biotechnol. 2020; 30(7): 962-966
Published online July 28, 2020 https://doi.org/10.4014/jmb.2006.06009
Copyright © The Korean Society for Microbiology and Biotechnology.
An Overview of the Genetic Variations of the SARS-CoV-2 Genomes Isolated in Southeast Asian Countries
Polly Soo Xi Yap1,*, Tse Siang Tan2, Yoke Fun Chan1, Kok Keng Tee1, Adeeba Kamarulzaman3, Cindy Shuan Ju Teh1,*
1Department of Medical Microbiology, Faculty of Medicine, University of Malaya, 50603 Kuala Lumpur, Malaysia 2Department of Anaesthesiology and Intensive Care, UKM Medical Centre, 56000 Kuala Lumpur, Malaysia 3Department of Medicine, Faculty of Medicine, University of Malaya, 50603 Kuala Lumpur, Malaysia
Correspondence to:Cindy Shuan Ju Teh,
E-mail : cindysjteh@um.edu.my
Abstract
Monitoring the mutation dynamics of human severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is critical in understanding its infectivity, virulence and pathogenicity for development of a vaccine. In an “age of mobility,” the pandemic highlights the importance and vulnerability of regionalization and labor market interdependence in Southeast Asia. We intend to characterize the genetic variability of viral populations within the region to provide preliminary information for regional surveillance in the future. By analyzing 142 complete genomes from South East Asian (SEA) countries, we identified three central variants distinguished by nucleotide and amino acid changes.
Keywords: Coronavirus, SARS-CoV-2, genetic variation, COVID-19, mutation, Southeast Asia
The Study
Human mobility (in parentheses) is one of the main factors that contribute to the worldwide dissemination of microorganisms. The spread of coronavirus disease 2019 (COVID-19) was recently reported to transmit to neighboring countries with relocation diffusion [1]. With most of the studies focusing on China, Western Europe and the USA, little is known about its evolution and genome variability in Southeast Asian (SEA) countries. SEA is home to more than half a billion or 9% of the world’s population. As the region grapples with a surge in infection cases since March 2020 [2], it is important to investigate purported mutations and the role of geographical proximity in shaping the genetic structure of the SARS-CoV-2 in SEA countries. On March 4, 2020, the World Health Organisation (WHO) outlined that only nine of the eleven countries have the capacity to test for COVID- 19 [3], suggesting that the lack of testing facilities could hinder the preparedness and response planning of these countries towards COVID-19.
Among the SEA countries, Malaysia, Thailand and Singapore employ a large number of migrant workers, with Malaysia being the top importer with approximately 2.23 million people [4]. Concurrently, there has been a mass exodus of Malaysians seeking greater economic security in Singapore, with approximately 450,000 people crossing the Malaysia-Singapore border daily [5]. The Indonesia authorities reported that more than 64,000 Indonesian migrant workers had returned from Malaysia amid the country’s ongoing lockdown [6]. Another type of human mobility is refugees. The political instability which holds sway in Myanmar has forced 10% of the population to emigrate in search of refuge [7]. More boats carrying Rohingya refugees were spotted off the coasts of Malaysia and southern Thailand, adding to the challenges faced by these countries fighting the pandemic outbreak [8]. This trend of massive internal mobility is expected to continue as the countries ease the lockdown in the foreseeable future, continuously shifting the genetic drift of the viral population. Studies have shown that human migration (gene flow) is a remarkable factor to consider in virus evolution [9]. Hence, characterization of the genetic variability of viral populations provides important insights in virus evolution and epidemiology for devising efficient and reliable infection control strategies. As of April 30, 2020, only 142 complete sequences (plural) of SARS-CoV-2 from six of the SEA countries, including Cambodia (
Overall, all strains isolated before national implementation of border control were largely invariant (
-
Figure 1.
Phylogenetic tree inferred by using neighbor-joining method and Tamura-Nei model with 1,000 bootstrap replicates, representing complete SARS-CoV-2 genomes from SEA countries against the reference genome Wuhan-Hu-1 (*). Clusters identified were colored accordingly: Cluster I (blue), Cluster II (red) and Cluster III (green). Isolates with symbol (▲) denotes samples collected after country border control implementation whereas isolates without symbol denotes samples collected before country border control implementation.
-
Figure 2.
Heatmap showing the presence of marker variants of Clades S, L, V, G, GH and GR named by GISAID [12] in SEA SARS-CoV-2 genomes. Grey denotes the presence and empty space denotes the absence of the genes listed. Isolates with symbol ( ▲ ) denotes samples collected after country border control implementation whereas isolates without symbol denotes samples collected before country border control implementation.
The current studied genomes showed phylogenetic relation with common recurrent mutations. Cluster I exhibited common recurrent mutation at 8782C>T in ORF1ab (
Cluster II was distinguished by not only the spike mutation D614G (23403A>G; G clade), but also at 241C>T, 3037C>T, and 14408C>T (
The evolution and transmission of the SARS-CoV-2 is potentially affected by distinctive travel histories, founder events, host characteristics as well as geographical and climate factors. It is prudent to consider the possibility that mutational variants might influence the virus spread and subsequently the clinical presentation and outcome. Therefore, the described core mutations and phylogenetic classification in this work may provide information regarding outbreak control as well as evaluating the clinical and epidemiological outcomes of SARS- CoV-2 infection.
Methodology
A phylogenetic tree was constructed using 142 complete genome sequences from the South East Asia countries plus the reference genome Wuhan-Hu-1 (MN908947.3). The SARS-CoV-2 genomes were obtained from the GISAID (https://www.gisaid.org/), as of April 30, 2020. Genome sequence alignment was performed using Multiple Sequence Comparison by Log-Expectation (MUSCLE) and the phylogenetic tree was inferred by using the neighbor-joining method and Tamura-Nei model with 1,000 bootstrap replicates in Geneious software (version 2020.1; Biomatters Ltd.). The sequences were also aligned to the reference genome. Sites were masked in the first 130bp and last 50 bp, as were other ambiguous positions following the protocol advocated by van Dorp
Data Availability
The genome sequences of the SARS-CoV-2 used in this study are available from the GISAID database (https://www.gisaid.org/), upon registration.
Supplemental Material
Acknowledgment
We gratefully acknowledge the authors from originating and submitting laboratories of sequence data on which the analysis is based.
Conflict of Interest
The authors have no financial conflicts of interest to declare.
Fig 1.
Fig 2.
References
- Sirkeci I, Yüceşahin MM. 2020. Coronavirus and migration: Analysis of human mobility and the spread of COVID-19.
Migrat. Lett. 17 : 379-398. - Southeast Asia Covid-19 Tracker, 2020. Available from: https://www.csis.org/programs/southeast-asia-program/southeast-asia-covid-19-tracker-0. Accesed May 12, 2020.
- World Health Organisation. WHO emphasizes on agile response capacities, as South-East Asia Region confirms more COVID-19 cases, 2020. Available from: https://www.who.int/southeastasia/news/detail/04-03-2020-who-emphasizes-on-agile-response- capacities-as-south-east-asia-region-confirms-more-covid-19-cases. Accessed May 18, 2020.
- TRIANGLE in ASEAN International Labour Organisation. Available from: https://www.ilo.org/asia/projects/WCMS_428584/lang--en/index.htm. Accessed May 20, 2020.
- Today Online. Amid cross border tensions, Malaysians working in Singapore hope rice bowls unaffected, 2018. Available from: https://www.todayonline.com/world/amid-cross-border-tensions-malaysians-working-singapore-hope-rice-bowls-unaffected. Accessed May 20, 2020.
- The Jakarta Post. High influx of Indonesian migrant workers from Malaysia, despite call to stay amid outbreak, 2020. Available from: https://www.thejakartapost.com/news/2020/04/23/high-influx-of-indonesian-migrant-workers-from-malaysia-despite-call-to- stay-amid-outbreak.html. Accessed 23 April, 2020.
- The Center for Migration Studies. The Stateless Rohingya in Thailand. Available from: https://cmsny.org/the-stateless-rohingya-in-thailand/. Accessed May 20, 2020.
- The Guardian. Malaysia cites Covid-19 for rounding up hundreds of migrants, 2020. Available from: https://www.theguardian.com/global-development/2020/may/02/malaysia-cites-covid-19-for-rounding-up-hundreds-of-migrants. Accessed May 2, 2020.
- Moya A, Holmes EC, Gonzalez-Candelas F. 2004. The population genetics and evolutionary epidemiology of RNA viruses.
Nat. Rev. Microbiol. 2 : 279-288. - Chong YM, Sam IC, Ponnampalavanar S, Syed Omar SF, Kamarulzaman A, Munusamy V,
et al . 2020. Complete Genome Sequences of SARS-CoV-2 Strains Detected in Malaysia.Microbiol. Resour. Announc. 9 : e00383-20. - van Dorp L, Acman M, Richard D, Shaw LP, Ford CE, Ormond L,
et al . 2020. Emergence of genomic diversity and recurrent mutations in SARS-CoV-2.Infect. Genet. Ecol. 83 : 104351. - GISAID. GISAID_hCoV-19_Analysis_Update_2020-06-12_1900UTC, 2020. Available from: https://www.gisaid.org/. Accessed June 16, 2020.
- Khailany RA, Safdar M, Ozaslan M. 2020. Genomic characterization of a novel SARS-CoV-2.
Gene Rep. 19 : 100682. - Wang C, Liu Z, Chen Z, Huang X, Xu M, He T,
et al . 2020. The establishment of reference sequence for SARS-CoV-2 and variation analysis.J. Med. Virol. 92 : 667-674. - Lau SK, Woo PC, Li KS, Huang Y, Tsoi HW, Wong BH,
et al . 2005. Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats.Proc. Natl. Acad. Sci. USA 102 : 14040-14045. - Kim JI, Kim YJ, Lemey P, Lee I, Park S, Bae JY,
et al . 2016. The recent ancestry of Middle East respiratory syndrome coronavirus in Korea has been shaped by recombination.Sci. Rep. 6 : 18825. - Forster P, Forster L, Renfrew C, Forster M. 2020. Phylogenetic network analysis of SARS-CoV-2 genomes.
Proc. Natl. Acad. Sci. USA 117 : 9241-9243. - Muth D, Corman VM, Roth H, Binger T, Dijkman R, Gottula LT,
et al . 2018. Attenuation of replication by a 29 nucleotide deletion in SARS-coronavirus acquired during the early stages of human-to-human transmission.Sci. Rep. 8 : 15177. - Lau SK, Feng Y, Chen H, Luk HK, Yang WH, Li KS,
et al . 2015. Severe Acute Respiratory Syndrome (SARS) coronavirus ORF8 protein is acquired from SARS-Related coronavirus from greater horseshoe bats through recombination.J. Virol. 9 : 10532-10547. - The Star. New Covid-19 cluster emerges from students returning from Indonesia, says Health DG, 2020. Available from: https://www.thestar.com.my/news/nation/2020/04/19/new-covid-19-cluster-emerges-from-students-returning-from-indonesia-says-health-dg. Accessed April 19, 2020.
- Du L, He Y, Zhou Y, Liu S, Zheng BJ, Jiang S. 2009. The spike protein of SARS-CoV--a target for vaccine and therapeutic development.
Nat. Rev. Microbiol. 7 : 226-236. - Choi Y, Chan AP. 2015. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels.
Bioinformatics 31 : 2745-2747. - Jin X, Lian JS, Hu JH, Gao J, Zheng L, Zhang YM,
et al . 2020. Epidemiological, clinical and virological characteristics of 74 cases of coronavirus-infected disease 2019 (COVID-19) with gastrointestinal symptoms.Gut 69 : 1002-1009. - Gire SK, Goba A, Andersen KG, Sealfon RS, Park DJ, Kanneh L,
et al . 2014. Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak.Science 345 : 1369-1372. - Holmes EC, Dudas G, Rambaut A, Andersen KG. 2016. The evolution of Ebola virus: Insights from the 2013-2016 epidemic.
Nature 538 : 193-200.