Transmission networks of SARS-CoV-2 in Coastal Kenya during the first two waves: a retrospective genomic study
Abstract
Background: Detailed understanding on SARS-CoV-2 regional transmission networks within sub-Saharan Africa is key for guiding local public health interventions against the pandemic.
Methods: Here, we analysed 1,139 SARS-CoV-2 genomes from positive samples collected between March 2020 and February 2021 across six counties of Coastal Kenya (Mombasa, Kilifi, Taita Taveta, Kwale, Tana River and Lamu) to infer virus introductions and local transmission patterns during the first two waves of infections. Virus importations were inferred using ancestral state reconstruction and virus dispersal between counties were estimated using discrete phylogeographic analysis.
Results: During Wave 1, 23 distinct Pango lineages were detected across the six counties, while during Wave 2, 29 lineages were detected; nine of which occurred in both waves, and four seemed to be Kenya specific (B.1.530, B.1.549, B.1.596.1 and N.8). Most of the sequenced infections belonged to lineage B.1 (n=723, 63%) which predominated in both Wave 1 (73%, followed by lineages N.8 (6%) and B.1.1 (6%)) and Wave 2 (56%, followed by lineages B.1.549 (21%) and B.1.530 (5%). Over the study period, we estimated 280 SARS-CoV-2 virus importations into Coastal Kenya. Mombasa City, a vital tourist and commercial centre for the region, was a major route for virus imports, most of which occurred during Wave 1, when many COVID-19 government restrictions were still in force. In Wave 2, inter-county transmission predominated, resulting in the emergence of local transmission chains and diversity.
Conclusions: Our analysis supports moving COVID-19 control strategies in the region from a focus on international travel to strategies that will reduce local transmission.
Funding: This work was funded by The Wellcome (grant numbers; 220985, 203077/Z/16/Z, and 222574/Z/21/Z) and the National Institute for Health Research (NIHR), project references: 17/63/and 16/136/33 using UK aid from the UK Government to support global health research, The UK Foreign, Commonwealth and Development Office.
Data availability
1) Sequence data have been deposited in GISAID database under accession numbers provided in Supplement File 22) Source Data files have been provided for Figures 1-2 and 4-10.3) Source Code associated with the figures has been uploaded (Source Code File 1) and also been made available through Harvard Dataverse
-
Replication Data for: Genomic surveillance reveals the spread patterns of SARS-CoV-2 in coastal Kenya during the first two wavesHarvard Dataverse, V3, UNF:6:RL6Vg7q0JyS7YoCkjhHe1A== [fileUNF].
-
Genomic epidemiology of SARS-CoV-2 in coastal Kenya (March - July 2020)Github; sars-cov-2-early-phase-manuscript.
Article and author information
Author details
Funding
National Institute for Health Research (17/63/82)
- D James Nokes
National Institute for Health Research (16/136/33)
- Charles N Agoti
- Samson Kinyanjui
- George Warimwe
- D James Nokes
- George Githinji
Wellcome Trust (220985)
- D James Nokes
- George Githinji
Wellcome Trust (203077/Z/16/Z)
- Edwine Barasa
- Benjamin Tsofa
- Philip Bejon
Wellcome Trust (220977/Z/20/Z)
- My Phan
- Matthew Cotten
Medical Research Council (NC_PC_19060)
- My Phan
- Matthew Cotten
H2020 European Research Council (n{degree sign}874850)
- Simon Dellicour
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Ethics
Human subjects: Samples analysed here were collected under the Ministry of Health protocols as part of the national COVID-19 public health response. The whole genome sequencing study protocol was reviewed and approved by the Scientific and Ethics Review Committee (SERU) at Kenya Medical Research Institute (KEMRI), Nairobi, Kenya (SERU protocol #4035). Individual patient consent was not required by the committee for the use of these samples for studies of genomic epidemiology to inform public health response.
Copyright
© 2022, Agoti et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,400
- views
-
- 396
- downloads
-
- 9
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Epidemiology and Global Health
- Genetics and Genomics
Burden of stroke differs by region, which could be attributed to differences in comorbid conditions and ethnicity. Genomewide variation acts as a proxy marker for ethnicity, and comorbid conditions. We present an integrated approach to understand this variation by considering prevalence and mortality rates of stroke and its comorbid risk for 204 countries from 2009 to 2019, and Genome-wide association studies (GWAS) risk variant for all these conditions. Global and regional trend analysis of rates using linear regression, correlation, and proportion analysis, signifies ethnogeographic differences. Interestingly, the comorbid conditions that act as risk drivers for stroke differed by regions, with more of metabolic risk in America and Europe, in contrast to high systolic blood pressure in Asian and African regions. GWAS risk loci of stroke and its comorbid conditions indicate distinct population stratification for each of these conditions, signifying for population-specific risk. Unique and shared genetic risk variants for stroke, and its comorbid and followed up with ethnic-specific variation can help in determining regional risk drivers for stroke. Unique ethnic-specific risk variants and their distinct patterns of linkage disequilibrium further uncover the drivers for phenotypic variation. Therefore, identifying population- and comorbidity-specific risk variants might help in defining the threshold for risk, and aid in developing population-specific prevention strategies for stroke.
-
- Epidemiology and Global Health
- Evolutionary Biology
Several coronaviruses infect humans, with three, including the SARS-CoV2, causing diseases. While coronaviruses are especially prone to induce pandemics, we know little about their evolutionary history, host-to-host transmissions, and biogeography. One of the difficulties lies in dating the origination of the family, a particularly challenging task for RNA viruses in general. Previous cophylogenetic tests of virus-host associations, including in the Coronaviridae family, have suggested a virus-host codiversification history stretching many millions of years. Here, we establish a framework for robustly testing scenarios of ancient origination and codiversification versus recent origination and diversification by host switches. Applied to coronaviruses and their mammalian hosts, our results support a scenario of recent origination of coronaviruses in bats and diversification by host switches, with preferential host switches within mammalian orders. Hotspots of coronavirus diversity, concentrated in East Asia and Europe, are consistent with this scenario of relatively recent origination and localized host switches. Spillovers from bats to other species are rare, but have the highest probability to be towards humans than to any other mammal species, implicating humans as the evolutionary intermediate host. The high host-switching rates within orders, as well as between humans, domesticated mammals, and non-flying wild mammals, indicates the potential for rapid additional spreading of coronaviruses across the world. Our results suggest that the evolutionary history of extant mammalian coronaviruses is recent, and that cases of long-term virus–host codiversification have been largely over-estimated.