Core genes can have higher recombination rates than accessory genes within global microbial populations

  1. Asher Preska Steinberg
  2. Mingzhi Lin
  3. Edo Kussell  Is a corresponding author
  1. New York University, United States

Abstract

Recombination is essential to microbial evolution, and is involved in the spread of antibiotic resistance, antigenic variation, and adaptation to the host niche. However, assessing the impact of homologous recombination on accessory genes which are only present in a subset of strains of a given species remains challenging due to their complex phylogenetic relationships. Quantifying homologous recombination for accessory genes (which are important for niche-specific adaptations) in comparison to core genes (which are present in all strains and have essential functions) is critical to understanding how selection acts on variation to shape species diversity and genome structures of bacteria. Here, we apply a computationally efficient, non-phylogenetic approach to measure homologous recombination rates in the core and accessory genome using >100,000 whole genome sequences from Streptococcus pneumoniae and several additional species. By analyzing diverse sets of sequence clusters, we show that core genes often have higher recombination rates than accessory genes, and for some bacterial species the associated effect sizes for these differences are pronounced. In a subset of species, we find that gene frequency and homologous recombination rate are positively correlated. For S. pneumoniae and several additional species, we find that while the recombination rate is higher for the core genome, the mutational divergence is lower, indicating that divergence-based homologous recombination barriers could contribute to differences in recombination rates between the core and accessory genome. Homologous recombination may therefore play a key role in increasing the efficiency of selection in the most conserved parts of the genome.

Data availability

Lists of SRA accession numbers corresponding to the raw reads used to build the multi-sequence alignments analyzed in this manuscript are included as Figure 2 - source data 1 and Figure 3 - source data 1. All SRA files, reference genomes, and complete genome assemblies are available through NCBI. All sequence collections used are listed in Supplementary File 5. For the PubMLST sequence collections, PubMLST was used to identify whole genome sequences (by filtering for strains in the 'Genome Collection' of each species where the sequence length is at least that of the reference genome), then the raw reads were downloaded from NCBI using their SRA numbers. Accession numbers for reference genomes used for each microbial species are also listed in Supplementary File 5.All original code has been deposited at GitHub and is publicly available. Links are given below:- https://github.com/kussell-lab/mcorr- https://github.com/kussell-lab/mcorr-clustering- https://github.com/kussell-lab/ReferenceAlignmentGenerator- https://github.com/kussell-lab/PangenomeAlignmentGenerator

The following previously published data sets were used

Article and author information

Author details

  1. Asher Preska Steinberg

    Department of Biology, New York University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Mingzhi Lin

    Department of Biology, New York University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Edo Kussell

    Department of Biology, New York University, New York, United States
    For correspondence
    edo.kussell@nyu.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-0590-4036

Funding

National Institutes of Health (R01-GM097356)

  • Edo Kussell

Simons Foundation (Simons Foundation Awardee of the Life Sciences Research Foundation)

  • Asher Preska Steinberg

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Paul B Rainey, Max Planck Institute for Evolutionary Biology, Germany

Version history

  1. Preprint posted: September 14, 2021 (view preprint)
  2. Received: March 10, 2022
  3. Accepted: June 30, 2022
  4. Accepted Manuscript published: July 8, 2022 (version 1)
  5. Version of Record published: September 5, 2022 (version 2)

Copyright

© 2022, Preska Steinberg et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 3,644
    views
  • 518
    downloads
  • 15
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Asher Preska Steinberg
  2. Mingzhi Lin
  3. Edo Kussell
(2022)
Core genes can have higher recombination rates than accessory genes within global microbial populations
eLife 11:e78533.
https://doi.org/10.7554/eLife.78533

Share this article

https://doi.org/10.7554/eLife.78533

Further reading

    1. Epidemiology and Global Health
    2. Microbiology and Infectious Disease
    Patrick E Brown, Sze Hang Fu ... Ab-C Study Collaborators
    Research Article Updated

    Background:

    Few national-level studies have evaluated the impact of ‘hybrid’ immunity (vaccination coupled with recovery from infection) from the Omicron variants of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).

    Methods:

    From May 2020 to December 2022, we conducted serial assessments (each of ~4000–9000 adults) examining SARS-CoV-2 antibodies within a mostly representative Canadian cohort drawn from a national online polling platform. Adults, most of whom were vaccinated, reported viral test-confirmed infections and mailed self-collected dried blood spots (DBSs) to a central lab. Samples underwent highly sensitive and specific antibody assays to spike and nucleocapsid protein antigens, the latter triggered only by infection. We estimated cumulative SARS-CoV-2 incidence prior to the Omicron period and during the BA.1/1.1 and BA.2/5 waves. We assessed changes in antibody levels and in age-specific active immunity levels.

    Results:

    Spike levels were higher in infected than in uninfected adults, regardless of vaccination doses. Among adults vaccinated at least thrice and infected more than 6 months earlier, spike levels fell notably and continuously for the 9-month post-vaccination. In contrast, among adults infected within 6 months, spike levels declined gradually. Declines were similar by sex, age group, and ethnicity. Recent vaccination attenuated declines in spike levels from older infections. In a convenience sample, spike antibody and cellular responses were correlated. Near the end of 2022, about 35% of adults above age 60 had their last vaccine dose more than 6 months ago, and about 25% remained uninfected. The cumulative incidence of SARS-CoV-2 infection rose from 13% (95% confidence interval 11–14%) before omicron to 78% (76–80%) by December 2022, equating to 25 million infected adults cumulatively. However, the coronavirus disease 2019 (COVID-19) weekly death rate during the BA.2/5 waves was less than half of that during the BA.1/1.1 wave, implying a protective role for hybrid immunity.

    Conclusions:

    Strategies to maintain population-level hybrid immunity require up-to-date vaccination coverage, including among those recovering from infection. Population-based, self-collected DBSs are a practicable biological surveillance platform.

    Funding:

    Funding was provided by the COVID-19 Immunity Task Force, Canadian Institutes of Health Research, Pfizer Global Medical Grants, and St. Michael’s Hospital Foundation. PJ and ACG are funded by the Canada Research Chairs Program.

    1. Microbiology and Infectious Disease
    Alejandro Prieto, Luïsa Miró ... Antonio Juarez
    Research Article

    Antimicrobial resistance (AMR) poses a significant threat to human health. Although vaccines have been developed to combat AMR, it has proven challenging to associate specific vaccine antigens with AMR. Bacterial plasmids play a crucial role in the transmission of AMR. Our recent research has identified a group of bacterial plasmids (specifically, IncHI plasmids) that encode large molecular mass proteins containing bacterial immunoglobulin-like domains. These proteins are found on the external surface of the bacterial cells, such as in the flagella or conjugative pili. In this study, we show that these proteins are antigenic and can protect mice from infection caused by an AMR Salmonella strain harboring one of these plasmids. Furthermore, we successfully generated nanobodies targeting these proteins, that were shown to interfere with the conjugative transfer of IncHI plasmids. Considering that these proteins are also encoded in other groups of plasmids, such as IncA/C and IncP2, targeting them could be a valuable strategy in combating AMR infections caused by bacteria harboring different groups of AMR plasmids. Since the selected antigens are directly linked to AMR itself, the protective effect extends beyond specific microorganisms to include all those carrying the corresponding resistance plasmids.