Genomics: The early days of late blight

  1. Paul RJ Birch  Is a corresponding author
  2. David EL Cooke  Is a corresponding author
  1. University of Dundee, United Kingdom
  2. James Hutton Institute, United Kingdom

There can be few diseases of plants that have had a more profound effect on human health, society and politics than potato late blight. Emerging in the mid-1840s, the disease rapidly became pandemic, precipitating the Great Famine, which led to almost a million deaths in Ireland and the emigration of many more, plus considerable loss of life and extreme political upheaval throughout mainland Europe (Zadoks, 2008).

The water mould Phytophthora infestans was first identified as the probable cause of late blight by Miles Joseph Berkeley in 1846, almost 20 years before Louis Pasteur formally proposed the germ theory of disease. As such, it has been credited with establishing the field of plant pathology, and is a major milestone in the foundation of epidemiology as a scientific discipline. Now, more than 160 years later, Hernán Burbano of the Max Planck Institute for Developmental Biology and co-workers have combined the power of next-generation sequencing technology with the availability of herbarium samples to peek through a 50-year window of history at the genome of the pathogen that contributed to so much human suffering (Yoshida et al., 2013).

Work in the 1990s by researchers at Cornell University revealed a dominant lineage of P. infestans, dubbed US-1, which was identified in 13 countries on four continents. This lineage was defined by nuclear DNA markers and, crucially, a mitochondrial haplotype termed Ib, and the Cornell team proposed that it had emerged from Mexico in the 1840s to cause severe crop losses on a global scale (Goodwin et al., 1994). However, this idea was challenged when researchers at North Carolina State University identified a different type of mitochondrial DNA in herbarium samples of potato late blight collected between 1845 and 1886 (Ristaino et al., 2001).

Now, Burbano and co-workers—including Kentaro Yoshida of the Sainsbury Laboratory and Verena Schuenemann of the University of Tübingen as joint first authors, plus colleagues at other centres in Germany and the US—have sequenced the genome of P. infestans from 11 infected leaf samples that were gathered in Ireland, Great Britain, Germany and Canada between 1845 and 1896 (Yoshida et al., 2013). They found that the pathogen strains in all samples were representatives of a single lineage, which they have called HERB-1. This strain of P. infestans was responsible for the first global migration of the pathogen in the 1840s, and then persisted for at least 50 years. It has not been observed in modern times, but is clearly a close relative of US-1. Analysis of both mitochondrial and nuclear DNA showed HERB-1 and US-1 to be sister groups in a clade distinct from contemporary genotypes.

Mexico is the foremost centre of P. infestans diversity in the world, but Yoshida, Schuenemann et al. propose that HERB-1 and US-1 emerged from a metapopulation that was established outside of Mexico in the early 19th century and that US-1 displaced HERB-1 at some point in the 20th century (Figure 1). The new work also emphasises the threats associated with the movement of pathogens between continents, and highlights the occurrence of waves of pathogen migration and genotype displacement. Indeed, dramatic shifts in P. infestans populations are still being observed today with, for example, the aggressive genotype 13_A2 displacing other genotypes across Europe over a period of less than three years (Cooke et al., 2012), and the rapid emergence of genotype US-22 in the eastern USA in 2009 (Fry et al., 2013). Moreover, it should be possible to use the newly obtained DNA sequences to develop markers that allow 20th century samples to be tested for the presence of HERB-1. It will be interesting to figure out what happened between 1900 and the 1970s, when new populations began to displace US-1 in Europe (Figure 1).

Schematic representation of the time-line for the emergence of major lineages of Phytophthora infestans, the pathogen that causes late blight in potatoes, as proposed by Burbano and co-workers (Yoshida et al., 2013). P. infestans and P. mirabilis (a pathogen that does not infect potatoes) are thought to have diverged from a common ancestor about 1200 years ago. Since then there have been at least three major migrations of P. infestans from a sexually reproducing population in Mexico (shown by black triangles). The first involves a metapopulation established in an unknown region, from which HERB-1 and US-1 emerged and spread around the world. US-1 is rarely found in contemporary populations (as indicated by the dotted black line) and HERB-1 is not found anywhere today (red cross). A shipment of potatoes from Mexico into Europe during a drought in 1976 is thought to be the source of the population of P. infestans that displaced US-1 in Europe and resulted in the emergence of aggressive new clones such as 13_A2 (also known as Blue13). Lastly, independent migrations into the US resulted in the dominance of the US-8 lineage, although this is now being displaced by other lineages such as US-22. The precise timing of the displacement of HERB-1 by US-1 remains undefined.

Although these latest results provide new insights into the pathogen responsible for famines in the 19th century, questions remain about the precise cause and timing of subsequent waves of new lineages. Yoshida et al. found no evidence for the positive selection of particular genes that might have driven displacement of HERB-1 by US-1. They hypothesised that cross breeding commercial potatoes with Solanum demissum, a wild species of potato that is resistant to some lineages of P. infestans, in Europe in the 1930s-1950s may have driven selection against HERB-1 and, by inference, favoured the US-1 strain that displaced it. This is an interesting theory, and there is a clear example of a widely grown commercial cultivar (the Pentland Dell) that contained resistance genes from S. demissum that were subsequently ‘broken’ by a changing pathogen population (Malcolmson, 1969; Fry, 2008). However, more data are needed to test this hypothesis.

Yoshida et al. propose that US-1 and HERB-1 come from a single metapopulation that was itself derived from Mexican populations that were likely to have been exposed to S. demissum. Should we expect to see a difference in the response of the different lineages of P. infestans to this form of resistance? Understanding the co-evolution of hundreds of virulence genes in the pathogen (Haas et al., 2009) with potentially similar numbers of resistance genes in potato (Jupe et al., 2012) is challenging, and datasets such as this should help to answer this question.

HERB-1 existed as a clonal (asexual) lineage for at least 50 years with a generation time as short as a few days during each cropping season. The long-term evolutionary consequences of such clonality are that deleterious mutations can accumulate. The decline in fitness caused by these mutations is an example of Muller’s ratchet in action (Goodwin, 1997) and might explain why lineages emerging from the sexually reproducing metapopulation described by Yoshida et al. displaced HERB-1. Yoshida et al. provide further support for such a scenario by showing that HERB-1 was diploid but US-1 and contemporary clonal lineages often have increased ploidy. It could be that such increased genetic diversity, providing a buffering capacity against deleterious mutations, offered subsequent lineages an evolutionary advantage over HERB-1.

Emerging diseases present serious threats to human health and to the health of livestock, crops and natural ecosystems. We have seen an increase in emerging diseases in the past two decades, likely due to increased trade, transport and travel, and perhaps due also to changes in climate (Fisher et al., 2012). By providing significant insights into the make-up of one of the most notorious emerging diseases in history, the work of Burbano, Yoshida, Schuenemann and colleagues offers us a valuable opportunity to learn from the past.


Article and author information

Author details

  1. Paul RJ Birch

    Division of Plant Sciences, University of Dundee, Dundee, United Kingdom
    For correspondence
    Competing interests
    The authors declare that no competing interests exist.
  2. David EL Cooke

    James Hutton Institute, Dundee, United Kingdom
    For correspondence
    Competing interests
    The authors declare that no competing interests exist.

Publication history

  1. Version of Record published: June 18, 2013 (version 1)


© 2013, Birch and Cooke

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.


  • 850
  • 108
  • 9

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Paul RJ Birch
  2. David EL Cooke
Genomics: The early days of late blight
eLife 2:e00954.
  1. Further reading

Further reading

  1. Genome sequencing reveals the strain of P. infestans that caused the Great Famine in the late 1840s.

    1. Evolutionary Biology
    2. Genetics and Genomics
    Brian PH Metzger, Yeonwoo Park ... Joseph W Thornton
    Research Article

    A protein’s genetic architecture – the set of causal rules by which its sequence produces its functions – also determines its possible evolutionary trajectories. Prior research has proposed that the genetic architecture of proteins is very complex, with pervasive epistatic interactions that constrain evolution and make function difficult to predict from sequence. Most of this work has analyzed only the direct paths between two proteins of interest – excluding the vast majority of possible genotypes and evolutionary trajectories – and has considered only a single protein function, leaving unaddressed the genetic architecture of functional specificity and its impact on the evolution of new functions. Here, we develop a new method based on ordinal logistic regression to directly characterize the global genetic determinants of multiple protein functions from 20-state combinatorial deep mutational scanning (DMS) experiments. We use it to dissect the genetic architecture and evolution of a transcription factor’s specificity for DNA, using data from a combinatorial DMS of an ancient steroid hormone receptor’s capacity to activate transcription from two biologically relevant DNA elements. We show that the genetic architecture of DNA recognition consists of a dense set of main and pairwise effects that involve virtually every possible amino acid state in the protein-DNA interface, but higher-order epistasis plays only a tiny role. Pairwise interactions enlarge the set of functional sequences and are the primary determinants of specificity for different DNA elements. They also massively expand the number of opportunities for single-residue mutations to switch specificity from one DNA target to another. By bringing variants with different functions close together in sequence space, pairwise epistasis therefore facilitates rather than constrains the evolution of new functions.