Survival of mineral-bound peptides into the Miocene

  1. Beatrice Demarchi  Is a corresponding author
  2. Meaghan Mackie
  3. Zhiheng Li
  4. Tao Deng
  5. Matthew J Collins
  6. Julia Clarke
  1. Department of Life Sciences and Systems Biology, University of Turin, Italy
  2. The Globe Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
  3. The Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
  4. Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, China
  5. McDonald Institute for Archaeological Research, University of Cambridge, United Kingdom
  6. Department of Geological Sciences, The University of Texas at Austin, United States

Abstract

Previously, we showed that authentic peptide sequences could be obtained from 3.8-Ma-old ostrich eggshell (OES) from the site of Laetoli, Tanzania (Demarchi et al., 2016). Here, we show that the same sequences survive in a >6.5 Ma OES recovered from a palaeosteppe setting in northwestern China. The eggshell is thicker than those observed in extant species and consistent with the Liushu Struthio sp. ootaxon. These findings push the preservation of ancient proteins back to the Miocene and highlight their potential for paleontology, paleoecology, and evolutionary biology.

Editor's evaluation

This fundamental study substantially pushes the known preservation of protein sequences bound to mineral surfaces. The successful recovery of these sequences from late Miocene fossil eggshell also has important implications for taxonomic classification. This solid work encourages future paleoproteomic research on paleontological remains from deep antiquity and across various taxa. The paper will be of great interest to a wide range of paleoscientists.

https://doi.org/10.7554/eLife.82849.sa0

Introduction

The oldest authenticated peptide sequences to date were reported in 2016 from 3.8-Ma-old ostrich eggshell (OES) from the site of Laetoli, Tanzania (Demarchi et al., 2016). This finding had great scientific impact since it integrated computational chemistry (molecular dynamics simulations) as well as experimental data to propose a mechanism of preservation, concluding that mineral binding ensures the survival of protein sequences. Importantly, this study demonstrated that peptide-bound amino acids could survive into deep time even in hot environments. The effect of temperature on the kinetics of protein diagenesis has been described by several authors, both on the basis of actualistic experiments and of the quantification of the extent of degradation in ancient samples of known ages (Crisp et al., 2013; Demarchi et al., 2013; Hendy et al., 2012; Johnson et al., 1997; Kaufman, 2006; Kaufman, 2003; Schroeder and Bada, 1976; Wehmiller, 2013; Wehmiller, 1977; Wehmiller and Belknap, 1978). This discovery has fuelled the analysis of ancient proteins from other mineral matrices, namely tooth enamel (Cappellini et al., 2019; Welker et al., 2020; Welker et al., 2019) in order to reconstruct phylogenies.

The recovery of peptides has so far been limited to biomineral samples of Plio-Pleistocene age: our attempts at retrieving intact peptides from Cretaceous eggshell were unsuccessful, despite the fact that a genuine intracrystalline fraction of amino acids was preserved in the same sample (Saitta et al., 2020). Here, we show that the same peptide sequences that exhibit strong binding to the calcite surface—and that were recovered from the Laetoli OES—also persist into the late Miocene, in a >6.5-Ma-old eggshell sample from the Linxia Basin, northeastern Tibetan Plateau, China, Liushu Formation.

Results and discussion

The OES (IVPP V26107) was recovered from an area between the towns of Xinji and Songming, in Hezheng County, close to the border with Guanghe County, from mudstone facies of the Liushu Formation (Figure 1).

Map showing the location of the fossil eggshell site.

Both eggshell and skeletal remains are previously known from the Late Miocene Liushu Formation, Linxia Basin, of Gansu Province, where the mean annual temperature is ∼11C (Hou et al., 2005; Liu et al., 2016; Li et al., 2021; Wang, 2008). Age control on the highly fossiliferous units of the Lishu Formation is good (Deng et al., 2019; Deng et al., 2013; Zhang et al., 2012), with detailed correlations across China as well as with Neogene deposits globally. These units have been assessed via new magnetostratigraphic data; most of the Hipparion fauna is estimated to be earlier than 6.4–6.5 Ma, with a lower Liushu transition to a Platybelodon fauna yielding age estimates of 11.1–12.5 Ma (Zhang et al., 2012). These dates agree with the prior estimates from biostratigraphic and magnetostratigraphic data (Deng et al., 2019; Deng et al., 2013). The recovery site is closer to the southern part of the basin near the Heilinding section of Zhang et al., 2012, which had an estimated minimum age of 6.4–6.5 Ma (Chron 3An). In contrast, the candidate stratotype section, Guoniguo, that exposes lower parts of the Liushu, is north of the recovery area (Deng et al., 2019; Zhang et al., 2012).

Ostrich (Struthio) remains to have a long history of recovery from the late Miocene of northwest China (Buffetaut and Angst, 2021; Hou et al., 2005; Li et al., 2021; Lowe, 1931; Mikhailov and Zelenkov, 2020). The first reported ostrich fossils from China were from units of the ‘Hipparion Clay’ (Red Clay) dated between 6.54 and 7.18 Ma (Lowe, 1931; Zhu et al., 2008). Mikhailov and Zelenkov, 2020 concluded that the Wang, 2008 Liushu taxon is referable to Struthio and not presently supported as a distinct but related genus; it is proposed to be from a taxon larger than extant Struthio species with a thickness of ~2.4 mm consistent with the eggshell sampled here (Figure 2C).

Eggshell specimen (IVPP V26107): photographs (A, B) and CT scan (C).

Given the extreme antiquity of the samples, protein extraction was performed in an ultra-clean facility at the University of Copenhagen in order to minimize any chance of contamination, following Hendy et al., 2018. Furthermore, OES powders were bleached extensively to isolate the intracrystalline fraction only (as described by Demarchi et al., 2016), samples were analysed by LC-MS/MS using a new LC column, and six blanks (analytical and procedural) were included in the run, flanking the sample. Data analysis was equally cautious: raw tandem mass spectrometry data were used to reconstruct potential peptide sequences starting from raw product ion spectra; only de novo peptides with an ALC (Average Local Confidence) score ≥80% were considered. This is the most stringent threshold that can be applied, and it signifies that all potential peptide sequences with lower confidence are discarded. The de novo peptides were searched against the Uniprot/Swissprot database (containing 565,254 manually annotated and reviewed protein sequences). A database of common laboratory contaminants (common Repository of Adventitious Protein [cRAP]) was included in the search. Eleven unique peptide sequences were found to match the sequence of struthiocalcin-1 (Table 1), all containing the typical Asp-rich motif ‘DDDD’ (Figures 3 and 4), which had been shown in our previous paper to be the mineral-binding peptide belonging to the sequence of struthiocalcin-1 (SCA-1). Annotated tandem mass spectra are shown in Figure 3 (ALDDDDYPKG) and Figure 3—figure supplement 1 (ALDDDYPK), Figure 3—figure supplement 2 (SALDDDDYPKG), Figure 3—figure supplement 3 (DDDDYPKGKH), Figure 3—figure supplement 4 (LDDDDYPKGK), Figure 3—figure supplement 5 (SALDDDDYPK), Figure 3—figure supplement 6 (DDDDYPKGK), Figure 3—figure supplement 7 (LDDDDYPKG), Figure 3—figure supplement 8 (DDDYPKGK), Figure 3—figure supplement 9 (DDDDYPK), and Figure 3—figure supplement 10 (DDYPKGK).

Table 1
Peptide sequences identified in sample IVPP V26107.
ScanPrecursor masszm/zPeptide sequencePeptide massError (ppm)Peptide –10lgP
54011107.4722555.246ALDDDDYPKG1107.4725.727.24
52681050.4572526.236ALDDDDYPK1050.4516.124.83
50511194.5172598.266SALDDDDYPKG1194.50410.921.11
49841137.4942569.754SALDDDDYPK1137.48310.129.94
47271137.4942569.755SALDDDDYPK1137.4839.716.76
45651107.4832554.749ALDDDDYPKG1107.4729.832.57
45651107.4842554.749ALDDDDYPKG1107.47210.628.7
44431050.4592526.237ALDDDDYPK1050.4517.617.62
43131050.4612526.235ALDDDDYPK1050.4519.915.13
41991050.4582526.236ALDDDDYPK1050.4517.216.35
40371036.4422519.230LDDDDYPKG1036.4357.216.57
27241164.5392583.277LDDDDYPKGK1164.538.230.91
2502866.3352434.176DDDDYPK866.3356.918.46
2176866.3352434.175DDDDYPK866.3357.121.18
2026866.3382434.176DDDDYPK866.32910.119.07
13961051.4542526.734DDDDYPKGK1051.4467.828.62
1138936.4272469.221DDDYPKGK936.4198.427.8
10461188.5133397.178DDDDYPKGKH1188.5057.230.65
1042821.3982411.706DDYPKGK821.3927.617.39
Figure 3 with 10 supplements see all
Annotated product ion spectrum of peptide ALDDDDYPKG, m/z 554.749, −10lgP=32.57.
Coverage of struthiocalcin-1 (SCA-1) in Miocene OES specimen IVPP V26107.

Numbers indicate spectrum matches for each position in the sequence.

Conclusions

We present the first evidence for peptide survival into the Miocene, confirming our previous (Pliocene) data and providing further support to the mechanism of preservation based on the binding of Asx-rich peptides to calcite surfaces (Demarchi et al., 2016). The sequence recovered from the Chinese specimen is identical to the peptides found in the 3.8 Ma Laetoli OES, thus also supporting the attribution of the Liushu ootaxon to genus Struthio. While the four Asp residues are conserved across several avian taxa, in some species, including other ratites, Asp can be substituted by Glu and some of the flanking residues are also variable (Demarchi et al., 2022, fig. S1); therefore, variability within this sequence could be informative for evolutionary relationships between extinct and extant taxa. The absolute age of the Chinese sample is greater than that of the African material (Demarchi et al., 2016), but, given the latitudinal differences between the two sites, it is likely to have experienced lower temperatures throughout its burial history. This suggests that sequences of even greater antiquity may be recovered from biominerals harboring a closed system of proteins, particularly from sites in cold environments (e.g., high altitude and/or latitude).

Materials and methods

Sample preparation and analysis

Request a detailed protocol

A subsample of OES specimen IVPP V26107 was prepared in the ultra-clean facility at the University of Copenhagen, following the protocol of Demarchi et al., 2016; Demarchi et al., 2022 and omitting the digestion step. In brief, the fragment was powdered, bleached for 72 hr (NaOCl, 15% w/v) and demineralized in cold weak hydrochloric acid (0.6 M HCl). The acid was added in 200 µl increments until it stopped reacting, for a total volume of 1400 µl 0.6 M HCl. The extracts were exchanged in ammonium bicarbonate buffer (pH=7.8) using 3 kDa MWCO ultrafilters and the peptides were purified and concentrated using C18 Stage Tips (Cappellini et al., 2019; Rappsilber et al., 2007).

Prepared StageTips of the procedural blank and sample were eluted with 30 µl of 40% acetonitrile (ACN) 0.1% formic acid into a 96-well plate prior to LC-MS/MS analysis. To remove the ACN, the plate was vacuum centrifuged until approximately 5 µl remained. Samples were then resuspended with 6 µl of 0.1% trifluoroacetic acid (TFA) 5% ACN. Based on protein concentration results at 205 nm (NanoDrop, Thermo Fisher Scientific), 5 µl of each sample and procedural blank was then separated over a 77 min gradient by an EASY-nLC 1200 (Proxeon, Odense, Denmark) attached to a Q-Exactive HF-X mass spectrometer (Thermo Fisher Scientific, Germany) using a 15 cm column. The column (75 μm inner diameter) was made in-house, laser pulled, and packed with 1.9 μm C18 beads (Dr. Maisch, Germany). Parameters were the same as those already published for historical samples (Mackie et al., 2018). In short, MS1: 120k resolution, maximum injection time (IT) 25 ms, scan target 3E6. MS2: 60k resolution, top 10 mode, maximum IT 118 ms, minimum scan target 3E3, normalized collision energy of 28, dynamic exclusion 20 s, and isolation window of 1.2 m/z. Wash-blanks consisting of 0.1% TFA 5% ACN were also run in order to hinder cross-contamination. The LC-MS/MS run included, in this order: two wash blanks, one procedural blank, one wash blank, the OES sample, and two wash blanks. The data sets have been deposited to the ProteomeXchange Consortium via the Proteomics Identifications Database (PRIDE) partner repository with the identifier PXD035872.

Data analysis

Request a detailed protocol

Bioinformatic analysis was carried out using PEAKS Studio 8.5 (Bioinformatics Solutions Inc; Zhang et al., 2012). The Uniprot_swissprot database (downloaded 11/08/2021) was used for carrying out the searches and common contaminants were included (cRAP: http://www.thegpm.org/crap/). No enzyme was specified for the digestion and the tolerance was set to 10 ppm on the precursor and 0.05 Da on the fragments. The thresholds for peptide and protein identification were set as follows: peptide score −10lgP≥15, protein score −10lgP≥20, and de novo sequences scores (ALC%)≥80.

Data availability

Tandem mass spectra supporting peptide sequence identification are reported in Figure 3 and Figure 3 - Supplement 1 to 10. Raw mass spectrometry data and results of bioinformatics analysis are available via ProteomeXchange with identifier PXD035872.

The following data sets were generated
    1. Demarchi B
    (2022) PRIDE
    ID PXD035872. Ostrich eggshell peptides survival into the Miocene (6-9 Ma).

References

  1. Book
    1. Deng T
    2. Qiu ZX
    3. Wang BY
    4. Wang X
    5. Hou SK
    (2013)
    Late Cenozoic Biostratigraphy of the Linxia Basin, Northwestern China. Fossil Mammals of Asia
    Columbia University Press.
    1. Li Z
    2. Bailleul AM
    3. Stidham TA
    4. Wang M
    5. Deng T
    (2021)
    Exceptional preservation of an extinct ostrich from the Late Miocene Linxia basin of China
    Vertebrata PalAsiatica 59:229–244.
  2. Book
    1. Lowe PR
    (1931)
    Struthious remains from northern China and Mongolia: with descriptions of Struthio wimani, Struthio anderssoni and Struthio mongolicus spp. nov
    Geological Survey of China.
    1. Wang S
    (2008)
    Reexamination of taxonomic assignment of Struthio linxiaensis Hou et al., 2005
    Acta Palaeontol Sin 47:362–368.

Article and author information

Author details

  1. Beatrice Demarchi

    Department of Life Sciences and Systems Biology, University of Turin, Torino, Italy
    Contribution
    Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing
    For correspondence
    beatrice.demarchi@unito.it
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-8398-4409
  2. Meaghan Mackie

    1. The Globe Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
    2. The Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
    Contribution
    Formal analysis, Methodology, Writing – review and editing
    Competing interests
    No competing interests declared
  3. Zhiheng Li

    Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing, China
    Contribution
    Formal analysis, Visualization, Writing – review and editing
    Competing interests
    No competing interests declared
  4. Tao Deng

    Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing, China
    Contribution
    Resources, Investigation, Methodology, Writing – review and editing
    Competing interests
    No competing interests declared
  5. Matthew J Collins

    1. The Globe Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
    2. McDonald Institute for Archaeological Research, University of Cambridge, Cambridge, United Kingdom
    Contribution
    Conceptualization, Resources, Formal analysis, Writing – original draft, Project administration, Writing – review and editing
    Competing interests
    No competing interests declared
  6. Julia Clarke

    Department of Geological Sciences, The University of Texas at Austin, Austin, United States
    Contribution
    Conceptualization, Investigation, Methodology, Writing – original draft, Writing – review and editing
    Competing interests
    No competing interests declared

Funding

Ministry of University and Research - Italy (Young Researchers - Rita Levi Montalcini)

  • Beatrice Demarchi

Danish National Research Foundation (PROTEIOS (DNRF128))

  • Meaghan Mackie
  • Matthew J Collins

Alexander von Humboldt Foundation

  • Julia Clarke

University of Texas at Austin

  • Julia Clarke

National Natural Science Foundation of China

  • Zhiheng Li

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

BD’s work was supported by a “Young Researchers - Rita Levi Montalcini” grant from the Italian Ministry of University and Research. MM and MJC were supported by the Danish National Research Foundation Award PROTEIOS (DNRF128). JC acknowledges the Alexander von Humboldt Foundation and the Jackson School of Geosciences and ZL and DT the Chinese National Science Foundation. The authors thank Prof. Jesper Velgaard Olsen at the Novo Nordisk Center for Protein Research for providing access and resources, which were also funded in part by a donation from the Novo Nordisk Foundation (Grant no. NNF14CC0001). For the purpose of open access, MJC is part of the Rights Retention Pilot at the University of Cambridge and has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising from this submission. The authors wish to thank the reviewers and editors for their feedback on this manuscript.

Version history

  1. Preprint posted: August 19, 2022 (view preprint)
  2. Received: August 24, 2022
  3. Accepted: December 16, 2022
  4. Accepted Manuscript published: December 19, 2022 (version 1)
  5. Version of Record published: December 30, 2022 (version 2)

Copyright

© 2022, Demarchi et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 793
    views
  • 127
    downloads
  • 6
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Beatrice Demarchi
  2. Meaghan Mackie
  3. Zhiheng Li
  4. Tao Deng
  5. Matthew J Collins
  6. Julia Clarke
(2022)
Survival of mineral-bound peptides into the Miocene
eLife 11:e82849.
https://doi.org/10.7554/eLife.82849

Share this article

https://doi.org/10.7554/eLife.82849

Further reading

    1. Biochemistry and Chemical Biology
    Pattama Wiriyasermkul, Satomi Moriyama ... Shushi Nagamori
    Research Article

    Transporter research primarily relies on the canonical substrates of well-established transporters. This approach has limitations when studying transporters for the low-abundant micromolecules, such as micronutrients, and may not reveal physiological functions of the transporters. While d-serine, a trace enantiomer of serine in the circulation, was discovered as an emerging biomarker of kidney function, its transport mechanisms in the periphery remain unknown. Here, using a multi-hierarchical approach from body fluids to molecules, combining multi-omics, cell-free synthetic biochemistry, and ex vivo transport analyses, we have identified two types of renal d-serine transport systems. We revealed that the small amino acid transporter ASCT2 serves as a d-serine transporter previously uncharacterized in the kidney and discovered d-serine as a non-canonical substrate of the sodium-coupled monocarboxylate transporters (SMCTs). These two systems are physiologically complementary, but ASCT2 dominates the role in the pathological condition. Our findings not only shed light on renal d-serine transport, but also clarify the importance of non-canonical substrate transport. This study provides a framework for investigating multiple transport systems of various trace micromolecules under physiological conditions and in multifactorial diseases.

    1. Biochemistry and Chemical Biology
    2. Cell Biology
    Natalia Dolgova, Eva-Maria E Uhlemann ... Oleg Y Dmitriev
    Research Article

    Mediator of ERBB2-driven Cell Motility 1 (MEMO1) is an evolutionary conserved protein implicated in many biological processes; however, its primary molecular function remains unknown. Importantly, MEMO1 is overexpressed in many types of cancer and was shown to modulate breast cancer metastasis through altered cell motility. To better understand the function of MEMO1 in cancer cells, we analyzed genetic interactions of MEMO1 using gene essentiality data from 1028 cancer cell lines and found multiple iron-related genes exhibiting genetic relationships with MEMO1. We experimentally confirmed several interactions between MEMO1 and iron-related proteins in living cells, most notably, transferrin receptor 2 (TFR2), mitoferrin-2 (SLC25A28), and the global iron response regulator IRP1 (ACO1). These interactions indicate that cells with high MEMO1 expression levels are hypersensitive to the disruptions in iron distribution. Our data also indicate that MEMO1 is involved in ferroptosis and is linked to iron supply to mitochondria. We have found that purified MEMO1 binds iron with high affinity under redox conditions mimicking intracellular environment and solved MEMO1 structures in complex with iron and copper. Our work reveals that the iron coordination mode in MEMO1 is very similar to that of iron-containing extradiol dioxygenases, which also display a similar structural fold. We conclude that MEMO1 is an iron-binding protein that modulates iron homeostasis in cancer cells.