Ancient trans-species polymorphism at the Major Histocompatibility Complex in primates

  1. Alyssa Lyn Fortier  Is a corresponding author
  2. Jonathan K Pritchard
  1. Department of Biology, Stanford University, United States
  2. Department of Genetics, Stanford University, United States
8 figures, 5 tables and 3 additional files

Figures

Figure 1 with 2 supplements
The MHC region in humans (HLA).

(A) Each point at top represents the location of a gene. The different types of HLA genes are distinguished by different colors, shown in the key at left. The 19 functional HLA genes are labeled with their name (omitting their ‘HLA’ prefix due to space constraints). Gray points represent non-HLA genes and pseudogenes in the region. The black line shows nucleotide diversity (Nei and Li’s π) across the region, while the pink horizontal line shows the genome-wide average nucleotide diversity (π0.001) (Sachidanandam et al., 2001). (B) Nucleotide diversity around classical Class I gene HLA-A, with exon structure shown. (C) Nucleotide diversity around classical Class II gene HLA-DRB1, with exon structure shown. (D) Species tree showing the phylogenetic relationships among selected primates from this study (Kuderna et al., 2023). The colors of the icons are consistent with colors used throughout the paper to distinguish species. The pink vertical dashed lines indicate split times of the new-world monkeys (NWM) from the apes/old-world monkeys (OWM) (39 MYA), OWM from the apes (31 MYA), and the lesser apes (gibbons) from the great apes (23 MYA).

Figure 1—figure supplement 1
Nucleotide diversity in the human HLA region.

Nucleotide diversity (Nei and Li’s π) around each HLA gene is shown in black, while the pink horizontal line shows the genome-wide average nucleotide diversity (π0.001) (Sachidanandam et al., 2001). The genes are shown in order along the genome (from top left to bottom right), but the x-axis is repeatedly broken in order to zoom in on detail around each gene. The genes are colored according to their type, with key shown at bottom right. For the functional genes (and some pseudogenes), the exon structure is shown by boxes; other pseudogenes are not annotated with this level of detail.

Figure 1—figure supplement 2
Species key.

On the left-hand side is the species tree relating the species used in this study (Kuderna et al., 2023; Foley et al., 2023). Each species/tip is labeled with a unique color and 4-letter abbreviation, which is composed of the first two letters of the genus name and first two letters of the species name. The common name and Latin name for each species is shown on the right-hand side. Plecturocebus moloch is listed in the IPD-MHC database under its old name, Callicebus moloch, and uses a different abbreviation (Camo) in that resource. Similarly, Leontocebus fuscicolis was formerly known as Saguinus fuscicollis (Safu) in the IPD-MHC database. There appears to be some debate as to whether the pygmy marmoset should be placed in the Callithrix or Cebuella genus, but we have used the name Cebuella pygmaea (Cepy) in accordance with a recent primate study (Kuderna et al., 2023). This species is known as Callithrix pygmaea (Capy) in IPD-MHC. OWM, Old-World Monkeys; NWM, New World Monkeys; Str., Strepsirrhini; Gli., Glires; Lau., Laurasiatheria; Atl., Atlantogenata.

Figure 2 with 16 supplements
BEAST2 allele summary trees using sequences from exon 2.

(A) MHC-C, (B) MHC-DQB1, and (C) MHC-DOA. Each tip represents an allele, with color and four-letter abbreviation representing the species (see Figure 1—figure supplement 2 for full species key). The species label is followed by the allele name (see Appendix 1 for more details on nomenclature) or RefSeq accession number. For simplicity, monophyletic groups of similar alleles are collapsed with a triangle and labeled with their one-field allele name. The color/abbreviation key (center) also depicts the species tree (Kuderna et al., 2023). Human alleles (HLA; red) are bolded for emphasis. Dashed outgroup branches are scaled by a factor of 110 to clarify tree structure within the clade of interest. The smaller inset tree in panel B highlights the relationships between two human allele groups (red) and two OWM allele groups (green). The indicated human and OWM lineages coalesce more recently between groups than within each group. Pri., primate backbone sequences; Mam., mammal outgroup sequences.

Figure 2—figure supplement 1
MHC-A-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; > 100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 2
MHC-B-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1Appendix 1for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; > 100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 3
MHC-C-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 4
MHC-E-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 5
MHC-F-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 6
MHC-G-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 7
MHC-DRA-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 8
MHC-DQA-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 9
MHC-DPA-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 10
MHC-DMA-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 11
MHC-DOA-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 12
MHC-DRB-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 13
MHC-DQB-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 14
MHC-DPB-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 15
MHC-DMB-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 2—figure supplement 16
MHC-DOB-related group BEAST2 tree for exon 2 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3 with 14 supplements
Strong support for TSP at Class I genes MHC-B and -C.

Bayes factors computed over the set of BEAST trees indicate deep TSP. Different species comparisons are listed on the y-axis, and different gene regions are listed on the x-axis. Each table entry is colored and labeled with the maximum Bayes factor among all tested quartets of alleles belonging to that category. High Bayes factors (orange) indicate support for TSP among the given species for that gene region, while low Bayes factors (teal) indicate that alleles assort according to the species tree, as expected. Bayes factors above 100 are considered decisive. Tan values show poor support for either hypothesis, while white boxes indicate that there are not enough alleles in that category with which to calculate Bayes factors. MHC-A is not present in the NWMs, and MHC-C was not present before the human-orangutan ancestor, so it is not possible to calculate Bayes factors for these species comparisons.

Figure 3—figure supplement 1
MHC-A-related group BEAST2 tree for exon 3 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 2
MHC-A-related group BEAST2 tree for exon 4 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 3
MHC-B-related group BEAST2 tree for exon 3 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 4
MHC-B-related group BEAST2 tree for exon 4 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 5
MHC-C-related group BEAST2 tree for exon 3 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 6
MHC-C-related group BEAST2 tree for exon 4 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 7
MHC-E-related group BEAST2 tree for exon 3 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 8
MHC-E-related group BEAST2 tree for exon 4 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 9
MHC-F-related group BEAST2 tree for exon 3 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 10
MHC-F-related group BEAST2 tree for exon 4 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 11
MHC-G-related group BEAST2 tree for exon 3 (PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 12
MHC-G-related group BEAST2 tree for exon 4 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 3—figure supplement 13
TSP among old-world monkey groups for the Class I genes.

Different species comparisons are listed on the y-axis, and different gene regions are listed on the x-axis. Each table entry is colored and labeled with the maximum Bayes factor among all tested quartets of alleles belonging to that category. High Bayes factors (orange) indicate support for TSP among the given species for that gene region, while low Bayes factors (teal) indicate that alleles assort according to the species tree, as expected. Bayes factors above 100 are considered decisive. Tan values show poor support for either hypothesis, while white boxes indicate that there are not enough alleles in that category with which to calculate Bayes factors. MHC-C was not present before the human-orangutan ancestor, so it is not possible to calculate Bayes factors for MHC-C for these species comparisons.

Figure 3—figure supplement 14
TSP among new-world monkey groups for the Class I genes.

Different species comparisons are listed on the y-axis, and different gene regions are listed on the x-axis. Each table entry is colored and labeled with the maximum Bayes factor among all tested quartets of alleles belonging to that category. High Bayes factors (orange) indicate support for TSP among the given species for that gene region, while low Bayes factors (teal) indicate that alleles assort according to the species tree, as expected. Bayes factors above 100 are considered decisive. Tan values show poor support for either hypothesis, while white boxes indicate that there are not enough alleles in that category with which to calculate Bayes factors. MHC-C was not present before the human-orangutan ancestor, and MHC-A is not present in the NWM, so it is not possible to calculate Bayes factors for these genes.

Figure 4 with 12 supplements
Strong support for TSP at the classical Class II genes.

Bayes factors computed over the set of BEAST trees indicate deep TSP. Different species comparisons are listed on the y-axis, and different gene regions are listed on the x-axis. Each table entry is colored and labeled with the maximum Bayes factor among all tested quartets of alleles belonging to that category. High Bayes factors (orange) indicate support for TSP among the given species for that gene region, while low Bayes factors (teal) indicate that alleles assort according to the species tree, as expected. Bayes factors above 100 are considered decisive. Tan values show poor support for either hypothesis, while white boxes indicate that there are not enough alleles in that category with which to calculate Bayes factors.

Figure 4—figure supplement 1
MHC-DRA-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 2
MHC-DQA-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 3
MHC-DPA-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 4
MHC-DMA-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 5
MHC-DOA-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 6
MHC-DRB-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 7
MHC-DQB-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 8
MHC-DPB-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 9
MHC-DMB-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 10
MHC-DOB-related group BEAST2 tree for exon 3 (non-PBR-encoding).

In the tree, each tip represents a sequence (see Appendix 1 for more details on nomenclature), with the colored rectangle and four-letter abbreviation indicating the species (see Figure 1—figure supplement 2 for full species key). Following the rectangle, tips are labeled with the sequence name; sequences which have been assigned to loci are colored according to the gene group, while unassigned sequences are written in gray. Dashed branches are shortened to 10% of their length to expand detail in the rest of the tree. The left-hand side shows example Bayes factors we calculated from the set of posterior trees; the tree tips correspond to the rows of the grid. Each panel represents a type of comparison (labeled at top) and each column represents one of the top 5 highest-Bayes-factor comparisons (exact value at bottom; >100 indicates strong evidence for TSP). The four colored blocks in each column include two red blocks (human sequences) that were tested against two sequences from other species (see Materials and methods; Bayes factors). Grayed-out rows correspond to sequences that were not considered for Bayes factors, either because they belong to a non-orthologous gene or backbone sequence (and thus not relevant to compare with the human gene in question), or because they may have been involved in a gene conversion event in this exon (according to GENECONV or from the literature).

Figure 4—figure supplement 11
TSP among old-world monkey groups for the Class II genes.

Different species comparisons are listed on the y-axis, and different gene regions are listed on the x-axis. Each table entry is colored and labeled with the maximum Bayes factor among all tested quartets of alleles belonging to that category. High Bayes factors (orange) indicate support for TSP among the given species for that gene region, while low Bayes factors (teal) indicate that alleles assort according to the species tree, as expected. Bayes factors above 100 are considered decisive. Tan values show poor support for either hypothesis, while white boxes indicate that there are not enough alleles in that category with which to calculate Bayes factors.

Figure 4—figure supplement 12
TSP among new-world monkey groups for the Class II genes.

Different species comparisons are listed on the y-axis, and different gene regions are listed on the x-axis. Each table entry is colored and labeled with the maximum Bayes factor among all tested quartets of alleles belonging to that category. High Bayes factors (orange) indicate support for TSP among the given species for that gene region, while low Bayes factors (teal) indicate that alleles assort according to the species tree, as expected. Bayes factors above 100 are considered decisive. Tan values show poor support for either hypothesis, while white boxes indicate that there are not enough alleles in that category with which to calculate Bayes factors. There is not enough data in each category to compute Bayes factors for these groups for MHC-DPA1, -DMA, -DMB, -DOA, and -DOB.

Figure 5 with 5 supplements
Rapidly-evolving sites in the Class I genes.

(A) Rapidly-evolving sites are primarily located in exons 2 and 3. Here, the exons are concatenated such that the cumulative position along the coding region is on the x-axis. The dashed orange lines denote exon boundaries. The three genes are aligned such that the same vertical position indicates an evolutionarily equivalent site. The y-axis shows the substitution rate at each site, expressed as a fold change (the base-2 logarithm of each site’s evolutionary rate divided by the mean rate among mostly-gap sites in each alignment; see Materials and methods). (B) Rapidly-evolving sites are located in each protein’s peptide-binding pocket. Structures are Protein Data Bank (Berman et al., 2000) 4BCE (Teze et al., 2014) for HLA-B, 4NT6 (Choo et al., 2014) for HLA-C, and 7P4B (Walters et al., 2022) for HLA-E, with images created in PyMOL (Schrödinger, LLC, 2021). Substitution rates for each amino acid are computed as the mean substitution rate of the three sites composing the codon. Orange indicates rapidly-evolving amino acids, while teal indicates conserved amino acids. (C) Rapidly-evolving amino acids are significantly closer to the peptide than conserved amino acids. The y-axis shows the BEAST2 substitution rate and the x axis shows the minimum distance to the bound peptide, measured in PyMOL (Schrödinger, LLC, 2021). Each point is an amino acid, and distances are averaged over several structures (see Table 5). The orange line is a linear regression of substitution rate on minimum distance, with slope and p-value annotated on each panel.

Figure 5—figure supplement 1
Rapidly-evolving sites in the Class I genes.

Rapidly-evolving sites are primarily located in exons 2 and 3. Here, the exons are concatenated such that the cumulative position along the coding region is on the x-axis. The dashed orange lines denote exon boundaries and the exon numbers are labeled in the top panel. The genes are aligned such that the same vertical position indicates an evolutionarily equivalent site. The y-axis shows the substitution rate at each site, expressed as a fold change (the base-2 logarithm of each site’s evolutionary rate divided by the mean rate among mostly-gap sites in each alignment; see Materials and methods).

Figure 5—figure supplement 2
Proportions of rapidly-evolving sites for Class I.

Sites were binned into slowly-evolving (≤-1), rapidly-evolving (>1), or baseline (>-1 but ≤1) categories. We then calculated proportions of these categories for exon 2, exon 3, exon 4, and the ’other’ exons (not exons 2, 3, or 4). These bins are all approximately the same size, ∼270 bp. For each gene and exon, we tested the difference in the proportion of rapidly-evolving sites between that exon and the ‘other’ exons group (two-sample z-test for equality of proportions with continuity correction). Significant tests (Bonferroni corrected; p<0.05/3) are marked with an asterisk.

Figure 5—figure supplement 3
Rapidly-evolving sites on Class I protein structures.

Structures are Protein Data Bank (Berman et al., 2000) 6 J1W (Zhu et al., 2019) for HLA-A, 3BVN (Kumar et al., 2009) for HLA-B, 4NT6 (Choo et al., 2014) for HLA-C, 7P4B (Walters et al., 2022) for HLA-E, 5IUE (Dulberger et al., 2017) for HLA-F, and 3KYN (Walpole et al., 2010) for HLA-G, with images created in PyMOL (Schrödinger, LLC, 2021). Substitution rates for each amino acid are computed as the mean substitution rate of the three sites composing the codon. Orange indicates rapidly-evolving amino acids, while teal indicates conserved amino acids.

Figure 5—figure supplement 4
Evolutionary rate is related to the distance to peptide.

The y-axis shows the BEAST2 substitution rate, expressed as a fold change (the base-2 logarithm of each site’s evolutionary rate divided by the mean rate among mostly-gap sites in each alignment; see Materials and methods). The x-axis shows the minimum distance to the bound peptide, measured in PyMOL (Schrödinger, LLC, 2021). Each point is an amino acid, and distances are averaged over several structures (see Table 5). The orange line is a linear regression of substitution rate on minimum distance, with slope and p-value annotated on each panel. Amino acids with a fold change greater than 1.5 are labeled.

Figure 5—figure supplement 5
Class I rapidly-evolving sites by binned distance to peptide.

Nucleotide sites were divided into those whose corresponding amino acids contact the peptide (<4Å) versus do not contact the peptide (≥4Å), shown on the x-axis. This distance has been used previously to define plausible peptide-contacting residues (Nielsen et al., 2007). The y-axis shows the substitution rate at each site, expressed as a fold change (the base-2 logarithm of each site’s evolutionary rate divided by the mean rate among mostly-gap sites in each alignment; see Materials and methods). For each gene, the groups are compared using a Wilcoxon test, with p-value displayed at the top of each panel.

Figure 6 with 7 supplements
Rapidly-evolving sites in the Class II genes.

(A) Rapidly-evolving sites are primarily located in exon 2. Here, the exons are concatenated such that the cumulative position along the coding region is on the x-axis. The dashed orange lines denote exon boundaries. The α genes (top two plots) are aligned such that the same vertical position indicates an evolutionarily equivalent site; the same is true for the β genes (bottom two plots). The y-axis shows the substitution rate at each site, expressed as a fold change (the base-2 logarithm of each site’s evolutionary rate divided by the mean rate among mostly-gap sites in each alignment; see Materials and methods). (B) Rapidly-evolving sites are located in each protein’s peptide-binding pocket. Structures are Protein Data Bank (Berman et al., 2000) 5JLZ (Gerstner et al., 2016) for HLA-DR and 2NNA (Henderson et al., 2007) for HLA-DQ, with images created in PyMOL (Schrödinger, LLC, 2021). Substitution rates for each amino acid are computed as the mean substitution rate of the three sites composing the codon. Orange indicates rapidly-evolving amino acids, while teal indicates conserved amino acids. (C) Rapidly-evolving amino acids are significantly closer to the peptide than conserved amino acids. The y-axis shows the BEAST2 substitution rate and the x axis shows the minimum distance to the bound peptide, measured in PyMOL (Schrödinger, LLC, 2021). Each point is an amino acid, and distances are averaged over several structures (see Table 5). The orange line is a linear regression of substitution rate on minimum distance, with slope and p-value annotated on each panel.

Figure 6—figure supplement 1
Rapidly-evolving sites in the Class IIA genes.

Rapidly-evolving sites are primarily located in exon 2. Here, the exons are concatenated such that the cumulative position along the coding region is on the x-axis. The dashed orange lines denote exon boundaries and the exon numbers are labeled in the top panel. The genes are aligned such that the same vertical position indicates an evolutionarily equivalent site. The y-axis shows the substitution rate at each site, expressed as a fold change (the base-2 logarithm of each site’s evolutionary rate divided by the mean rate among mostly-gap sites in each alignment; see Materials and methods).

Figure 6—figure supplement 2
Proportions of rapidly-evolving sites for Class IIA.

Sites were binned into slowly-evolving (≤-1), rapidly-evolving (>1), or baseline (>-1 but ≤1) categories. We then calculated proportions of these categories for exon 2, exon 3, and the ‘other’ exons (not exons 2 or 3). These bins are all approximately the same size, ∼270bp. For each gene and exon, we tested the difference in the proportion of rapidly-evolving sites between that exon and the ‘other’ exons group (two-sample z-test for equality of proportions with continuity correction). Significant tests (Bonferroni corrected; p<0.05/2) are marked with an asterisk.

Figure 6—figure supplement 3
Rapidly-evolving sites in the Class IIB genes.

Rapidly-evolving sites are primarily located in exon 2. Here, the exons are concatenated such that the cumulative position along the coding region is on the x-axis. The dashed orange lines denote exon boundaries and the exon numbers are labeled in the top panel. The genes are aligned such that the same vertical position indicates an evolutionarily equivalent site. The y-axis shows the substitution rate at each site, expressed as a fold change (the base-2 logarithm of each site’s evolutionary rate divided by the mean rate among mostly-gap sites in each alignment; see Materials and methods).

Figure 6—figure supplement 4
Proportions of rapidly-evolving sites for Class IIB.

Sites were binned into slowly-evolving (≤-1), rapidly-evolving (>1), or baseline (>-1 but ≤1) categories. We then calculated proportions of these categories for exon 2, exon 3, and the ‘other’ exons (not exons 2 or 3). These bins are all approximately the same size, ∼270 bp. For each gene and exon, we tested the difference in the proportion of rapidly-evolving sites between that exon and the ‘other’ exons group (two-sample z-test for equality of proportions with continuity correction). Significant tests (Bonferroni corrected; p<0.05/2) are marked with an asterisk.

Figure 6—figure supplement 5
Rapidly-evolving sites on Class II protein structures.

Structures are Protein Data Bank (Berman et al., 2000) 5JLZ (Gerstner et al., 2016) for HLA-DR, 2NNA (Henderson et al., 2007) for HLA-DQ, 7 T2A (Ciacchi et al., 2023) for HLA-DP, 2BC4 (Nicholson et al., 2006) for HLA-DM, and 4I0P (Guce et al., 2013) for HLA-DO, with images created in PyMOL (Schrödinger, LLC, 2021). Substitution rates for each amino acid are computed as the mean substitution rate of the three sites composing the codon. Orange indicates rapidly-evolving amino acids, while teal indicates conserved amino acids.

Figure 6—figure supplement 6
Class II rapidly-evolving sites by binned distance to peptide.

Nucleotide sites were divided into those whose corresponding amino acids contact the peptide (<4Å) versus do not contact the peptide (≥4Å), shown on the x-axis. This distance has been used previously to define plausible peptide-contacting residues (Nielsen et al., 2007). The y-axis shows the substitution rate at each site, expressed as a fold change (the base-2 logarithm of each site’s evolutionary rate divided by the mean rate among mostly-gap sites in each alignment; see Materials and methods). For each gene, the groups are compared using a Wilcoxon test, with p-value displayed at the top of each panel. MHC-DM and -DO are not shown because they do not bind peptides.

Figure 6—figure supplement 7
Number of associations per amino acid as a function of evolutionary rate.

The x-axis shows the substitution rate at each site, expressed as a fold change (the base-2 logarithm of each site’s evolutionary rate divided by the mean rate among mostly-gap sites in each alignment; see Materials and methods). The y-axis shows the number of unique associations for each amino acid, including diseases, TCR phenotypes, and protein expression levels. Only genes with associations are shown; at the time of publishing, there were no amino acid associations meeting our criteria for MHC-E, -F, -G, -DRA, -DMA, -DMB, -DOA, or -DOB. For each gene, a regression line is shown in orange, with slope and p-value displayed at the top of each panel. The amino acids with the greatest number of associations within each gene are also labeled.

Possible unrooted trees of 4 alleles.

There is one tree where the human alleles are monophyletic, and two trees where they are non-monophyletic.

Appendix 1—figure 1
MHC allele nomenclature.

(A) Human HLA alleles are named in a standard fashion, with the gene name followed by four colon-separated fields. The first field indicates a broad-scale allele group which sometimes corresponds to a serological antigen. The second field denotes a specific HLA protein. The third field indicates synonymous changes to the nucleotide sequence in the coding region, while the fourth field is used to distinguish alleles with differences in the noncoding regions. If an allele’s expression has been characterized, an informative suffix is sometimes added (Robinson et al., 2024; Marsh et al., 2010). (B) Researchers have applied the same format to non-human alleles, with some key differences. Instead of ‘HLA’, a prefix which concatenates the first two letters of the genus name with the first two letters of the species name is used, except in certain cases where the species’ MHC system was named long ago. Paralogs can be distinguished using numbers, but sequences unassigned to a particular locus or paralog might incorporate a ‘W’ in the gene name. Use of expression tags varies, with some being added to the end of the gene name instead of the end of the entire allele name. Pseudogenes can be denoted with gene name suffixes, gene names themselves, expression suffixes, or not at all. For both human and non-human alleles, the lack of an expression suffix does not imply normal expression (de Groot et al., 2020). SLA: Swine Leukocyte Antigen; Chsa: Chlorocebus sabaeus—green monkey; Lero: Leontopithecus rosalia—golden lion tamarin; Mamu: Macaca mulatta—rhesus macaque; Aotr: Aotus trivirgatus—three-striped night monkey; Popy: Pongo pygmaeus—Bornean orangutan; Ceat: Cercocebus atys—sooty mangabey; Sala: Saguinus labiatus—white-lipped tamarin; Gogo: Gorilla gorilla—Western gorilla; Rano: Rattus norvegicus—brown rat; Patr: Pan troglodytes—chimpanzee; Papa: Pan paniscus—bonobo.

Tables

Table 1
Rapidly-evolving amino acids in MHC-B and their trait and disease associations.

Shown here are all amino acid positions in the MHC-B group evolving at more than twice the baseline rate (fold change ≥ 1). Many corresponding positions in human HLA-B have associations with autoimmune or infectious diseases, biomarkers, or TCR phenotypes. Disease associations were collected from a literature search of HLA fine-mapping studies with over 1000 cases (see Materials and methods).

Amino Acid PositionEvol. Rate Fold ChangeDistance to Peptide (Å)Associations
1563.423.55Chronic Hepatitis C (Hirata et al., 2019), HIV Set Point Viral Load (Luo et al., 2021), Asthma (Sakaue et al., 2021), Eosinophil Count (Sakaue et al., 2021), Hypothyroidism (Sakaue et al., 2021), Pediatric Asthma (Sakaue et al., 2021), Systolic Blood Pressure (Sakaue et al., 2021), Total Protein (Sakaue et al., 2021), TCR β Interaction Probability >50% (Sharon et al., 2016), Plasma Protein Levels of ADAM8, AGER, ASPSCR1, B2M, CCL16, CCL28, CCL4, CD200R1, CD5L, CDSN, CX3CL1, FCRL5, IGF2R, IL12A, IL12B, IL5RA, MICB, NUCB2, PDCD1, RARRES2, SFTPD, SIGLEC6, SNX2, TIMD4, TNFRSF4, TNR, TYRP1 (Krishna et al., 2024)
953.103.82KLRF1 Plasma Protein Level (Krishna et al., 2024)
1143.084.80Rheumatoid Arthritis (Sakaue et al., 2021), Plasma Protein Levels of AIF1, CD1C, DDR1, IL15, LILRB2, MICB (Krishna et al., 2024)
1162.843.44Eosinophil Count (Hirata et al., 2019), HIV Control (McLaren et al., 2012), Angina (Sakaue et al., 2021), Allergic Rhinitis (Waage et al., 2018), Psoriasis (Zhou et al., 2016), Plasma Protein Levels of ADAM15, APOM, BTN2A1, CD1C, CFB, CXCL11, CXCL9, FLT4, GNLY, KLRF1, LILRB1, MICB, PLXDC2, TNF, TNFRSF13C, TNXB (Krishna et al., 2024)
702.643.71Platelet (Sakaue et al., 2021), Plasma Protein Levels of CD8A, GZMA, MICB, NRP2 (Krishna et al., 2024)
972.374.29Ankylosing Spondylitis (Butler-Laporte et al., 2023), HIV Set Point Viral Load (Luo et al., 2021), HIV Control (McLaren et al., 2012), Adult Height (Sakaue et al., 2021), Alkaline Phosphatase (Sakaue et al., 2021), Body Weight (Sakaue et al., 2021), C-reactive Protein (Sakaue et al., 2021), IgA Nephritis (Sakaue et al., 2021), Mean Arterial Pressure (Sakaue et al., 2021), Mean Corpuscular Hemoglobin (Sakaue et al., 2021), Pneumonia (Tian et al., 2017), Tonsillectomy (Tian et al., 2017), Plasma Protein Levels of ADGRE2, BTN3A2, CCL21, CCL3, CD1C, CD8A, CDSN, CPVL, DXO, EBI3, EFCAB14, HBEGF, HCG22, IL12A, IL12B, LILRB2, LRP1, LRPAP1, LTB, LY75, MANF, MANSC1, MICB, OSCAR, PLA2G10, PRTN3, SIGLEC10, STAB2, TEK, TNFSF13, ZNRD2 (Krishna et al., 2024)
242.354.68Hepatic Cancer (Sakaue et al., 2021), Plasma Protein Levels of ADAM15, CXCL10, FCRL6, GZMB, MICB, TNFSF8 (Krishna et al., 2024)
1632.234.13Lung Cancer (Squamous Cell Carcinoma) (Ferreiro-Iglesias et al., 2018), Alanine Aminotransferase (Sakaue et al., 2021), TCR Expression (TRAV38-1) (Sharon et al., 2016), TCR α Interaction Probability >50% (Sharon et al., 2016), Plasma Protein Levels of DDR1, GZMA, LILRB1, MICB, NPTX1, SEPTIN3, TEK, WFDC2 (Krishna et al., 2024)
671.963.72Graves’ Disease (Hirata et al., 2019), HIV Set Point Viral Load (Luo et al., 2021), Asthma (Sakaue et al., 2021), Psoriasis (Stuart et al., 2022), Plasma Protein Levels of AMBP, C2, CD160, CD28, CD48, CFB, FCRL1, FCRL6, FRZB, GP1BB, LILRB1, LTA, LY96, NID1, SIGLEC9, SORT1, THBD, TNFRSF4, TNFSF13B, TNXB, TP53BP1, VCAM1 (Krishna et al., 2024)
1521.963.53JIA (Oligoarthritis/RF-negative Polyarthritis) (Hinks et al., 2017), Plasma Protein Levels of LTBR, MICB, PLXNA4, RARRES2 (Krishna et al., 2024)
631.712.94HIV Control (McLaren et al., 2012), Skin Cancer (Sakaue et al., 2021), B2M Plasma Protein Level (Krishna et al., 2024)
991.703.00Plasma Protein Levels of APOM, CRTAM, DXO, IL15, MICB, OSCAR (Krishna et al., 2024)
661.583.41TNFSF11 Plasma Protein Level (Krishna et al., 2024)
691.564.69Parkinson’s Disease (Naito et al., 2021)
741.314.30Chronic Sinusitis (Sakaue et al., 2021)
621.293.89PGLYRP1 Plasma Protein Level (Krishna et al., 2024)
1381.269.18
91.203.36Primary Biliary Cholangitis (Darlay et al., 2018), Systemic Lupus Erythematosus (Molineros et al., 2019), Rheumatoid Arthritis (Raychaudhuri et al., 2012), Hyperthyroidism (Sakaue et al., 2021), Monocyte Count (Sakaue et al., 2021), Serum Creatinine (Sakaue et al., 2021), Psoriasis (Zhou et al., 2016), Plasma Protein Levels of CD1C, CX3CL1, IL12B, LIPF, LTA, MICB, PDCD1, RGMA, SGSH, SLAMF7, TNFRSF8 (Krishna et al., 2024)
811.034.04
Table 1—source data 1

Rapidly-evolving amino acids and their trait and disease associations for all studied genes.

For each gene group, the rapidly-evolving amino acid positions (substitution rate 𝑙𝑜𝑔2 fold change >1) are listed alongside their disease and trait associations. group: gene group; position: amino acid position; evol_rate: substitution rate 𝑙𝑜𝑔2 fold change; min_dist: minimum distance to peptide (Å); disease_assoc: list of disease, trait, and other associations, with citations.

https://cdn.elifesciences.org/articles/103547/elife-103547-table1-data1-v1.txt
Table 2
Data summary for Class I.

Each row represents a species, and each column represents a gene group. Each cell lists the number of alleles included for each gene represented by that gene group. Bolded entries are ‘backbone’ sequences that are included in every group.

CladeSpecies GroupSpeciesLatin NamePref.MHC-A GroupMHC-B GroupMHC-C GroupMHC-E GroupMHC-F GroupMHC-G Group
ApeHumanHumanHomo sapiensHosa63 –A, 1 –A, 4 –H, 1 –H, 2 –Y, 1 –B, 1 –L, 1 –C, 1 –E, 1 –F, 1 –G, 1 –J, 1 –K, 1 –V, 1 –W1 –A, 1 –H, 91 –B, 1 –B, 1 –L, 1 –C, 1 –E, 1 –F, 1 –G, 1 –J, 1 –K, 1 –V, 1 –W1 –A, 1 –H, 1 –B, 1 –L, 90 –C, 1 –C, 1 –E, 1 –F, 1 –G, 1 –J, 1 –K, 1 –V, 1 –W1 –A, 1 –H, 1 –B, 1 –L, 1 –C, 15 –E, 1 –E, 1 –F, 1 –G, 1 –J, 1 –K, 1 –V, 1 –W1 –A, 1 –H, 1 –B, 1 –L, 1 –C, 1 –E, 10 –F, 1 –F, 1 –G, 1 –J, 1 –K, 1 –V, 1 –W1 –A, 1 –H, 1 –B, 1 –L, 1 –C, 1 –E, 1 –F, 17 –G, 1 –G, 1 –J, 1 –K, 1 –V, 1 –W
ChimpanzeeBonoboPan paniscusPapa10 –A, 1 –H26 –B11 –C1 –E1 –F2 –G
ChimpanzeePan troglodytesPatr30 –A, 3 –A/AL/OKO, 1 –H48 –B28 –C2 –E3 –F1 –G
GorillaEastern gorillaGorilla beringeiGobe1 –A/AL/OKO1 –B1 –C
Western gorillaGorilla gorillaGogo4 –A, 4 –A/AL/OKO, 1 –H, 3 –Y10 –B, 3 –B8 –C2 –E3 –F2 –G
OrangutanSumatran orangutanPongo abeliiPoab5 –A/AL/OKO, 1 –H12 –B2 –C1 –E1 –F1 –G
Bornean orangutanPongo pygmaeusPopy12 –A/AL/OKO, 1 –H, 7 -Ap20 –B5 –C1 –E1 –F
GibbonLar gibbonHylobates larHyla2 –A1 –B
Silvery gibbonHylobates molochHymo1 –A1 unknown1 –E1 –F
Northern white-cheeked gibbonNomascus leucogenysNole1 –A2 unknown1 –E1 –F
OWMBaboonOlive baboonPapio anubisPaan1 –A, 1 –AG1 –B1 –E3 –F1 –AG
Hamadryas baboonPapio hamadryasPaha2 –B
Yellow baboonPapio cynocephalusPacy1 –E
GeladaGeladaTheropithecus geladaThge1 –A, 1 –AG1 –E1 –F1 –G
MangabeySooty mangabeyCercocebus atysCeat3 –A, 1 –AG1 –B, 1 –I5 –E5 –F
DrillDrillMandrillus leucophaeusMale1 –E1 –F2 –G
MacaqueCrab-eating macaqueMacaca fascicularisMafa1 –L, 1 –V, 1 –W, 1 –A81 –B,
1 –L, 1 –V, 1 –W
1 –L, 1 –V, 1 –W1 –L, 17 –E, 1 –V, 1 –W1 –L, 24 –F, 1 –V, 1 –W1 –L, 9 –G, 1 –V, 1 –W
Rhesus macaqueMacaca mulattaMamu8 –A, 1 –A, 5 –AG, 1 –AG, 1 –B, 1 –I, 1 –E, 1 –F, 1 –G, 1 –J, 1 –K1 –A, 1 –AG, 9 –B, 1 –B, 1 –I, 1 –E, 1 –F, 1 –G, 1 –J, 1 –K1 –A, 1 –AG, 1 –B, 1 –I, 1 –E, 1 –F, 1 –G, 1 –J, 1 –K1 –A, 1 –AG, 1 –B, 1 –I, 30 –E, 1 –E, 1 –F, 1 –G, 1 –J, 1 –K1 –A, 1 –AG, 1 –B, 1 –I, 1 –E, 18–F, 1 –F, 1 –G, 1 –J, 1 –K1 –A, 5 –AG, 1 –AG, 1 –B, 1 –I, 1 –E, 1 –F, 4 –G, 1 –G, 1 –J, 1 –K
Stump-tailed macaqueMacaca arctoidesMaar1 –B
Assam macaqueMacaca assamensisMaas1 –B
Northern pig-tailed macaqueMacaca leoninaMalo1 –B
Southern pig-tailed macaqueMacaca nemestrinaMane2 –B10 –E7 –F
Tibetan macaqueMacaca thibetanaMath1 –B1 –E1 –F1 –G
GrivetGrivetChlorocebus aethiopsChae2 –G
Vervet MonkeyVervet monkeyChlorocebus pygerythrusChpy2 –B
Green MonkeyGreen monkeyChlorocebus sabaeusChsa1 –A, 1 –AG, 1 –A81 –B5 –E1 –F1 –AG
GuenonBlue monkeyCercopithecus mitisCemi2 –B
ColobusAngola colobusColobus angolensisCoan1 –AG2 –E1 –F
Ugandan red colobusPiliocolobus tephroscelesPite1 –E1 –F1 –G
LangurFrancois’ langurTrachypithecus francoisiTrfr1 –A, 1 –AG1 –E1 –F1 –G
Snub-Nosed MonkeyGolden snub-nosed monkeyRhinopithecus roxellanaRhro1 –A, 1 –AG1 –E1 –G
Black-and-white snub-nosed monkeyRhinopithecus bietiRhbi1 –E
NWMTamarinCotton-top tamarinSaguinus oedipusSaoe1 –E4 –F9 –G, 1 –PS, 3 –N1/3/4
Brown-mantled tamarinLeontocebus fuscicollisLefu5 –G
Golden lion tamarinLeontopithecus rosaliaLero2 –G
White-lipped tamarinSaguinus labiatusSala9 –G
MarmosetCommon marmosetCallithrix jacchusCaja1 –B, 1 –E, 1 –F, 1 –G8 –B, 1 –B, 1 –E, 1 –F, 1 –G1 –B, 1 –E, 1 –F, 1 –G1 –B, 2 –E, 1 –E, 1 –F, 1 –G1 –B, 1 –E, 17 –F, 1 –F, 1 –G1 –B, 1 –E, 1 –F, 76 –G, 1 –G, 1 –PS
Night MonkeyThree-striped night monkeyAotus trivirgatusAotr1 –E3 –G, 1 –PS
Gray-bellied night monkeyAotus lemurinusAole5 –F
Nancy Ma’s night monkeyAotus nancymaaeAona3 –F1 –B, 7 –G
CapuchinPanamanian white-faced capuchinCebus imitatorCeim1 –E1 –F6 –G, 1 unknown
Tufted capuchinSapajus apellaSaap1 –E1 –F4 –G, 2 unknown
Squirrel MonkeyBlack-capped squirrel monkeySaimiri boliviensisSabo3 –E2 –F1 –B, 3 –G, 1 unknown
Common squirrel monkeySaimiri sciureusSasc1 –G
Spider MonkeyWhite-bellied spider monkeyAteles belzebuthAtbe1 –B1 –E3 –G
Black-headed spider monkeyAteles fuscicepsAtfu1 –B2 –E9 –G
SakiWhite-faced sakiPithecia pitheciaPipi1 –B1 –E4 –G
TarsierTarsierPhilippine tarsierCarlito syrichtaCasy1 unknown1 unknown1 unknown1 unknown1 unknown1 unknown
StrepsirrhiniLemurRing-tailed lemurLemur cattaLeca1 unknown1 unknown1 unknown1 unknown1 unknown1 unknown
Table 3
Data summary for Class IIA.

Each row represents a species, and each column represents a gene group. Each cell lists the number of alleles included for each gene represented by that gene group. Bolded entries are ‘backbone’ sequences that are included in every group.

CladeSpecies GroupSpeciesLatin NamePref.MHC-DPA GroupMHC-DQA GroupMHC-DRA GroupMHC-DMA GroupMHC-DOA Group
ApeHumanHumanHomo sapiensHosa22 –DPA, 2 –DPA, 2 –DQA, 1 –DRA, 1 –DMA, 1 –DOA2 –DPA, 22 –DQA, 2 –DQA, 1 –DRA, 1 –DMA, 1 –DOA2 –DPA, 2 –DQA, 4–DRA, 1 –DRA, 1 –DMA, 1 –DOA2 –DPA, 2 –DQA, 1 –DRA, 8–DMA, 1 –DMA, 1 –DOA2 –DPA, 2 –DQA, 1 –DRA, 1 –DMA, 14–DOA, 1–DOA
ChimpanzeeBonoboPan paniscusPapa1 –DPA2 –DQA1 –DRA1 –DMA1 –DOA
ChimpanzeePan troglodytesPatr5 –DPA6 –DQA3 –DRA1 –DMA1 –DOA
GorillaWestern gorillaGorilla gorillaGogo3 –DPA10 –DQA1 –DRA1 –DMA1 –DOA
OrangutanSumatran orangutanPongo abeliiPoab4 –DPA6 –DQA4 –DRA1 –DMA1 –DOA
Bornean orangutanPongo pygmaeusPopy4 –DPA4 –DQA2 –DRA
GibbonSilvery gibbonHylobates molochHymo1 –DPA3 –DQA1 –DRA1 –DMA1 –DOA
Northern white-cheeked gibbonNomascus leucogenysNole1 –DPA2 –DQA1 –DRA1 –DMA1 –DOA
Lar gibbonHylobates larHyla6 –DQA
OWMBaboonOlive baboonPapio anubisPaan13 –DPA8 –DQA3 –DRA1 –DMA1 –DOA
Hamadryas baboonPapio hamadryasPaha1 –DPA3 –DQA
Yellow baboonPapio cynocephalusPacy7 –DQA
Guinea baboonPapio papioPapp4 –DQA
GeladaGeladaTheropithecus geladaThge2 –DPA3 –DQA1 –DRA1 –DMA1 –DOA
MangabeySooty mangabeyCercocebus atysCeat2 –DPA2 –DQA1 –DRA1 –DMA1 –DOA
Black crested mangabeyLophocebus aterrimusLoat1 –DQA
DrillDrillMandrillus leucophaeusMale1 –DPA2 –DQA1 –DRA1 –DMA1 –DOA
MacaqueCrab-eating macaqueMacaca fascicularisMafa30 –DPA, 1 –DMA, 1 –DOA11 –DQA, 1 –DMA, 1 –DOA16 –DRA, 1 –DMA, 1 –DOA7 –DMA, 1 –DMA, 1 –DOA1 –DMA, 6 –DOA, 1 –DOA
Northern pig-tailed macaqueMacaca leoninaMalo6 –DPA8 –DQA5 –DRA2 –DMA7 –DOA
Rhesus macaqueMacaca mulattaMamu22 –DPA, 1 –DPA, 2 –DQA, 1 –DRA1 –DPA, 9 –DQA, 2 –DQA, 1 –DRA1 –DPA, 2 –DQA, 12 –DRA, 1 –DRA1 –DPA, 2 –DQA, 1 –DRA, 4 –DMA1 –DPA, 2 –DQA, 1 –DRA, 1 –DOA
Southern pig-tailed macaqueMacaca nemestrinaMane14 –DPA10 –DQA11 –DRA1 –DMA1 –DOA
Tibetan macaqueMacaca thibetanaMath7 –DPA1 –DQA1 –DRA10 –DMA
Stump-tailed macaqueMacaca arctoidesMaar1 –DPA2 –DQA
GrivetGrivetChlorocebus aethiopsChae6 –DQA
Green MonkeyGreen monkeyChlorocebus sabaeusChsa5 –DPA2 –DQA1 –DRA1 –DOA
GuenonBlue monkeyCercopithecus mitisCemi5 –DQA
De Brazza’s monkeyCercopithecus neglectusCene2 –DQA
ColobusAngola colobusColobus angolensisCoan2 –DQA1 –DRA1 –DMA1 –DOA
Ugandan red colobusPiliocolobus tephroscelesPite1 –DPA1 –DQA1 –DRA1 –DMA1 –DOA
Mantled guerezaColobus guerezaCogu1 –DQA
LangurFrancois’ langurTrachypithecus francoisiTrfr1 –DPA1 –DQA1 –DRA1 –DMA1 –DOA
Snub-Nosed MonkeyBlack-and-white snub-nosed monkeyRhinopithecus bietiRhbi1 –DRA1 –DMA1 –DOA
Golden snub-nosed monkeyRhinopithecus roxellanaRhro1 –DPA1 –DQA1 –DRA1 –DMA1 –DOA
NWMTamarinCotton-top tamarinSaguinus oedipusSaoe4 –DQA
MarmosetCommon marmosetCallithrix jacchusCaja1 –DPA, 2 –DQA, 1 –DMA, 1 –DOA1 –DPA, 6 –DQA, 2 –DQA, 1 –DMA, 1 –DOA1 –DPA, 2 –DQA, 1 –DRA, 1 –DMA, 1 –DOA1 –DPA, 2 –DQA, 1 –DMA, 1 –DOA1 –DPA, 2 –DQA, 1 –DMA, 1 –DOA
Night MonkeyNancy Ma’s night monkeyAotus nancymaaeAona1 –DPA, 1 –DRA6 –DQA, 1 –DRA2 –DRA, 1 –DRA1 –DRA, 1 –DMA1 –DRA, 1 –DOA
Gray-bellied night monkeyAotus lemurinusAole3 –DQA
Spix’s night monkeyAotus vociferansAovo3 –DRA
CapuchinPanamanian white-faced capuchinCebus imitatorCeim1 –DPA3 –DQA1 –DRA1 –DMA1 –DOA
Tufted capuchinSapajus apellaSaap1 –DPA3 –DQA1 –DRA1 –DMA1 –DOA
Squirrel MonkeyBlack-capped squirrel monkeySaimiri boliviensisSabo2 –DQA1 –DRA1 –DMA1 –DOA
Common squirrel monkeySaimiri sciureusSasc3 –DPA
TarsierTarsierPhilippine tarsierCarlito syrichtaCasy3 –DPA2 –DQA1 –DRA1 –DMA1 –DOA
StrepsirrhiniLemurRing-tailed lemurLemur cattaLeca1 –DPA, 1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA1 –DPA, 1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA1 –DPA, 1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA1 –DPA, 1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA1 –DPA, 1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA
Gray mouse lemurMicrocebus murinusMimu1 –DPA1 –DQA1 –DRA1 –DMA1 –DOA
LorisSunda slow lorisNycticebus coucangNyco1 –DPA2 –DRA1 –DMA1 –DOA
GalagoNorthern greater galagoOtolemur garnettiiOtga1 –DPA1 –DQA1 –DRA1 –DMA1 –DOA
SifakaCoquerel’s sifakaPropithecus coquereliPrco2 –DPA1 –DQA1 –DRA1 –DMA1 –DOA
Flying LemurFlying LemurSunda flying lemurGaleopterus variegatusGava2 –DPA1 –DRA1 –DMA1 –DOA
Tree ShrewTree ShrewChinese tree shrewTupaia chinensisTuch4 –DPA1 –DQA1 –DRA1 –DMA1 –DOA
GliresRodentGroundhogMarmota monaxMamo1 –DPA, 1 –DPA1 –DPA1 –DPA, 1 –DRA1 –DPA, 1 –DMA1 –DPA, 1 –DOA
Brown ratRattus norvegicusRano1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA2 –DQA, 1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA1 –DQA, 2 –DRA, 1 –DRA, 1 –DMA, 1 –DOA1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA1 –DQA, 1 –DRA, 1 –DMA, 1–DOA, 1–DOA
PikaPlateau pikaOchotona curzoniaeOccu2 –DPA2 –DQA1 –DRA1 –DMA1 –DOA
American pikaOchotona princepsOcpr2 –DPA1 –DQA1 –DRA1 –DMA1 –DOA
LaurasiatheriaArtiodactylaBactrian camelCamelus bactrianusCaba1 –DPA1 –DQA1 –DMA1 –DOA
Wild boarSus scrofaSLA1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA1 –DQA, 1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA1 –DQA, 3 –DRA, 1 –DRA, 1 –DMA, 1 –DOA1 –DQA, 1 –DRA, 4 –DMA, 1 –DMA, 1 –DOA1 –DQA, 1 –DRA, 1 –DMA, 1 –DOA, 1 –DOA
Even-toed ungulatesBos sp.BoLA2 –DQA
Domestic yakBos grunniensBogr2 –DQA
Water buffaloBubalus bubalisBubu2 –DQA
SheepOvis ariesOvar4 –DQA3 –DRA
Dromedary camelCamelus dromedariusCadr1 –DRA
Bighorn sheepOvis canadensisOvca3 –DRA
FerungulataSea otterEnhydra lutrisEnlu1 –DQA1 –DRA1 –DMA1 –DOA
CatFelis catusFeca1 –DRA1 –DMA1 –DOA
Sunda pangolinManis javanicaMaja1 –DQA1 –DRA1 –DMA1 –DOA
CougarPuma concolorPuco1 –DPA1 –DMA1 –DOA
JaguarundiPuma yagouaroundiPuya1 –DPA1 –DRA1 –DMA1 –DOA
Steller sea lionEumetopias jubatusEuju1 –DPA1 –DQA
HorseEquus caballusEqca5 –DQA3 –DRA
BatBig brown batEptesicus fuscusEpfu1 –DPA2 –DQA1 –DRA1 –DMA
Kuhl’s pipistrellePipistrellus kuhliiPiku1 –DPA1 –DQA1 –DRA1 –DMA
Large flying foxPteropus vampyrusPtva2 –DQA1 –DRA1 –DOA
MoleStar-nosed moleCondylura cristataCocr3 –DQA1 –DOA
AtlantogenataXenarthraLinnaeus’s two-toed slothCholoepus didactylusChdi1 –DPA1 –DQA1 –DRA1 –DMA1 –DOA
Nine-banded armadilloDasypus novemcinctusDano1 –DPA2 –DQA1 –DRA1 –DMA1 –DOA
AfrotheriaCape golden moleChrysochloris asiaticaChas1 –DPA1 –DRA1 –DMA1 –DOA
Cape elephant shrewElephantulus edwardiiEled1 –DPA1 –DPA, 1 –DQA1 –DPA, 1 –DRA1 –DPA, 1 –DMA1 –DPA, 1 –DOA
AardvarkOrycteropus aferOraf1 –DPA1 –DQA1 –DMA1 –DOA
West Indian manateeTrichechus manatusTrma2 –DPA1 –DQA1 –DRA1 –DMA
Lesser hedgehog tenrecEchinops telfairiEcte1 –DRA1 –DOA
Table 4
Data summary for Class IIB.

Each row represents a species, and each column represents a gene group. Each cell lists the number of alleles included for each gene represented by that gene group. Bolded entries are ‘backbone’ sequences that are included in every group.

CladeSpecies GroupSpeciesLatin NamePref.MHC-DPB GroupMHC-DQB GroupMHC-DRB GroupMHC-DMB GroupMHC-DOB Group
ApeHumanHumanHomo sapiensHosa74 –DPB, 2 –DPB, 2 –DQB, 9 –DRB, 1 –DMB, 1 –DOB2 –DPB, 24 –DQB, 2 –DQB, 9 –DRB, 1 –DMB, 1 –DOB2 –DPB, 2 –DQB, 46 –DRB, 9 –DRB, 1 –DMB, 1 –DOB2 –DPB, 2 –DQB, 9 –DRB, 6 –DMB, 1 –DMB, 1 –DOB2 –DPB, 2 –DQB, 9 –DRB, 1 –DMB, 14 –DOB, 1 –DOB
ChimpanzeeBonoboPan paniscusPapa8 –DPB2 –DQB5 –DRB2 –DMB1 –DOB
ChimpanzeePan troglodytesPatr6 –DPB9 –DQB17 –DRB2 –DMB2 –DOB
GorillaWestern gorillaGorilla gorillaGogo5 –DPB10 –DQB7 –DRB2 –DMB1 –DOB
OrangutanSumatran orangutanPongo abeliiPoab5 –DPB6 –DQB7 –DRB1 –DMB1 –DOB
Bornean orangutanPongo pygmaeusPopy5 –DPB3 –DQB7 –DRB3 –DMB
GibbonSilvery gibbonHylobates molochHymo1 –DPB2 –DQB5 –DRB1 –DMB1 –DOB
Northern white-cheeked gibbonNomascus leucogenysNole1 –DPB2 –DQB1 –DMB1 –DOB
Lar gibbonHylobates larHyla4 –DQB
OWMBaboonOlive baboonPapio anubisPaan5 –DPB5 –DQB11 –DRB1 –DMB
Hamadryas baboonPapio hamadryasPaha3 –DQB1 –DRB
Chacma baboonPapio ursinusPaur8 –DRB
GeladaGeladaTheropithecus geladaThge2 –DPB1 –DQB3 –DRB1 –DMB1 –DOB
MangabeySooty mangabeyCercocebus atysCeat3 –DPB2 –DQB1 –DRB1 –DMB1 –DOB
DrillDrillMandrillus leucophaeusMale1 –DPB1 –DRB1 –DMB1 –DOB
MandrillMandrillMandrillus sphinxMasp10 –DRB
MacaqueCrab-eating macaqueMacaca fascicularisMafa7 –DPB, 2 –DRB, 1 –DMB, 1 –DOB5 –DQB, 2 –DRB, 1 –DMB, 1 –DOB6 –DRB, 2 –DRB, 1 –DMB, 1 –DOB2 –DRB, 4 –DMB, 1 –DMB, 1 –DOB2 –DRB, 1 –DMB, 6 –DOB, 1 –DOB
Northern pig-tailed macaqueMacaca leoninaMalo5 –DPB5 –DQB3 –DRB2 –DMB3 –DOB
Rhesus macaqueMacaca mulattaMamu5 –DPB, 2 –DPB, 1 –DQB, 4 –DRB2 –DPB, 4 –DQB, 1 –DQB, 4 –DRB2 –DPB, 1 –DQB, 10 –DRB, 4 –DRB2 –DPB, 1 –DQB, 4 –DRB, 5 –DMB2 –DPB, 1 –DQB, 4 –DRB, 1 –DOB
Southern pig-tailed macaqueMacaca nemestrinaMane5 –DPB5 –DQB6 –DRB1 –DMB1 –DOB
Tibetan macaqueMacaca thibetanaMath13 –DPB6 –DQB1 –DRB1 –DMB1 –DOB
Stump-tailed macaqueMacaca arctoidesMaar5 –DQB2 –DRB
Japanese macaqueMacaca fuscataMafu3 –DRB
Lion-tailed macaqueMacaca silenusMasi3 –DRB
GrivetGrivetChlorocebus aethiopsChae3 –DQB6 –DRB
Green MonkeyGreen monkeyChlorocebus sabaeusChsa3 –DPB4 –DQB7 –DRB1 –DMB1 –DOB
ColobusAngola colobusColobus angolensisCoan2 –DPB2 –DQB1 –DMB1 –DOB
Ugandan red colobusPiliocolobus tephroscelesPite1 –DPB1 –DQB3 –DRB1 –DMB1 –DOB
LangurFrancois’ langurTrachypithecus francoisiTrfr2 –DPB1 –DQB3 –DRB1 –DMB1 –DOB
Gray langurSemnopithecus entellusSeen1 –DPB
Snub-Nosed MonkeyBlack-and-white snub-nosed monkeyRhinopithecus bietiRhbi1 –DMB1 –DOB
Golden snub-nosed monkeyRhinopithecus roxellanaRhro1 –DPB1 –DQB2 –DRB1 –DMB1 –DOB
NWMTamarinCotton-top tamarinSaguinus oedipusSaoe3 –DPB4 –DQB8 –DRB
White-lipped tamarinSaguinus labiatusSala3 –DRB
MarmosetCommon marmosetCallithrix jacchusCaja1 –DPB, 1 –DPB, 2 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DPB, 3 –DQB, 2 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DPB, 2 –DQB, 3 –DRB, 1 –DRB, 1 –DMB, 1 –DOB1 –DPB, 2 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DPB, 2 –DQB, 1 –DRB, 1 –DMB, 1 –DOB
Night MonkeyNancy Ma’s night monkeyAotus nancymaaeAona4 –DPB3 –DQB6 –DRB1 –DMB1 –DOB
Gray-bellied night monkeyAotus lemurinusAole3 –DPB1 –DQB
Azara’s night monkeyAotus azaraeAoaz2 –DRB
Black-headed night monkeyAotus nigricepsAoni3 –DRB
Three-striped night monkeyAotus trivirgatusAotr3 –DRB
Spix’s night monkeyAotus vociferansAovo3 –DRB
CapuchinPanamanian white-faced capuchinCebus imitatorCeim1 –DPB3 –DQB1 –DRB1 –DMB1 –DOB
Tufted capuchinSapajus apellaSaap1 –DPB4 –DQB3 –DRB1 –DMB
Squirrel MonkeyBlack-capped squirrel monkeySaimiri boliviensisSabo1 –DPB3 –DQB1 –DMB1 –DOB
Common squirrel monkeySaimiri sciureusSasc2 –DRB
Spider MonkeyWhite-bellied spider monkeyAteles belzebuthAtbe2 –DRB
Howler MonkeyGuatemalan black howlerAlouatta pittaAlpi2 –DRB
SakiWhite-faced sakiPithecia pitheciaPipi3 –DRB
TarsierTarsierPhilippine tarsierCarlito syrichtaCasy2 –DPB1 –DQB2 –DRB1 –DMB1 –DOB
StrepsirrhiniLemurRing-tailed lemurLemur cattaLeca1 –DPB, 1 –DPB, 1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DPB, 1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DPB, 1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DPB, 1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DPB, 1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB
Gray mouse lemurMicrocebus murinusMimu1 –DPB3 –DQB2 –DRB1 –DMB1 –DOB
LorisSunda slow lorisNycticebus coucangNyco1 –DPB
GalagoNorthern greater galagoOtolemur garnettiiOtga1 –DPB1 –DQB1 –DRB1 –DMB1 –DOB
SifakaCoquerel’s sifakaPropithecus coquereliPrco1 –DPB1 –DQB1 –DMB1 –DOB
Flying LemurFlying LemurSunda flying lemurGaleopterus variegatusGava2 –DPB2 –DRB1 –DMB1 –DOB
Tree ShrewTree ShrewChinese tree shrewTupaia chinensisTuch1 –DPB1 –DMB1 –DOB
GliresRodentGroundhogMarmota monaxMamo1 –DPB, 1 –DPB1 –DPB1 –DPB1 –DPB, 1–DMB1 –DPB
Brown ratRattus norvegicusRano1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB2 –DQB, 1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DQB, 3 –DRB, 1 –DRB, 1 –DMB, 1 –DOB1 –DQB, 1 –DRB, 1 –DMB, 1 –DMB, 1 –DOB1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB
PikaAmerican pikaOchotona princepsOcpr1 –DQB1 –DRB1 –DMB
Plateau pikaOchotona curzoniaeOccu1 –DPB2 –DQB1 –DOB
LaurasiatheriaArtiodactylaBactrian camelCamelus bactrianusCaba1 –DMB1 –DOB
Dromedary camelCamelus dromedariusCadr1 –DPB1 –DPB1 –DPB1 –DPB1 –DPB
Wild boarSus scrofaSLA1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB2 –DQB, 1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DQB, 4 –DRB, 1 –DRB, 1 –DMB, 1 –DOB1 –DQB, 1 –DRB, 1 –DMB, 1 –DOB1 –DQB, 1 –DRB, 1 –DMB, 2 –DOB, 1 –DOB
Even-toed ungulatesBos sp.BoLA1 –DQB2 –DRB
Water buffaloBubalus bubalisBubu1 –DQB
Wild Bactrian camelCamelus ferusCafe1 –DQB
SheepOvis ariesOvar6 –DQB1 –DRB
GoatCapra hircusCahi2 –DRB
Bighorn sheepOvis canadensisOvca1 –DRB
FerungulataHorseEquus caballusEqca6 –DQB3 –DRB5 –DMB3 –DOB
Sunda pangolinManis javanicaMaja1 –DMB1 –DOB
CougarPuma concolorPuco1 –DPB1 –DMB
JaguarundiPuma yagouaroundiPuya1 –DMB
Northern elephant sealMirounga angustirostrisMian1 –DRB1 –DOB
Sea otterEnhydra lutrisEnlu1 –DPB1 –DQB
Steller sea lionEumetopias jubatusEuju1 –DQB
BatBig brown batEptesicus fuscusEpfu2 –DQB1 –DRB1 –DMB
Large flying foxPteropus vampyrusPtva1 –DQB1 –DOB
Kuhl’s pipistrellePipistrellus kuhliiPiku1 –DPB1 –DQB
MoleStar-nosed moleCondylura cristataCocr2 –DQB1 –DRB1 –DMB1 –DOB
AtlantogenataXenarthraLinnaeus’s two-toed slothCholoepus didactylusChdi2 –DPB1 –DQB1 –DMB1 –DOB
Nine-banded armadilloDasypus novemcinctusDano1 –DPB
AfrotheriaAardvarkOrycteropus aferOraf1 –DQB1 –DOB
Cape elephant shrewElephantulus edwardiiEled1 –DPB
West Indian manateeTrichechus manatusTrma1 –DPB
Lesser hedgehog tenrecEchinops telfairiEcte1 –DRB
Table 5
Structures used to calculate distances to peptide.

This table lists the Protein Data Bank (Berman et al., 2000) structure codes and references for all structures used to calculate peptide distances.

GeneStruct.Reference
MHC-A1ZVSChu et al., 2007
3JTTDai et al., 2010
3OX8Liu et al., 2011
3OXRLiu et al., 2011
3OXSLiu et al., 2011
3RL2Zhang et al., 2011
4HX1Niu et al., 2013
6J1VZhu et al., 2019
6J1WZhu et al., 2019
6MPPFlores-Solis et al., 2019
6PBHvan de Sandt et al., 2019
7SR0Finton et al., 2023
7SRKFinton et al., 2023
7WT5Asa et al., 2022
8I5CLu et al., 2023
MHC-B1JGDHillig et al., 2004
3BVNKumar et al., 2009
3KPLMacdonald et al., 2009
3KPNMacdonald et al., 2009
3LN4Bade-Doding et al., 2011
3LN5Bade-Doding et al., 2011
3RWJWu et al., 2011
3W39Yagita et al., 2013
3X13Saunders et al., 2015
4JQVRist et al., 2013
4JRYLiu et al., 2013
4MJIMotozono et al., 2014
4O2ESun et al., 2014
4PRALiu et al., 2014
4PRBLiu et al., 2014
5EO0Du et al., 2016
5IEKAlpizar et al., 2016
5VUDIlling et al., 2018
5VVPIlling et al., 2018
5VWFIlling et al., 2018
6IWGYamamoto et al., 2019
6MTMGrant et al., 2018
6PYLLim Kam Sian et al., 2019
6PYVLim Kam Sian et al., 2019
6UZPSchutte et al., 2020
6VIUSchutte et al., 2020
6Y27Loll et al., 2020
7R7VLi et al., 2023
7T0LVivian and Rossjohn, 2022
7TUCJiang et al., 2022a
7X1 CHuan et al., 2023
7YG3Jiang et al., 2022b
MHC-C4NT6Choo et al., 2014
5VGDKaur et al., 2017
5VGEKaur et al., 2017
5W67Mobbs et al., 2017
6PAGMoradi et al., 2021
7WJ3Asa et al., 2022
MHC-E2ESVHoare et al., 2006
3CDGPetrie et al., 2008
5W1VSullivan et al., 2017
7P49Walters et al., 2022
7P4BWalters et al., 2022
MHC-F5IUEDulberger et al., 2017
MHC-G1YDPClements et al., 2005
2DYPShiroishi et al., 2006
3KYNWalpole et al., 2010
MHC-DM1HDMMosyak et al., 1998
2BC4Nicholson et al., 2006
4FQXPos et al., 2012
4GBXPos et al., 2012
4I0PGuce et al., 2013
MHC-DO4I0PGuce et al., 2013
MHC-DP3LQZDai et al., 2010
3WEXKusano et al., 2014
7T2ACiacchi et al., 2023
7T6IKlobuch et al., 2022
7ZAKRacle et al., 2023
MHC-DQ2NNAHenderson et al., 2007
4D8PTollefsen et al., 2012
5KSAPetersen et al., 2016
5KSUNguyen et al., 2017
6DIGJiang et al., 2019
6PX6Ting et al., 2020
MHC-DR1BX2Smith et al., 1998
1FV1Li et al., 2000
1H15Lang et al., 2002
1T5XZavala-Ruiz et al., 2004
2Q6WParry et al., 2007
3C5JDai et al., 2008
4FQXPos et al., 2012
4H1LYin et al., 2012
5JLZGerstner et al., 2016
5V4MOoi et al., 2017
6ATFScally et al., 2017
8EUQKassardjian et al., 2023

Additional files

MDAR checklist
https://cdn.elifesciences.org/articles/103547/elife-103547-mdarchecklist1-v1.pdf
Source code 1

This zip file contains all xml files we used to run BEAST2 with SubstBMA on each gene group/exon alignment.

https://cdn.elifesciences.org/articles/103547/elife-103547-code1-v1.zip
Supplementary file 1

This zip file contains lists of IPD database allele names or RefSeq accession numbers for the sequences we analyzed in each gene group.

https://cdn.elifesciences.org/articles/103547/elife-103547-supp1-v1.zip

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Alyssa Lyn Fortier
  2. Jonathan K Pritchard
(2025)
Ancient trans-species polymorphism at the Major Histocompatibility Complex in primates
eLife 14:RP103547.
https://doi.org/10.7554/eLife.103547.3