A. EP for non-synonymous SNPs binned by allele frequency. Both alleles of each SNP are included. Each bin includes a 95% confidence interval on the mean. Sites with higher EP are found at a higher frequency on average while sites with lower EP are found at lower frequencies.

B. Mean derived-allele frequency binned by ΔEP values. Each bin includes a 95% confidence interval on the mean. Dotted line represents the average frequency of a neutral (non-coding, non-regulatory) site. Higher positive ΔEP bins have a higher frequency on average as expected if these sites are beneficial.

C. EP calculation and age estimation targets for GEVA and tc for a hypothetical site with three copies of the derived allele in a sample of 10 genomes.

ΔEP measures for fixed and polymorphic alleles.

A. Allele age estimates using GEVA by allele frequency, with each frequency bin holding 75,000 neutral sites.

B. Age rank (GEVA) as a function of ΔEP. Age rank for each derived allele was the rank position of the GEVA estimate in a list of all GEVA ages for neutral alleles with frequency matched derived alleles.

C. Same as B, but for tc.

A. Mean recombination rate per base per generation as a function of ΔEP for fixed and segregating alleles.

B. Figurative example of the frequency trajectory of an allele under the staggered sweep (SS) or diploid fisher’s geometric (DFG) model. Both begin with a phase of rising frequency (A) towards a period of equilibrium (B) caused by heterozygote advantage when homozygous genotypes are disfavored, either due to recessive deleterious linked variation (SS) or an overshooting of the optimal phenotype (DFG). Under DFG, variants are ultimately replaced by new mutations that are simply favored. Under SS, alleles eventually cross over onto chromosomes without linked deleterious alleles, and then rise to fixation (C).