(A) Map of Cabo Verde islands and sample sizes for number of individuals from each island region. (B) The distribution of West African-related local ancestry proportion across the genome by SNP (n = …
The mean is indicated by the solid horizontal line, and dashed horizontal lines represent three standard deviations from the mean. Again, this plot demonstrates Duffy-null (red dot) as the highest …
*indicates significant p-value <0.001 for binomial test (see Table 1 for sample sizes and further details).
(A) The distribution of West African (purple) and European (green) ancestry tract lengths spanning the DARC locus (dashed line). Each horizontal line represents a single chromosome in the population …
Solid gray lines indicate mean windowed standardized iDAT score for each island (Fogo, 0.006; NW Cluster, −0.024) and dashed gray lines indicate three standard deviations from the mean. Vertical …
Summary statistics were calculated from a random sample of 172 individuals from each simulated population, matching the number of individuals from Santiago included in our analyses. High population …
Simulations shown assumed a single pulse of admixture with exponential growth at a rate of 0.05 per generation and an initial population size of N = 10,000. Initial admixture contributions were …
Each plot corresponds to number of generations since admixture (10 – left; 100 – middle; 1000 – right). Line and point colors correspond to source population one admixture contribution at (gray), …
Line and point colors correspond to simulated human chromosome and corresponding size (chr 1 – green; chr 7 – blue; chr 15 – yellow; chr 22 – gray). X-axis shows DAT cut-off values, and y-axis shows …
iHS was calculated using the hapbin software and standardized using the default method based on allele frequencies. (A) Santiago, (B) Fogo, and (C) NW Cluster. Value for Duffy-null SNP is indicated …
(A) Pairs of and that result in a small difference in final allele frequency calculated under the model and the allele frequency observed in the Santiago genetic data, under a deterministic …
Duffy-null allele was modeled as additive (blue; ), dominant (yellow; in SLiM), or recessive (pink; in SLiM). Posterior median estimates for selection coefficient: , , ; initial …
(A) Selection coefficient (, ) and (B) initial West African admixture contribution (, ).
(A) Inferred (dark gray), simulated (white), and observed (red) mean of global ancestry in Santiago over time. The dark gray histogram plots the posterior distribution for initial West African …
Pink circles indicate West African mean global ancestry after 20 generations versus selection coefficient for whole autosome (22 chromosome) simulations, using a uniform recombination rate within …
With our ancestry-based measures, SWIF(r) achieved an area under the curve (AUC) of 0.966, where an AUC of 1 represents a classifier with perfect skill. Horizontal dashed line indicates the no-skill …
(A) Confusion matrix with threshold P(selection)>0.5. There are no false positives in test set and a high rate of false negatives. (B) Scatterplot of initial admixture contribution vs selection …
Expected Duffy-null frequencies are approximated by mean West African global ancestry proportion for each island, calculated using the admixture software.
Population | n (sampled individuals) | Expected frequency | Observed frequency | Binomial test p-value |
---|---|---|---|---|
Santiago | 172 | 0.737 | 0.834 | 2.193 ×10−5 |
Fogo | 129 | 0.498 | 0.539 | 0.192 |
NW Cluster | 236 | 0.552 | 0.557 | 0.817 |
GWD | 107 | 0.997 | 1.000 | - |
IBS | 107 | 0.002 | 0.019 | - |
Initial population size (N) | Population growth model | Population growth rate (per generation) | Admixture type | Proportion of new migrants (per generation) | Scenario number |
---|---|---|---|---|---|
1000 | Constant size | - | Single-pulse | - | 1 |
Continuous | 0.01 | 2 | |||
Exponential | 0.05 | Single-pulse | - | 3 | |
Continuous | 0.01 | 4 | |||
10,000 | Constant size | - | Single-pulse | - | 5 |
Continuous | 0.01 | 6 | |||
Exponential | 0.05 | Single-pulse | - | 7 | |
Continuous | 0.01 | 8 |
Chromosome 16:46582888–60359576 GO terms.
File containing ENSEMBL gene IDs and associated GO terms for the 10 genes that overlap with region showing extreme iDAT signatures.