Efficient estimation for large-scale linkage disequilibrium patterns of the human genome

  1. Xin Huang
  2. Tian-Neng Zhu
  3. Ying-Chao Liu
  4. Guo-An Qi
  5. Jian-Nan Zhang
  6. Guo-Bo Chen  Is a corresponding author
  1. Institute of Bioinformatics, Zhejiang University, China
  2. Center for General Practice Medicine, Department of General Practice Medicine, Zhejiang Provincial People’s Hospital, People’s Hospital of Hangzhou Medical College, China
  3. Center for Reproductive Medicine, Department of Genetic and Genomic Medicine, and Clinical Research Institute, Zhejiang Provincial People’s Hospital, People’s Hospital of Hangzhou Medical College, China
  4. Hainan Institute of Zhejiang University, China
  5. Alibaba Group, China
  6. Key Laboratory of Endocrine Gland Diseases of Zhejiang Province, China
7 figures, 5 tables and 2 additional files

Figures

Schematic illustration for large-scale linkage disequilibrium (LD) analysis as exampled for CONVERGE cohort.

(A) The 22 human autosomes have consequently 22 l^i and 231 l^ij , without (left) and with (right) scaling transformation; Scaling transformation is given in Equation 8. (B) If zoom into chromosome 2 of 420,946 single-nucleotide polymorphisms (SNPs), a chromosome of relative neutrality is expected to have self-similarity structure that harbors many approximately strong l^u along the diagonal, and relatively weak l^uv off-diagonally. Here chromosome 2 of CONVERGE has been split into 1000 blocks and yielded 1000 l^u LD grids, and 499,500 l^uv LD grids. (C) An illustration of the construction process for the LD-decay regression model.

Figure 2 with 2 supplements
Reconciliation for linkage disequilibrium (LD) estimators in the 26 cohorts of 1KG.

(A) Consistency examination for the 26 1KG cohorts for their l^i and l^ij estimated by X-LD and PLINK (--r2). In each figure, the 22 l^i fitting line is in purple, whereas the 231 l^ij fitting line is in green. The gray solid line, y=1n+x, in which n the sample size of each cohort, represents the expected fit between PLINK and X-LD estimates, and the two estimated regression models at the top-right corner of each plot show this consistency. The sample size of each cohort is in parentheses. (B) Distribution of R2 of l^i and l^ij fitting lines is based on X-LD and PLINK algorithms in the 26 cohorts; R2 represents variation explained by the fitted model. 26 1KG cohorts: MSL (Mende in Sierra Leone), GWD (Gambian in Western Division, The Gambia), YRI (Yoruba in Ibadan, Nigeria), ESN (Esan in Nigeria), ACB (African Caribbean in Barbados), LWK (Luhya in Webuye, Kenya), ASW (African Ancestry in Southwest US), CHS (Han Chinese South), CDX (Chinese Dai in Xishuangbanna, China), KHV (Kinh in Ho Chi Minh City, Vietnam), CHB (Han Chinese in Beijing, China), JPT (Japanese in Tokyo, Japan), BEB (Bengali in Bangladesh), ITU (Indian Telugu in the UK), STU (Sri Lankan Tamil in the UK), PJL (Punjabi in Lahore, Pakistan), GIH (Gujarati Indian in Houston, TX), TSI (Toscani in Italia), IBS (Iberian populations in Spain), CEU (Utah residents [CEPH] with Northern and Western European ancestry), GBR (British in England and Scotland), FIN (Finnish in Finland); MXL (Mexican Ancestry in Los Angeles, CA), PUR (Puerto Rican in Puerto Rico), CLM (Colombian in Medellin, Colombia), and PEL (Peruvian in Lima, Peru).

Figure 2—figure supplement 1
Reconciliation for linkage disequilibrium (LD) estimators in AFR, EAS, and EUR.

In each figure, the 22 l^i fit line is in purple, whereas the 231 l^ij fit line is in green. The gray solid line, y=1n+x, in which n the sample size, represents the expected fit between PLINK and X-LD, and the two estimated regression models at the top-right corner show this consistency.

Figure 2—figure supplement 2
The computational efficiency of X-LD algorithm.

Considering the high computational cost of PLINK, only the first chromosome was chosen. In the process of evaluating computational efficiency, we kept adding single-nucleotide polymorphisms (SNPs) until the inclusion of the entire chromosome. The bar chart and line chart show the actual calculation time and theoretical calculation complexity, respectively.

Figure 3 with 1 supplement
Various linkage disequilibrium (LD) components for the 26 1KG cohorts.

(A) Chromosomal scale LD components for five representative cohorts (CEU, CHB, YRI, ASW, and 1KG). The upper parts of each figure represent l^i (along the diagonal) and l^ij (off-diagonal), and the lower part l^ij as in Equation 8. For visualization purposes, the quantity of LD before scaling is transformed to a -log10 scale, with smaller values (red hues) representing larger LD, and a value of 0 representing that all single-nucleotide polymorphisms (SNPs) are in LD. (B) The relationship between the degree of population structure (approximated by F-st) and l^i , l^g , and l^ij in the 26 1KG cohorts.

Figure 3—figure supplement 1
Chromosomal scale linkage disequilibrium (LD) components for 26 cohorts in 1KG.

The upper and lower parts of each figure represent the LD before and after scaling according to Equation 8. l^i and l^ij are represented by the diagonal and the off-diagonal elements, respectively. For visualization purposes, LD before scaling is transformed to a -log10-scale, with smaller values (red hues) representing larger LD, and a value of 0 representing that all single-nucleotide polymorphisms (SNPs) are in LD.

Figure 4 with 2 supplements
High-resolution illustration for linkage disequilibrium (LD) grids for CEU, CHB, YRI, and ASW (m=250).

For each cohort, we partition chromosomes 6 and 11 into high-resolution LD grids (each LD grid contains 250 ×250 single-nucleotide polymorphism [SNP] pairs). The bottom half of each figure shows the LD grids for the entire chromosome. Further zooming into HLA on chromosome 6 and the centromere region on chromosome 11, and their detailed LD in the relevant regions are also provided in the upper half of each figure. For visualization purposes, LD is transformed to a -log10-scale, with smaller values (red hues) representing larger LD, and a value of 0 representing that all SNPs are in LD.

Figure 4—figure supplement 1
High-resolution illustration for linkage disequilibrium (LD) grids for CEU, CHB, YRI, and ASW (m=500).

For each cohort, we partitioned each chromosome into consecutive LD grids (each LD grid containing 500 single-nucleotide polymorphisms [SNPs]). For visualization purposes, LD is transformed to a -log10-scale, with smaller values (red hues) representing larger LD, and a value of 0 representing that all SNPs are in LD.

Figure 4—figure supplement 2
Influence of HLA region on chromosome 6 and centromere region on chromosome 11 on chromosomal linkage disequilibrium (LD) in CEU, CHB, YRI, and ASW.

When another region was removed, to avoid chance, the same number of consecutive single-nucleotide polymorphisms (SNPs) as HLA region or centromere region was randomly removed from the genomic region, and this operation was repeated 100 times.

Figure 5 with 1 supplement
Linkage disequilibrium (LD) decay analysis for 26 1KG cohorts.

(A) Conventional LD decay analysis in PLINK for 26 cohorts. To eliminate the influence of sample size, the inverse of sample size has been subtracted from the original LD values. The YRI cohort, represented by the orange dotted line, is chosen as the reference cohort in each plot. The top-down arrow shows the order of LDdecay values according to Table 5. (B) Model-based LD decay analysis for the 26 1KG cohorts. We regressed each autosomal l^i against its corresponding inversion of the single-nucleotide polymorphism (SNP) number for each cohort. Regression coefficient b1 quantifies the averaged LD decay of the genome and intercept b0 provides a direct estimate of the possible existence of long-distance LD. The R values in the first three plots indicate the correlation between b^1 and LD decay score in three different physical distance and the correlation between b^1 (left-side vertical axis) and LD decay score (right-side vertical axis) and the correlation between b^0 (left-side vertical axis) and F-st (right-side vertical axis), respectively. The last plot assessed the impact of centromere region of chromosome 11 on the linear relationship between chromosomal LD and the inverse of the SNP number. The dark and light gray dashed lines represent the mean of the R with and without the presence of centromere region of chromosome 11.

Figure 5—figure supplement 1
The correlation between the inverse of the single-nucleotide polymorphism (SNP) number and chromosomal linkage disequilibrium (LD) in 26 cohorts of 1KG.
Figure 6 with 1 supplement
The correlation between the inversion of the single-nucleotide polymorphism (SNP) number and l^i.

(A) The correlation between the inversion of the SNP number and l^i in CEU, CHB, YRI, and ASW. (B) Leave-one-chromosome-out strategy is adopted to evaluate the contribution of a certain chromosome on the correlation between the inverse of the SNP number and l^i . (C) The correlation between the inversion of the SNP number and chromosomal linkage disequilibrium (LD) in CEU, CHB, YRI, and ASW after removing the centromere region of chromosome 11. (D) High-resolution illustration for LD grids for chromosome 8 in CEU, CHB, YRI, and ASW. For each cohort, we partition chromosome 8 into consecutive LD grids (each LD grid contains 250 ×250 SNP pairs). For visualization purposes, LD is transformed to a -log10-scale, with smaller values (red hues) representing larger LD, and a value of 0 representing that all SNPs are in LD.

Figure 6—figure supplement 1
Influence of expanding of single-nucleotide polymorphism (SNP) numbers on the correlation between the inverse of the SNP number and chromosomal linkage disequilibrium (LD) in ASW.

Randomly selected SNPs that were presented in ASW but were not 2,997,635 consensus SNPs were added to the ASW cohort to demonstrate the stable pattern of chromosome 8.

Author response image 1

Tables

Table 1
Notation definitions.
NotationDefinition
CThe number of chromosomes.
i and jSubscripts index chromosome i and j.
βiThe number of SNP segments of chromosome i, each of which has m SNPs.
Dl1l2The difference between the observed and expected haplotype frequencies, with Dl1l2=pl1l2-pl1pl2 .
FThe inbreeding coefficient.
KiGenetic relatedness matrix for chromosome i, and two vectors, kio and kid , from Ki , where kio stacks the off-diagonal elements and kid stacks the diagonal elements.
kSubscript indexes individual.
l1 and l2Subscripts index a pair of SNPs.
mThe number of SNPs; mi the number of SNPs on chromosome i.
nThe number of samples; ni , the number of samples in subpopulation i.
pl and qlFrequency of the lth reference allele and alternative allele in the population.
θk1k2The relatedness score between individual k1 and k2 .
xklThe genotype for the kth individual at the lth biallelic locus.
Xi and X~iGenotype and standardized genotype matrixes for chromosome i.
ρl1l22Squared Pearson’s correlation coefficient for any pair of SNPs, including an SNP to itself when l1=l2 .
r2Squared Pearson’s correlation metric for LD but estimated from PLINK (--r2) or PopLDdecay.
lgThe mean LD of the whole genome-wide m2 SNP pairs.
liThe intra-chromosomal mean LD for the ith chromosome of mi2 SNP pairs.
lijThe inter-chromosomal mean LD ith and jth chromosomal mimj SNP pairs, a scaled version is ij .
luThe mean LD for a diagonal grid.
luvThe mean LD for off-diagonal grids.
  1. LD, linkage disequilibrium; SNP, single-nucleotide polymorphism.

Table 2
Computational time for the demonstrated estimation tasks.
CohortTask descriptionTime costComputational time complex
CHB (n=103, m=2,997,655)Estimation for 22 autosomal i , and 231 inter-chromosomal ij . For results, see Figure 3 and Table 3.101,34 sO(n2m)
1KG (n=2,503, m=2,997,655)Same as above.3008.29 sSame as above
CONVERGE (n=10,640, m=5,215,820)Same as above. For results, see Figure 1A.77,508.00 sSame as above
Estimation for high-resolution LD interaction given bin size of 250 SNPs
CHB (n=103, m2=241,241)Chromosome 2, estimation for 965 li, and 465,130 lij . For results, see Figure 4.66.86 sO(n2(mi+(mi250)2))
CHB (n=103, m22=40,378)Chromosome 22, estimation for 162 li, and 13,041 lij . For results, see Figure 4.3.22 sSame as above
CONVERGE (n=10,640, m22=71,407)Chromosome 22, estimation for 286 li, and 40,755 lij .8,736.29 sSame as above
CONVERGE (n=10,640, m2=420,949)Chromosome 2, estimation for 1000 li, and 499,500 lij . For results, see Figure 1B.45,125.00 sChromosome 2 was split into 1000 blocks, each of which had about 420 SNPs
  1. For the sake of fair comparison, 10 CPUs were used for multi-thread computing.

  2. LD, linkage disequilibrium; SNP, single-nucleotide polymorphism.

Table 3
X-LD estimation for complex LD components (2,997,635 SNPs).
Cohort (n)Ancestryλ1(Fst)*l^g (SE)l¯i^ (SD) l¯ij^ (SD) l^ij (SD) Lower bound of LD §
MSL (85)AFR1.10 (0.013)1.9e-4 (1.21e-6)6.9e-4 (2.0e-4)1.7e-4 (1.7e-5)0.26 (0.053)0.161971831
GWD (113)AFR1.07 (0.009)1.1e-4 (5.61e-7)6.0e-4 (2.0e-4)8.7e-5 (8.1e-6)0.16 (0.037)0.247218789
YRI (107)AFR1.05 (0.010)1.1e-4 (4.23e-7)5.9e-4 (2.0e-4)8.8e-5 (6.9e-6)0.16 (0.04)0.242001641
ESN (99)AFR1.09 (0.011)1.4e-4 (7.67e-7)7.0e-4 (2.2e-4)1.2e-4 (1.2e-5)0.19 (0.043)0.217391304
ACB (96)AFR2.01 (0.021)2.9e-4 (3.78e-6)9.1e-4 (2.5e-4)2.5e-4 (3.6e-5)0.29 (0.070)0.147727273
LWK (99)AFR1.35 (0.014)2.2e-4 (2.38e-6)8.4e-4 (2.5e-4)1.9e-4 (3.2e-5)0.24 (0.052)0.173913043
ASW (61)AFR1.90 (0.031)1.1e-3 (2.73e-5)2.0e-3 (3.2e-4)1.1e-3 (6.2e-5)0.57 (0.059)0.079681275
CHS (105)EA1.08 (0.010)1.4e-4 (9.39e-7)9.5e-4 (3.4e-4)1.0e-4 (1.3e-5)0.12 (0.030)0.31147541
CDX (93)EA1.11 (0.012)1.8e-4 (1.38e-6)1.1e-3 (3.6e-4)1.4e-4 (2.0e-5)0.14 (0.040)0.272277228
KHV (99)EA1.07 (0.011)1.4e-4 (7.67e-7)9.5e-4 (3.5e-4)1.0e-4 (1.2e-5)0.12 (0.031)0.31147541
CHB (103)EA1.07 (0.010)1.3e-4 (6.94e-7)9.3e-4 (3.4e-4)9.5e-5 (1.1e-5)0.11 (0.030)0.317948718
JPT (104)EA1.06 (0.010)1.3e-4 (7.22e-7)1.0e-3 (3.8e-4)9.3e-5 (1.2e-5)0.10 (0.028)0.338638673
BEB (86)SA1.07 (0.012)1.7e-4 (8.09e-7)9.1e-4 (3.1e-4)1.4e-4 (1.5e-5)0.17 (0.042)0.236363636
ITU (102)SA1.61 (0.016)1.9e-4 (1.84e-6)9.5e-4 (3.1e-4)1.5e-4 (1.7e-5)0.18 (0.044)0.231707317
STU (102)SA1.56 (0.015)2.6e-4 (3.21e-6)1.0e-3 (3.3e-4)2.3e-4 (3.1e-5)0.23 (0.047)0.171526587
PJL (96)SA1.67 (0.017)2.4e-4 (2.74e-6)1.1e-3 (3.4e-4)2.0e-4 (2.2e-5)0.21 (0.048)0.20754717
GIH (103)SA1.73 (0.017)2.7e-4 (3.41e-6)1.1e-3 (3.4e-4)2.4e-4 (1.9e-5)0.23 (0.049)0.179153094
TSI (107)EUR1.07 (0.010)1.2e-4 (6.10e-7)9.1e-4 (3.3e-4)9.0e-5 (1.1e-5)0.11 (0.029)0.325
IBS (107)EUR1.07 (0.010)1.2e-4 (6.10e-7)9.1e-4 (3.3e-4)8.8e-5 (1.1e-5)0.11 (0.028)0.329949239
CEU (99)EUR1.07 (0.011)1.4e-4 (7.67e-7)9.6e-4 (3.4e-4)1.1e-4 (1.3e-5)0.12 (0.030)0.293577982
GBR (91)EUR1.11 (0.012)1.7e-4 (1.08e-6)1.0e-3 (3.6e-4)1.4e-4 (1.8e-5)0.15 (0.036)0.253807107
FIN (99)EUR1.09 (0.011)1.5e-4 (9.69e-7)1.1e-3 (3.8e-4)1.0e-4 (1.5e-5)0.10 (0.027)0.34375
MXL (64)AMR2.29 (0.036)7.2e-4 (1.49e-5)2.1e-3 (4.1e-4)6.3e-4 (9.6e-5)0.32 (0.072)0.136986301
PUR (104)AMR1.43 (0.014)1.6e-4 (1.30e-6)1.2e-3 (4.2e-4)1.2e-4 (1.7e-5)0.11 (0.026)0.322580645
CLM (94)AMR1.58 (0.017)2.3e-4 (2.49e-6)1.4e-3 (4.5e-4)1.7e-4 (2.6e-5)0.13 (0.035)0.281690141
PEL (85)AMR2.38 (0.028)4.5e-4 (7.33e-6)1.9e-3 (5.1e-4)3.7e-4 (8.5e-5)0.21 (0.062)0.196483971
1KG (2503)MIX164.20 (0.066)5.8e-3 (4.63e-6)6.5e-3 (4.1e-4)5.7e-3 (2.4e-4)0.88 (0.028)0.051505547
  1. LD, linkage disequilibrium; SNPs, single-nucleotide polymorphisms.

  2. *

    Eigenvalue was estimated. In parentheses is the ratio between the listed largest eigenvalue and the sample size. Since there exists an approximation that F-stλ1n , the ratio can be taken as an approximation of population structure.

  3. Standard error was calculated as 2n(n1)[l^g1(n1)2], as Equation 7.

  4. Estimated empirically from C chromosomal l^i ; Estimated empirically from C(C1)2 inter-chromosomal l^ij .

  5. §

    It is estimated by 22¯^i22¯^i+231¯^ij , indicating lower bound of true LD.

Table 4
Estimates for 22 autosomal l^i in CEU, CHB, YRI, and ASW, respectively.
ChromosomeSNP numberl^i
CEUCHBYRIASW
1225,9675.0e-4 (8.2e-6)0.00049 (7.8e-6)0.00032 (4.3e-6)0.0015 (4e-05)
2241,2415.0e-4 (8.1e-6)5.0e-4 (7.9e-6)3.0e-4 (4.1e-6)0.0015 (4e-05)
3212,6706.0e-04 (1.0e-5)0.00058 (9.5e-6)0.00039 (5.7e-6)0.0018 (5.1e-5)
4222,2410.00062 (1.0e-5)0.00061 (1.0e-5)0.00038 (5.4e-6)0.0018 (5.0e-5)
5193,6320.00069 (1.2e-5)7.0e-04 (1.2e-5)0.00043 (6.5e-6)0.0018 (4.9e-5)
6206,1650.0010 (1.9e-5)9.0e-04 (1.6e-5)0.00064 (1.0e-5)0.0019 (5.4e-5)
7177,4140.00073 (1.3e-5)0.00071 (1.2e-5)0.00045 (6.8e-6)0.0016 (4.3e-5)
8163,4360.00075 (1.3e-5)0.00069 (1.2e-5)0.00043 (6.5e-6)0.0022 (6.4e-5)
9129,4400.00074 (1.3e-5)0.00074 (1.3e-5)0.00047 (7.2e-6)0.0018 (5.0e-5)
10152,2510.00078 (1.4e-5)8.0e-04 (1.4e-5)0.00058 (9.3e-6)0.0019 (5.6e-5)
11151,7510.0012 (2.3e-5)0.0012 (2.2e-5)0.00084 (1.4e-5)0.0022 (6.2e-5)
12139,6848.0e-4 (1.4e-5)0.00073 (1.2e-5)0.00049 (7.5e-6)0.0017 (4.8e-5)
13113,3900.0010 (1.8e-5)0.00094 (1.6e-5)0.00061 (9.8e-6)0.0018 (4.9e-5)
1497,3350.0011 (2.0e-5)0.0010 (1.8e-5)0.00065 (1.1e-5)0.0020 (5.6e-5)
1585,3070.0010 (1.8e-5)0.00098 (1.7e-5)6.0e-4 (9.6e-6)0.0020 (5.8e-5)
1692,0070.00088 (1.6e-5)0.00084 (1.5e-5)0.00054 (8.4e-6)0.0021 (6.2e-5)
1779,4780.0012 (2.3e-5)0.0011 (2.0e-5)0.00069 (1.1e-5)0.0021 (6.0e-5)
1887,1050.0010 (1.8e-5)0.00095 (1.7e-5)0.00058 (9.2e-6)0.0023 (6.8e-5)
1972,7940.0012 (2.3e-05)0.0012 (2.1e-5)0.00082 (1.4e-5)0.0022 (6.2e-5)
2068,8810.0014 (2.6e-5)0.0015 (2.7e-5)0.00078 (1.3e-5)0.0024 (7.0e-5)
2145,0680.0018 (3.4e-5)0.0017 (3.2e-5)0.00098 (1.7e-5)0.0024 (7.1e-5)
2240,3780.0016 (3.1e-5)0.0016 (2.9e-5)0.0010 (1.8e-5)0.0027 (8.1e-5)
  1. Each l^i and its standard error are in parentheses, as estimated in Equation 7.

  2. SNP, single-nucleotide polymorphism.

Table 5
LD decay regression analysis for 26 cohorts.
Cohort (n)LD-decay regression*Population parameters
b^0b^1RLD decay scoreFst¯ (%)AncestryTrue LD
MSL (85)0.0004129.970.840.04210.013AFR0.62727273
GWD (113)0.0003130.170.830.04390.009AFR0.65934066
YRI (107)0.0003030.640.850.04360.010AFR0.66292135
ESN (99)0.0003734.820.870.04360.011AFR0.65420561
ACB (96)0.0005339.620.880.04510.021AFR0.63194444
LWK (99)0.0004640.520.920.04470.014AFR0.64615385
ASW (61)0.001546.880.830.04720.031AFR0.57142857
CHS (105)0.0004652.360.870.05550.010EA0.67375887
CDX (93)0.0005553.770.830.05570.012EA0.66666667
KHV (99)0.0004453.790.870.05600.011EA0.68345324
CHB (103)0.0004154.900.900.05580.010EA0.69402985
JPT (104)0.0004557.750.850.05680.010EA0.68965517
BEB (86)0.0004548.840.880.05560.012SA0.66911765
ITU (102)0.0004849.580.890.05460.016SA0.66433566
STU (102)0.0005552.840.890.05460.015SA0.64516129
PJL (96)0.0005454.000.900.05460.017SA0.67073171
GIH (103)0.0005755.810.910.05620.017SA0.65868263
TSI (107)0.0004153.170.910.05580.010EUR0.68939394
IBS (107)0.0003954.220.920.05550.010EUR0.7
CEU (99)0.0004554.230.890.05590.011EUR0.68085106
GBR (91)0.0004758.230.910.05550.012EUR0.68027211
FIN (99)0.0005459.240.860.05790.011EUR0.67073171
MXL (64)0.001466.130.890.05580.036AMR0.6
PUR (104)0.0005967.200.890.05710.014AMR0.67039106
CLM (94)0.0006975.970.950.05720.017AMR0.66985646
PEL (85)0.001278.150.850.05980.028AMR0.61290323
1KG (2503)0.006140.650.550.066Mixed0.51587302
  1. LD, linkage disequilibrium; SNP, single-nucleotide polymorphism.

  2. *

    The regression intercept b^0 and the coefficients b^1 are as represented in Equation 3.

  3. The column for LD decay score was taken as the mean of the estimated r2-1n from PopLDdecay in a physical distance of 1500 kb, which was approximated to the area under the curve in Figure 5A for each cohort; Fst was approximated by λ1n , in which λ1 the largest eigenvalue for the cohort. r2 was the estimated LD statistic from PLINK (--r2).

  4. True LD is defined as l¯^ijl¯^ij+b^0 .

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Xin Huang
  2. Tian-Neng Zhu
  3. Ying-Chao Liu
  4. Guo-An Qi
  5. Jian-Nan Zhang
  6. Guo-Bo Chen
(2023)
Efficient estimation for large-scale linkage disequilibrium patterns of the human genome
eLife 12:RP90636.
https://doi.org/10.7554/eLife.90636.3