Free-living human cells reconfigure their chromosomes in the evolution back to uni-cellularity
Figures

Convergence in sex chromosomes in culture human cells.
(A) Percentage of lines with LOH (loss of heterozygosity). Each black dot represents an autosome and the red dot represents X and Y. LOH in male and female lines are separately displayed on the X and Y-axes. (B) Percentage of cell lines with either one (n = 119, 34.80% of male cell lines) or two Xa’s (n = 108, 31.58% of male cell lines), cell lines with partial X’s are not included. (C) Expression level of XIST in male cell lines, [Y] means with or without Y chromosome. (D) Expression ratios of X-linked genes between Xa[Y] and XaXa[Y] cell lines. Each grey dot represents a gene, and significant differences are indicated by black dots (t-test, p<0.05). (E) Expression level of XIST in female cell lines with or without LOH. Female lines with LOH have very low levels of XIST, suggesting all X's being active. In non-LOH (XaXb) lines, the expression of XIST indicates the presence of inactive X’s. (F) Percentage of cell lines with either one (n = 30,10.79% of female cell lines) or two Xa’s (n = 31, 11% of female lines) in female lines with LOH of whole X chromosome. Non-LOH lines are not used because of the uncertainty in the number of Xa’s.

The frequency of chromosomes loss show negative correlation to their length.
There is a slight tendency for the smaller autosomes to have higher LOH's than for the larger ones (R =~ −0.4, p=~0.046). X chromosome shows significant deviated from the regression line.

X chromosomes in indivudial cells.
(A–B) Representative images of X chromosome FISH in the A549 cell line (A) with two Xs and HeLa (B) with three Xs. DNA is stained with DAPI (blue), and the X chromosome is labeled with Cy3 (red). (C–D) The distribution of the copy number of X’s among cells from A549 (n = 343) and HeLa (n = 170).

Increasing of expression ratio of X versus autosome (EX/A).
(A) EX/A distributions among normal (N) and cancer (C) cell lines. NF and NM (or CF and CM) designate normal (or cancer) female and male lines. EX/A in cancer cell lines become larger than those of the normal cell lines. Note that the expression in normal cell lines is narrowly distributed and is close to that of the normal tissue when compared. Although the numbers of NF and NM lines are much smaller than CF and CM lines (17 and 24 vs. 279 and 341), their EX/A distributions are much tighter than in cancer cell lines. The actual counts correspond to kernel density are given in Figure 3—figure supplement 2. (B) EX/A ratio in CF, CM, NF and NM lines with filtering with three different cutoffs (see Materials and methods). EX/A ratios are consistently higher in CF and CM lines than in NF and NM lines.

EX/A ratio in cancerous and normal cell lines by RNA-seq.
The gene expression information (gtf files) by RNA-seq was downloaded from UCSC (ftp://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeCshlLongRnaSeq/). There are 7 cancerous and 11 normal cell lines respectively. E X/A was calculated by the median value of expressed X-linked and autosomal genes. Expressed genes were selected as RPKM > 1.

The frequency spectrum of EX/A in male and female cancerous cell lines compared to normal male and female cell lines.
The median expression values of X and autosomes genes were used to compute EX/A for each cell lines. The proportion of cell lines within in a bin (0.1) was plotted as Y-axis. The EX/A of cancerous cell lines show a strong right shift compared to that of normal cell lines.

Autosomes change in a cohesive manner and coevolution of X and A.
(A) The density plot of autosome copy number among 620 cell lines shows peaks at 2 and 3 per autosome. (B-C) The observed distributions of gain in copy number among autosomes in two HeLa sublines. The expected Poisson distributions are also given for sublines with different means (λ = 10/13, 16/13; see text). (D) The percentages of C(Xa:A) types among the 620 cell lines.

Both the ancestral and sub-clonal HeLa population have 55–75 chromosomes centering around the triploid count of 69.
(B) The composite distribution of the five lines with λ < 1. And the comparison to Poisson distribution.

The frequency spectrum of C(Xa:A) across all male and female cancerous cell lines.
The median values for copy numbers on autosomes and X chromosome were used to calculate C(Xa:A). The proportion of cell lines within a bin (0.1) was plotted as the Y-axis. The discrete peaks denote the four major genotypes (X:3A; X:2A; 2X:3A; 2X:2A).

A model of karyotypic evolution driven by fitness differences.
(A) Evolutionary pathways of chromosomal changes. Each node represents a karyotype C(Xa:A) and the size roughly corresponds to its frequency. Cell fitness is assumed to be a function of the Xa/A ratio, which is represented by the Y-axis. The four abundant karyotypes are shown by solid black circles. Red arrows indicate faster changes in X and black arrows indicate slower changes in autosome. Main transitions between the common karyotypes are indicated by thicker arrows. (B) Changes in the frequencies of the three key genotypes as a function of time (T, expressed in units of 1/v) under fitness neutrality with all Wi’s = 1. The parameters for Equations 1 and 2 are u = 10 v, a = 2 and b = 1.5. Both the theoretical trajectories and the observed values are given. The C(2:3) cluster (x3) is far more common in the observation than in the neutral model. (C) Changes in 4 karyotypic frequencies under selection according to Equation S3 with s = −0,5 and t = 0.5. All other conditions are the same as above. Under selection, a reasonable agreement between the model and the observation can be obtained.
Additional files
-
Supplementary file 1
XIST expression in human B-cell, HeLa and A549.
The expression of XIST was detected by Real-Time PCR (see note). The △△Ct values were 12 and 11 higher in HeLa and A549 than B-cell, which illustrates nearly 2^10 lower expression of XIST gene in HeLa and A549. The primers of XIST and control gene, used for Real-Time PCR are listed.
- https://doi.org/10.7554/eLife.28070.013
-
Supplementary file 2
Copy number of 13 largest autosomes in 6 HeLa sublines measured by low coverage whole genome sequencing.
- https://doi.org/10.7554/eLife.28070.014
-
Supplementary file 3
The information of cancer cell lines.
- https://doi.org/10.7554/eLife.28070.015
-
Supplementary file 4
The information of non-cancer cell lines.
- https://doi.org/10.7554/eLife.28070.016
-
Supplementary file 5
4-state model that incorporates selection as presented in Figure 5A.
- https://doi.org/10.7554/eLife.28070.017
-
Transparent reporting form
- https://doi.org/10.7554/eLife.28070.018