(A) Schematic representation of the correlation analysis between genome-wide TF ChIP-seq data and Sm4 affected genes from transcriptomics data. The chromatin around the transcription start sites (TSS) of Sm4 affected genes (purple) was investigated for transcription factor binding peaks (grey), to calculate the ‘distance from TSS’ to closest binding site for a given transcription factor. This distance from TSS was used as a proxy for the likelihood of transcriptional regulation, and thus make an association between Sm4 affected genes and transcription factors. Included in the analysis where the ChIP-seq peaks of SOX18 and SOX7, and of all transcription factors available from the Encode consortium (GATA2, c-FOS, c-JUN, CTCF, EZH2, MAX and c-MYC), performed in HUVECs. A random group of genes was analysed as a control distribution as would be found by chance. (B) Sm4 affected genes were grouped into down-regulated (Sm4-down), unaffected (Sm4-unchanged) and up-regulated (Sm4-up). The plots show the cumulative distribution of the distance between the TSS of Sm4 affected genes (purple line, absolute fold change ≥2) and the closest genomic location of binding sites for SOX18, and control transcription factors SOX7 and GATA2. The median distance from the TSS of differentially expressed genes to the nearest binding event of a given transcription factor was compared to the median distance that is expected by chance from a random gene set (green line). Sm4 down regulated genes are significantly closer (bold) to the SOX18 peaks, but not to SOX7 or GATA2 peaks.