Science Forum: Considerations when investigating lncRNA function in vivo
Abstract
Although a small number of the vast array of animal long non-coding RNAs (lncRNAs) have known effects on cellular processes examined in vitro, the extent of their contributions to normal cell processes throughout development, differentiation and disease for the most part remains less clear. Phenotypes arising from deletion of an entire genomic locus cannot be unequivocally attributed either to the loss of the lncRNA per se or to the associated loss of other overlapping DNA regulatory elements. The distinction between cis- or trans-effects is also often problematic. We discuss the advantages and challenges associated with the current techniques for studying the in vivo function of lncRNAs in the light of different models of lncRNA molecular mechanism, and reflect on the design of experiments to mutate lncRNA loci. These considerations should assist in the further investigation of these transcriptional products of the genome.
https://doi.org/10.7554/eLife.03058.001Main text
Complex transcription interwoven between and within protein-coding genes produces many thousands of long non-coding RNAs (lncRNAs) that are greater than 200 nucleotides (nt) in length but that appear to lack protein-coding potential (Djebali et al., 2012). Nevertheless, even for the earliest discovered lncRNAs, such as mammalian H19, Xist or fruitfly roX, molecular effects and functional significance have proven difficult to establish (Gabory et al., 2010; Ilik et al., 2013; Sado and Brockdorff, 2013). Furthermore, no or only subtle mouse phenotypes were revealed by detailed loss-of-function studies of Malat1 or Evf-2. In contrast, mutation of Fendrr results in early lethality, and targeted replacement of BC1 results in seizures for some mice (Table 1). It is not possible to accurately predict from the level or extent of its expression, or its sequence composition, whether disruption of a lncRNA locus will result in an overt phenotype. This makes loss- or gain-of-function experiments crucial to understanding the roles of lncRNAs in vivo.
Many lncRNAs are known to act as primary host transcripts for classes of small non-coding RNAs (da Rocha et al., 2008; Royo and Cavaille, 2008). However, lncRNAs are also presumed to regulate the expression either of their neighbouring genes in cis, or of more distant genes in trans (Figure 1). The function of a lncRNA may be mediated by the gene's RNA product which can bind to proteins or to other nucleic acids thereby modulating their functions. This could act by competing with endogenous mRNAs for miRNA binding (Franco-Zorrilla et al., 2007; Poliseno et al., 2010; Jeck and Sharpless, 2014), providing binding sites for small RNAs that elicit transcriptional silencing (Wierzbicki et al., 2009), or through altering protein activity (Feng et al., 2006), binding or specificity (reviewed in Guttman and Rinn, 2012). Alternatively, the act of transcription per se through a lncRNA locus could be critical because of the changes this generates in chromatin structure, modification or protein binding: in this case the resultant RNA could be an incidental by-product (Petruk et al., 2006; Latos et al., 2012; Marquardt et al., 2014). In these latter cases, any technique intended to dissect mechanism must alter the act and extent of transcription rather than change RNA levels. This multiplicity of lncRNA functional mechanisms means that a toolkit of experimental strategies to dissect their modes of action will need to be added to those currently employed for investigating protein-coding genes. Protein-coding genes have been shown to contribute greatly to biological function, which is not yet the case for lncRNA loci, rendering their rigorous investigation particularly important.
Analysis of lncRNA localisation both on a tissue and subcellular level by techniques such as fluorescent in situ hybridisation (FISH, Chakraborty et al., 2012) can give important insights into the cell types that are important for their function, and in which subcellular compartment they act. Understanding the mechanism of action of lncRNAs often relies on identification of interacting proteins or nucleic acids by RNA-protein (e.g., crosslinking immunoprecipitation, CLIP, Huppertz et al., 2014), RNA–RNA (e.g., crosslinking analysis of synthetic hybrids, CLASH, Helwak et al., 2013) or RNA-DNA (e.g., CHART, Simon et al., 2011; Vance and Ponting, 2014 and ChIRP, Chu et al., 2011) interaction assays. However, due to the nature of the RNA molecule, many assays are prone to non-specific binding, and it is critical to ensure that appropriate controls are performed (Brockdorff, 2013). Several of these techniques have therefore been designed to identify direct interactors by crosslinking, and subsequent use of denaturing conditions to remove non-specific interactions. These techniques are clearly important in determining the mechanism of action of lncRNAs, and are critical to guide experimental genetic knockout design. However, an understanding of the functional importance of lncRNAs in the context of the whole organism still relies on manipulating their expression by genetic modification, overexpression or knockdown strategies, and analysis of the resulting phenotypes.
The earliest studied lncRNAs were those associated with imprinting, such as Airn and H19, or X chromosome regulation, such as Xist or roX1/2 (Table 1). In these cases, lncRNA expression was initially linked genetically to a known phenotype, and cell line models accurately reflected the in vivo models (Hao et al., 1993; Keniry et al., 2012; Latos et al., 2012; Helwak et al., 2013; Huppertz et al., 2014). These results highlight that the early models of lncRNAs involved in imprinting and X chromosomal dosage compensation could act as paradigms for the study of lncRNAs today (Kohtz, 2014). In the absence of a priori phenotypic associations, some lncRNAs have been chosen for study on the basis of their tissue restricted patterns of expression, sequence conservation, or cellular localisation. Others, such as MALAT1 (whose level of expression is associated with metastasis) have been selected on the basis of their suggested association with disease. Neat1 and Malat1 (also known as Neat2) are linked loci that produce highly expressed lncRNAs whose sequences are well conserved across diverse mammals and which have specific nuclear localisations (Chu et al., 2011). In cells, Neat1 was shown to be essential for nuclear paraspeckle assembly and maintenance (Clemson et al., 2009; Sasaki and Hirose, 2009; Sunwoo et al., 2009; Mao et al., 2011a, 2011b; Zhang et al., 2012) and Malat1/Neat2 binds to the Polycomb 2 (PC2) protein which is required for activating growth-control genes (Vance and Ponting, 2014). Nevertheless, in vivo disruption of either of these lncRNA loci results in viable and fertile mouse models (Table 1).
Confirmation or rejection of lncRNA functionality requires experimental evidence that clearly separates the role of the genomic locus from the role of its RNA products. Here we recommend experimental techniques that achieve this separation whilst minimising disruption of the DNA sequence. Furthermore, we propose some considerations that may assist in interpreting phenotypes arising from mutation of a lncRNA or lncRNA locus (Box 1).
Considerations when interpreting phenotypes resulting from lncRNA mutation
General Considerations
The design of functional experiments should be guided by the essential RNA biology of the chosen lncRNA locus: its proximity to protein-coding genes, its chromatin signatures, stability, copy number, full-length transcript models and tissue expression profiles. If it shares a bidirectional promoter then minimise interference with the adjacent locus when designing targeting strategies. If more abundant and stable, with promoter-like chromatin marks at its transcriptional start site, then consider whether the lncRNA acts in trans in an RNA dependent manner.
Consider all available transcript, regulatory element and evolutionary evidence when designing mutations.
Consider whether, contrary to initial expectations, the lncRNA encodes protein or, as for H19, harbours a miRNA.
Choice of loss-of-function strategy and prediction of whether the lncRNA acts in cis or in trans should be informed by its cytoplasmic, nuclear or chromatin localisation. If found in the cytoplasm, consider whether it is, in fact, translated. If chromatin-associated consider whether it acts in cis. In contrast, if cytoplasmic or nucleoplasmic, consider whether it is trans-acting.
Choose cells for functional experiments in which the lncRNA is relatively highly expressed, certainly at greater than one molecule per cell.
Minimise genomic sequence disruptions when investigating lncRNA or lncRNA locus function. Use control manipulations to distinguish disruptions influencing flanking genes from those influencing the lncRNA.
Investigate each locus using multiple complementary strategies, for example introduction of minimal targeted DNA deletions, inversions or disruptions and, separately, of transcriptional truncation cassettes. Consider using controls for genetic manipulations of lncRNA loci: inverting the truncation cassette where possible, using a mutated truncation cassette, using a different type of truncation cassette, and using different sites to truncate the lncRNA. It is important to remove any selection cassettes and to consider the influence of reporter genes and loxP sites on the locus. Fully describe the mutated locus, including whether the selection cassette is retained.
Assay biological replicates separately. Embryonic stem (ES) or induced pluripotent (iPS) cells frequently vary in their differentiation kinetics, especially after undergoing gene targeting and selection, and mouse embryos, particularly early implantation stage mouse embryos, show considerable variation in developmental timing. Similarly, cancer cell lines are inherently genetically unstable. This variability makes it essential to study multiple clones of cells or independently derived mutants to ensure that the effects observed are due to the mutation of interest, and not dependent on other effects of the genetic background. This is especially important when the phenotypic effects are subtle.
Assessment of evidence for lncRNA functionality
Consider the evidence for each of the many known transcriptional or post-transcriptional, nuclear or cytoplasmic, cis or trans, RNA-dependent or -independent mechanisms of lncRNAs.
Employ RNAi-based techniques principally when investigating cytoplasmic RNAs and post-transcriptional RNA-dependent mechanisms. If using RNAi, the knockdown effect on the cytoplasmic and nuclear compartment should be determined separately. An alternative is to use antisense DNA oligos to induce an RNase H activity in the nucleus.
Only claim that a phenotype is caused by alteration of a trans-acting lncRNA transcript when it is successfully and repeatedly rescued upon expression of the lncRNA from an independent transgene.
Take advantage of carefully controlled biochemical approaches when assessing the potential function of a lncRNA.
Publications and reporting
Assess and report objectively all evidence for or against RNA sequence-dependent function or transcription-dependent (RNA sequence-independent) function.
Report phenotypes precisely. Commonly, gene knockouts kill embryos at critical periods for example, implantation, gastrulation, 12.5dpc when the cardiovascular system become essential, and at birth when lungs and many other systems become essential. In general the maternal organs rescue many organ defects of the embryo. For ES cells, phenotypes affecting pluripotency need to be defined and should be considered with caution due to the inherent instability of this state.
Explicitly caution when evidence for RNA-dependent vs–independent function, or trans- vs cis-acting function, is not clear-cut.
In vivo, loss-of-function strategies
Different genetic loss-of-function strategies can be employed in vivo to study the function of lncRNAs (Figure 2). Prioritisation of strategy should depend on the lncRNA's known biology, including its localisation to one or more of the cytoplasm, nucleus or chromatin. In one study, the majority of human lncRNAs were enriched in the cytoplasm (van Heesch et al., 2014) and these may associate with ribosomes and, contrary to expectations, some may be translated (Guttman et al., 2013; Kim et al., 2014; Wilhelm et al., 2014). Nuclear lncRNAs, particularly those that are chromatin-associated, could act as cis-acting transcriptional regulators, whereas cytoplasmic or nucleoplasmic lncRNAs might be predicted to function in trans; by contrast, some nucleoplasmic lncRNAs may of course be non-functional products of transcription.
Depletion of protein-coding transcripts is often achieved using RNAi-based techniques, which supply double-stranded RNA that is able to trigger post-transcriptional destabilisation of the mature mRNA and inhibit translation, predominantly in the cytoplasm. Although the presence of active RNAi factors in human cell nuclei has been proposed (Gagnon et al., 2014) the extent to which exclusively nuclear lncRNAs can be knocked down remains unclear. Whilst useful for studies of many trans-acting lncRNAs, RNAi-based knockdown acts post-transcriptionally, and therefore does not block the act of transcription, precluding analyses of lncRNAs which may produce their effects via this mechanism.
Another experimental approach is to genetically manipulate the lncRNA locus. When inserting transcriptional terminator sequences care must be taken to control for changes in spacing between DNA regulatory elements and to take account of regulatory elements that may be inadvertently inserted, such as promoters of resistance genes, since these may be able to drive expression of neighbouring genes or divert activities from nearby enhancers. Insertion of exogenous sequences can induce phenotypes (Steshina et al., 2006). Even single loxP sites can attract germline methylation that might potentially repress flanking regulatory elements (Rassoulzadegan et al., 2002). Extra controls are thus needed to identify possible gain-of-function effects arising from inserted sequences, such as reporters or selection cassettes. The advent of programmable nucleases (Kim and Kim, 2014) provides opportunities to investigate these possibilities. Transcriptional terminator sequences can vary in their efficacy depending on the genomic context into which they are inserted, which can cause termination to be highly inefficient. For example, a sequence that efficiently terminates transcription in multiple contexts in Airn, failed to do so when inserted close to a CpG island (Latos et al., 2012).
Other approaches include deletion of the full-length lncRNA locus or its promoter sequence, mutation of putative functional domains or targeted interruption between the promoter and the RNA sequence through an engineered inversion (Figure 2; Table 1). Whilst useful, such strategies may not always be successful. Promoter inversion, for instance, may not always abrogate transcription, because of the bidirectionality of promoters (Wu and Sharp, 2013), and promoter deletion may also disrupt the expression level of protein-coding transcripts with which lncRNAs share a bidirectional promoter. In all of these cases, it is important to minimise the removal or reorganisation of regulatory factor binding sites or other regulatory elements within the DNA, and to control for the addition of novel binding sites. For example, it should be borne in mind that many lncRNAs initiate within enhancers (Marques et al., 2013) and in these cases disruption of the lncRNA promoter could also cause unintended changes in gene expression. In the case of transcription terminators, to ensure effects are due to changes in RNA rather than DNA, inversions of the terminator sequence or a variety of different terminators can be used. In the experimental design it is also important to consider alternatively spliced transcripts and additional transcriptional start sites to ensure full abrogation of lncRNA expression.
Antisense oligonucleotides might provide an alternative technique for analysis of lncRNA function. They are thought to act by forming a DNA/RNA hybrid with the nascent RNA transcript, and triggering RNase H-dependent degradation of the RNA in the nucleus (Figure 2). This reduces the level of the RNA before the mature transcript is produced, but the nature and extent of off-target effects are not fully understood and may be substantial (Sahu et al., 2007). Also, it is not possible to generate stable transgenic lines, which restricts analysis to cell lines or to systems where the oligonucleotides can be supplied by injection. Other approaches to disrupting lncRNA function use morpholino antisense oligos targeting e.g. splice sites (Ulitsky et al., 2011), or locked nucleic acid antisense oligonucleotides (Sarma et al., 2010).
Recent developments in rational design of DNA binding factors using transcription activator-like effector (TALE) proteins or the clustered regularly interspersed palindromic repeats (CRISPR) system have enabled recruitment of transcriptional activation (Cheng et al., 2013) or repression domains (Cong et al., 2012; Gilbert et al., 2013) to defined sites within the genome to modulate transcription, or to directly interfere with the passage of the RNA polymerase. These techniques could be used to modulate the rate of transcriptional initiation or elongation of the lncRNA (Figure 2), but care must be taken to control for direct effects of these factors on the transcription of neighbouring genes.
Separating RNA- from DNA-sequence dependent effects
Deletion of a lncRNA genomic locus does not cleanly separate a role of the lncRNA per se from a role of other functional elements contained within the underlying DNA. Such elements might be irrelevant to the lncRNA's function, yet critical to the normal function of a neighbouring protein-coding gene. Eighteen mouse knockout lines were recently described in which genomic regions containing intergenic lncRNA loci (21.6 kb mean size, 4.8 kb–49.7 kb range) were deleted and replaced by a lacZ reporter cassette (Sauvageau et al., 2013). For 13 of these lines no overt phenotypes were reported. In contrast, strong phenotypes from 5 knockout lines were observed: Peril−/− or Fendrr−/− mice have reduced viability; Mdgt−/− and linc-Pint−/− mice show growth defects; and linc-Brn1b−/− mice exhibit abnormal cortical anatomy. The authors conclude that these developmental disorders generated by DNA deletions demonstrate the critical roles that lncRNAs play in vivo (Sauvageau et al., 2013).
While this may be the correct interpretation, the strong phenotypes observed in these lines may derive from the engineered deletion of cis-regulatory DNA elements lying within these large DNA deletions that are critical for the normal functions of proximal protein-coding genes. For instance Fendrr is 1.4 kb from Foxf1, and Mdgt starts only 84 bp from the 5′ exon of Hoxd1 and terminates close to Hoxd3 (Figure 3). Consistent with this notion, data from the ENCODE project indicate that the genomic region deleted in Mdgt−/− lines contains binding sites for several transcription factors and chromatin regulatory proteins (Figure 3). Whilst the authors detected no global change of neighbouring protein-coding gene expression as assessed by limited RNAseq of tissues, it is still possible that altered cell type or developmental stage specific expression of these genes escaped detection. LncRNAs are often transcribed in a highly restricted cell population and a global, high-throughput analysis of even the full embryo may not have been informative. Ultimately, the best evidence for RNA-dependent lncRNA function derives from loss-of-function, followed by complementation approaches, as for example described in Grote et al. (2013).
This issue is also relevant for other lncRNAs transcribed from within Hox gene clusters. In the case of Hotair (Rinn et al., 2007), a several kb large deletion of the entire Hotair genomic DNA in vivo induces a subtle morphological phenotype in the spine, which was interpreted as a gain-of-function of Hoxd genes in trans (Li et al., 2013). However, Hotair is embedded in the HoxC gene cluster and topological modifications or re-arrangements in such a dense series of transcription units are likely to modify the expression of neighbouring genes. Further insights have been acquired by removing the entire HoxC locus, including both the lncRNA locus and flanking genes (Suemori and Noguchi, 2000; Schorderet and Duboule, 2011). Even when multiple alleles are available, as for Hotair, lncRNA function remains difficult to evaluate.
Expression specificity and allelic series
Deletion of the mouse Hotair lncRNA also induced a subtle developmental phenotype in the wrist (Li et al., 2013). However, because murine Hotair transcripts were not detected in developing forelimb buds (Schorderet and Duboule, 2011) it remains possible that this phenotype develops from a lack of Hotair RNAs during subsequent stages of wrist development. This possibility could only be assessed by further analysis of the expression pattern of this lncRNA. The systematic introduction of a reporter cassette into lncRNAs (Sauvageau et al., 2013) can help solve this problem, provided the difference between the stability of the reporter staining and the half-life of the RNA is kept in mind, in particular for small and dynamic cell populations (Zakany et al., 2001).
As for protein-coding genes, an exhaustive description of functional traits associated with a particular lncRNA cannot be achieved by using a single mutant allele, hence allelic series are necessary. As indicated above, the nature of the alleles required to assess the function of a given lncRNA depends upon its genomic location and its expression specificity during development and adulthood. This can be quite challenging, as exemplified by the bidirectional Hotdog and Twin of hotdog lncRNAs: even though these RNAs are located hundreds of kb distant from the HoxD gene cluster in the middle of a gene desert, their shared start site physically interacts with Hoxd genes as part of a general regulatory structure. In this case, a cis-effect could in principle be evaluated by separating the lncRNA loci from the HoxD cluster via a large inversion with a breakpoint in-between. It turns out, however, that this inversion globally disrupts the regulation of HoxD by displacing long-range acting enhancers along with the lncRNA loci, making interpretation difficult (Delpretti et al., 2013).
Discrepancies between different strategies
The lncRNA Fendrr has been studied using two independent strategies: genetic deletion (Sauvageau et al., 2013) and transcriptional terminator insertion (Grote et al., 2013). Whilst both studies describe a lethal phenotype, highlighting the potential importance of this lncRNA in development, the outcomes differ. Genetic deletion results in lung maturation and mesenchymal differentiation defects (Sauvageau et al., 2013), whilst terminator insertion leads to heart and body wall defects and to effects on the expression of the neighbouring Foxf1 gene (Grote et al., 2013). Importantly, the defects caused by terminator insertion were rescued by a transgene containing a single wild type copy of the Fendrr lncRNA locus (without its functional Foxf1 neighbour); this strongly implicates deletion of the RNA product, rather than its genomic DNA, as causing the observed phenotypes (Grote et al., 2013). Transgene rescue experiments are thus crucial for establishing RNA-dependent lncRNA function. An earlier successful illustration of this principle was the rescue of developmental defects in zebrafish by co-injection of spliced RNA for each of two lncRNAs, cyrano and megamind, whose precursor RNAs had been knocked down using morpholino antisense oligos (Ulitsky et al., 2011). However, regulatory sequences necessary for the transcription of the lncRNA itself should ideally be included in the rescue construct so as to maintain physiological levels of expression. This, added to the length of lncRNAs that can sometimes reach several hundred kb, may represent a challenge for a transgenic approach.
Substantial differences have also been observed between RNAi-mediated knockdown and transcriptional terminator insertion at the Evf-2 lncRNA locus (Feng et al., 2006; Bond et al., 2009; Berghoff et al., 2013; Kohtz, 2014). This lncRNA is transcribed across an enhancer element between the Dlx5 and Dlx6 genes, and initial studies in cell culture using RNAi suggested a model whereby Evf-2 was important for activation of Dlx5/6 (Feng et al., 2006). However, transcriptional terminator insertion in mice has shown the opposite effect on expression of Dlx5/6 (Bond et al., 2009) and causes specific changes in DNA methylation at the enhancer. Importantly these changes can be rescued by Evf-2 expression from a separate transgene, implying that they are dependent on the lncRNA itself (Berghoff et al., 2013).
Similarly to this example, knockdown of lincRNA-p21 by RNAi originally suggested a trans-acting mechanism, in which the lncRNA was involved in recruiting protein complexes to chromatin (Huarte et al., 2010). Nevertheless, subsequent studies where the promoter of the lncRNA was deleted or its transcription was blocked by antisense oligonucleotides have highlighted a different role, as this lncRNA regulates the adjacent p21 gene in cis, without having trans-acting effects (Dimitrova et al., 2014). Whilst both studies analysed by RNAseq the effect of lncRNA depletion on global gene expression in mouse embryonic fibroblasts, the two sets of differentially expressed genes did not overlap significantly. When analysing lncRNA function, it is thus important to consider multiple loss-of-function strategies that address multiple mechanisms of action.
The potential confounding effects of techniques used to separate DNA- from RNA-dependent function are further exemplified by studies of the Drosophila bxd lncRNA, which is expressed from within the HOX cluster, adjacent to the Ultrabithorax (Ubx) gene. Its expression is highly specific and occurs in the same broad region of the embryo as the Ubx gene, although notably never within the same cell (Petruk et al., 2006). Studies of bxd loss-of-function using different techniques have yielded conflicting interpretations. It has long been known that small deletions within this lncRNA cause dramatic effects on expression of the neighbouring Ubx gene (Lewis, 1978), resulting in homoeotic transformations. Indeed, certain allelic combinations are able to generate a four-winged fly. More recent studies of the same deletions suggest that the act of transcription of this lncRNA represses Ubx in cis by altering protein binding to the Ubx promoter (Petruk et al., 2006). In contrast, it was reported that inversion of the bxd promoter, driving transcription in the wrong direction whilst maintaining genomic composition, results in very minor effects on Ubx expression, and then only later in development (Pease et al., 2013). Also, a deletion removing the promoter induced a Cdx-like gain of function of Ubx (Sipos et al., 2007). Clearly, correct interpretation of such loss-of-function experiments, at such complex loci, requires careful consideration of potentially confounding factors.
Contrasting results of different experiments may also arise because of a lncRNA's involvement in different mechanisms in different cellular contexts. For example, in embryonic cells, transcription of Airn silences the adjacent Igf2r gene (Latos et al., 2012), whereas in extraembryonic tissues it acts more distally by recruiting the histone methyltransferase G9a to imprinted genes (Nagano et al., 2008).
The end of the beginning: a maturing lncRNA field
The study of lncRNAs is still in its infancy, and the biochemical and genetic techniques used to address the true significance and mechanisms of action of this class of RNA have only recently been developed or adapted from those used for investigating protein-coding genes. Such techniques must therefore be used with caution and with appropriate controls (Brockdorff, 2013; Riley and Steitz, 2013). From the examples described above, it is apparent that the optimal strategy with which to study a lncRNA's loss of function depends both on the mechanism by which it acts, in particular in a cis or trans configuration, and the regulatory sequences present within its locus. We suggest that early lessons learnt from paradigm repressor lncRNAs, such as Xist, and imprinted lncRNAs such as Airn or Kcnq1ot1, should guide the design of experiments on more recently identified lncRNAs. We have attempted to distil these lessons into the proposed considerations in Box 1. Introduction of the multiple alleles that will be necessary to adequately dissect lncRNA in vivo function will be greatly aided by recent advances in genome engineering using designer site-specific nucleases such as CRISPR/Cas9 and TALENs. The introduction of fast acute loss-of-function systems for lncRNAs, for example those that insert a sequence-specific ribonuclease site whose nuclease is under drug inducible control, would also greatly facilitate lncRNA investigation.
The trans function of a lncRNA may be investigated using locus deletion, promoter deletion, inversions, transcriptional termination or RNAi. Where possible, these strategies should be combined with genetic rescue experiments, where the lncRNA is expressed from an independent transgene inserted at a location distinct from the lncRNA locus. This strategy separates RNA-dependent effects from those arising from the manipulation of the underlying DNA. Rescue experiments using expression of the lncRNA from an independent transgene are only possible for trans-acting lncRNAs where the RNA moiety itself and not the act of transcription is critical for function.
The cis function of a lncRNA may be investigated using a combination of several alleles, such as insertion of transcriptional terminators, promoter deletions and inversions. Several alleles are likely to be required to separate lncRNA-dependent from other effects and, as controls, to reveal artefacts of genetic engineering. Engineered inversions can also be used to separate the lncRNA locus from its potential neighbouring target gene to investigate its roles in cis. Use of site-specific recombinases, such as the phiC31/attP system (Bateman et al., 2006; Zhu et al., 2014) as ‘landing sites’ or for recombination mediated cassette exchange, will greatly enhance our ability to generate such allelic series. For example, the lncRNA locus may be deleted and replaced by a recombinase ‘landing site’ into which different constructs can be introduced to investigate phenotype rescue.
In summary, if lncRNA biologists are to resolve the true in vivo functions of these numerous and enigmatic transcripts, then the strengths and weaknesses of available techniques will need to be acknowledged. Resolution will no doubt derive from the careful and comprehensive genetic dissection of individual loci using multiple alleles. The field of lncRNA biology would benefit greatly from the development of additional approaches that are effective in distinguishing effects mediated by lncRNAs as molecular species from their effect on gene regulatory elements with which lncRNA loci are interleaved across the mammalian genome.
Data availability
-
ENCODE - Encyclopedia of DNA elementsFreely available at ENCODE Data Coordination Center (DCC).
References
-
Genomic imprinting at the mammalian Dlk1-Dio3 domainTrends in Genetics 24:306–316.https://doi.org/10.1016/j.tig.2008.03.011
-
Target mimicry provides a new mechanism for regulation of microRNA activityNature Genetics 39:1033–1037.https://doi.org/10.1038/ng2079
-
Detecting and characterizing circular RNAsNature Biotechnology 32:453–461.https://doi.org/10.1038/nbt.2890
-
The H19 lincRNA is a developmental reservoir of miR-675 that suppresses growth and Igf1rNature Cell Biology 14:659–665.https://doi.org/10.1038/ncb2521
-
A guide to genome engineering with programmable nucleasesNature Reviews Genetics 15:321–334.https://doi.org/10.1038/nrg3686
-
Long non-coding RNAs learn the importance of being in vivoFrontiers in Genetics 5:45.https://doi.org/10.3389/fgene.2014.00045
-
Biogenesis and function of nuclear bodiesTrends in Genetics 27:295–306.https://doi.org/10.1016/j.tig.2011.05.006
-
Xist-deficient mice are defective in dosage compensation but not spermatogenesisGenes & Development 11:156–166.https://doi.org/10.1101/gad.11.2.156
-
Paraspeckles are subpopulation-specific nuclear bodies that are not essential in miceJournal of Cell Biology 193:31–39.https://doi.org/10.1083/jcb.201011110
-
Non-coding RNAs in imprinted gene clustersBiology of the Cell 100:149–166.https://doi.org/10.1042/BC20070126
-
Advances in understanding chromosome silencing by the long non-coding RNA XistPhilosophical Transactions of the Royal Society of London Series B, Biological sciences 368:20110325.https://doi.org/10.1098/rstb.2011.0325
-
Antisense technology: a selective tool for gene expression regulation and gene targetingCurrent Pharmaceutical Biotechnology 8:291–304.https://doi.org/10.2174/138920107782109985
-
Locked nucleic acids (LNAs) reveal sequence requirements and kinetics of Xist RNA localization to the X chromosomeProceedings of the National Academy of Sciences of USA 107:22196–22201.https://doi.org/10.1073/pnas.1009785107
-
The genomic binding sites of a noncoding RNAProceedings of the National Academy of Sciences of USA 108:20497–20502.https://doi.org/10.1073/pnas.1113536108
-
In situ dissection of a Polycomb response element in Drosophila melanogasterProceedings of the National Academy of Sciences of USA 104:12416–12421.https://doi.org/10.1073/pnas.0703144104
-
Hox C cluster genes are dispensable for overall body plan of mouse embryonic developmentDevelopmental biology 220:333–342.https://doi.org/10.1006/dbio.2000.9651
-
Transcriptional regulatory functions of nuclear long noncoding RNAsTrends in Genetics 30:348–355.https://doi.org/10.1016/j.tig.2014.06.001
-
RNA polymerase V transcription guides ARGONAUTE4 to chromatinNature Genetics 41:630–634.https://doi.org/10.1038/ng.365
-
BC1 regulation of metabotropic glutamate receptor-mediated neuronal excitabilityJournal of Neuroscience 29:9977–9986.https://doi.org/10.1523/JNEUROSCI.3893-08.2009
Decision letter
-
Detlef WeigelReviewing Editor; Max Planck Institute for Developmental Biology, Germany
eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see review process). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.
Thank you for sending your work entitled “Considerations when investigating lncRNA function in vivo” for consideration at eLife. Your article has been favorably peer reviewed by Detlef Weigel and two outside reviewers.
There has been considerable recent excitement about the role of long noncoding RNAs (lncRNAs) in gene regulation. A major difficulty in studying lncRNA function genetically is that they can overlap with other elements of the genome that have proven (in the case of antisense RNA) or potential separate functions (such as insulators, enhancers, etc). Therefore, simple knockout experiments, using either insertion of large foreign sequences or sequence deletions may produce misleading results.
The article provides several examples from the literature that clearly illustrate how studies of the same lncRNA locus can come to different conclusions. The article is timely given the apparent lack of defined “gold standards” for this kind of work. A variety of approaches for the functional analysis of lncRNA loci is discussed, highlighting the pitfalls that can be encountered when trying to distinguish the contributions of genomic DNA from the act of transcription or the RNA product itself to the observed phenotypes. The difficulties associated with distinguishing cis versus trans effects of lncRNAs are discussed and rigorous and standardized approaches to evaluating lncRNA function in vivo are proposed. The article will thus serve as an important guide to those performing genetic analyses of lncRNA loci.
The discussion is overall very balanced and the reviewers had only a few suggestions:
1) Include either at the beginning or at the end a series of bullet points, with an introductory sentence that lncRNA functional evidence should include at least x out of y of these types of evidence.
2) Mention the need for appropriate controls, when genetic manipulations, e.g., in ES or similar cells, are followed by selection of clones derived from single cells. ES and other cultured cells (e.g., derived from tumors) are genetically not perfectly stable, and one needs to ensure that phenotypes are not due to second-site mutations. Multiple clones or transgenic rescue experiments should be the norm (obviously this applies not only to lncRNA analyses).
3) It would be best to distinguish true “loss of function”, in which transcription is completely abolished, from “reduction of function”, where a truncated transcript is produced or RNAi is used. To this end, Figure 2 should be amended.
4) A brief paragraph that mentions flanking cell biological (e.g., FISH) and biochemical (e.g., RNP analysis) approaches for evaluating lncRNA function would be worthwhile.
https://doi.org/10.7554/eLife.03058.006Author response
1) Include either at the beginning or at the end a series of bullet points, with an introductory sentence that lncRNA functional evidence should include at least x out of y of these types of evidence.
The intention of Box 1 “Considerations when interpreting phenotypes resulting from lncRNA mutation” was to provide such suggestions. However, we believe that each lncRNA has to be considered on its own merits, depending on its mechanism of action and it is therefore not possible to define a set of general guidelines that would be applicable to all cases. We have therefore deliberately given a set of “considerations” rather than “guidelines”.
2) Mention the need for appropriate controls, when genetic manipulations, e.g., in ES or similar cells, are followed by selection of clones derived from single cells. ES and other cultured cells (e.g., derived from tumors) are genetically not perfectly stable, and one needs to ensure that phenotypes are not due to second-site mutations. Multiple clones or transgenic rescue experiments should be the norm (obviously this applies not only to lncRNA analyses).
We agree with the reviewers that this is an important point, since genetic background will have a large effect, especially when the phenotypes are subtle, and that this can vary substantially between different clones or lines. We have added this concept to the text.
3) It would be best to distinguish true “loss of function”, in which transcription is completely abolished, from “reduction of function”, where a truncated transcript is produced or RNAi is used. To this end, Figure 2 should be amended.
We have amended Figure 2 and the text to make this clear.
4) A brief paragraph that mentions flanking cell biological (e.g., FISH) and biochemical (e.g., RNP analysis) approaches for evaluating lncRNA function would be worthwhile.
We have added a paragraph describing some of the complimentary techniques that can be used to evaluate lncRNA mechanism, and how these may be used to guide mutant design.
https://doi.org/10.7554/eLife.03058.007Article and author information
Author details
Funding
European Research Council (DARCGENS 249869)
- Andrew R Bassett
- Chris P Ponting
Medical Research Council
- Anne C Ferguson-Smith
- Wilfried Haerty
- Douglas R Higgs
- Chris P Ponting
Max-Planck-Gesellschaft
- Asifa Akhtar
European Research Council (SystemsHOX.ch)
- Denis Duboule
Swiss National Science Foundation
- Denis Duboule
Wellcome Trust
- Adrian P Bird
- Anne C Ferguson-Smith
Austrian Academy of Sciences
- Denise P Barlow
Austrian Science Fund (FWF F4302-B09)
- Denise P Barlow
European Molecular Biology Laboratory
- Anne Ephrussi
Cancer Research UK
- Eric A Miska
National Human Genome Research Institute (U54 HG007004-2)
- Thomas R Gingeras
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
AB & CPP: the European Research Council (DARCGENs, project number 249869) and the Medical Research Council. AA: Max Planck Institute. DD: the European Research Council (SystemsHox.ch) and the Swiss National Research Foundation. APB: the Wellcome Trust. DPB: Austrian Academy of Sciences and the Austrian Science Fund FWF F4302-B09. AE: the European Molecular Biology Laboratory. AFS: The Wellcome Trust and MRC. DRH: the Medical Research Council. EAM: Cancer Research UK. TRG: NHGRI U54 HG007004-2.
Publication history
- Received:
- Accepted:
- Version of Record published:
Copyright
© 2014, Bassett et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 11,711
- views
-
- 1,947
- downloads
-
- 297
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Developmental Biology
- Genetics and Genomics
The establishment and growth of the arterial endothelium requires the coordinated expression of numerous genes. However, regulation of this process is not yet fully understood. Here, we combined in silico analysis with transgenic mice and zebrafish models to characterize arterial-specific enhancers associated with eight key arterial identity genes (Acvrl1/Alk1, Cxcr4, Cxcl12, Efnb2, Gja4/Cx37, Gja5/Cx40, Nrp1 and Unc5b). Next, to elucidate the regulatory pathways upstream of arterial gene transcription, we investigated the transcription factors binding each arterial enhancer compared to a similar assessment of non-arterial endothelial enhancers. These results found that binding of SOXF and ETS factors was a common occurrence at both arterial and pan-endothelial enhancers, suggesting neither are sufficient to direct arterial specificity. Conversely, FOX motifs independent of ETS motifs were over-represented at arterial enhancers. Further, MEF2 and RBPJ binding was enriched but not ubiquitous at arterial enhancers, potentially linked to specific patterns of behaviour within the arterial endothelium. Lastly, there was no shared or arterial-specific signature for WNT-associated TCF/LEF, TGFβ/BMP-associated SMAD1/5 and SMAD2/3, shear stress-associated KLF4 or venous-enriched NR2F2. This cohort of well characterized and in vivo-verified enhancers can now provide a platform for future studies into the interaction of different transcriptional and signalling pathways with arterial gene expression.
-
- Developmental Biology
- Genetics and Genomics
Paternal obesity has been implicated in adult-onset metabolic disease in offspring. However, the molecular mechanisms driving these paternal effects and the developmental processes involved remain poorly understood. One underexplored possibility is the role of paternally induced effects on placenta development and function. To address this, we investigated paternal high-fat diet-induced obesity in relation to sperm histone H3 lysine 4 tri-methylation signatures, the placenta transcriptome, and cellular composition. C57BL6/J male mice were fed either a control or high-fat diet for 10 weeks beginning at 6 weeks of age. Males were timed-mated with control-fed C57BL6/J females to generate pregnancies, followed by collection of sperm, and placentas at embryonic day (E)14.5. Chromatin immunoprecipitation targeting histone H3 lysine 4 tri-methylation (H3K4me3) followed by sequencing (ChIP-seq) was performed on sperm to define obesity-associated changes in enrichment. Paternal obesity corresponded with altered sperm H3K4me3 at promoters of genes involved in metabolism and development. Notably, altered sperm H3K4me3 was also localized at placental enhancers. Bulk RNA-sequencing on placentas revealed paternal obesity-associated sex-specific changes in expression of genes involved in hypoxic processes such as angiogenesis, nutrient transport, and imprinted genes, with a subset of de-regulated genes showing changes in H3K4me3 in sperm at corresponding promoters. Paternal obesity was also linked to impaired placenta development; specifically, a deconvolution analysis revealed altered trophoblast cell lineage specification. These findings implicate paternal obesity effects on placenta development and function as one potential developmental route to offspring metabolic disease.