Dev Cell: (2010 Oct) 1;143(1):46-58.
Cell, Volume 143, Issue 1, 46-58, 1 October 2010
DOI 10.1016/j.cell.2010.09.001
http://www.cell.com/retrieve/pii/S0092867410010111


"Long noncoding RNAs with enhancer-like function in human cells."

Ulf Andersson Ørom 1, Thomas Derrien 2, Malte Beringer 1, Kiranmai Gumireddy 1, Alessandro Gardini 1, Giovanni Bussotti 2, Fan Lai 1, Matthias Zytnicki 2, Cedric Notredame 2, Qihong Huang 1, Roderic Guigo 2, and Ramin Shiekhattar 1, 2, 3, @.

1 The Wistar Institute, 3601 Spruce Street, Philadelphia, PA 19104, USA.
2 Centre for Genomic Regulation (CRG), UPF, Barcelona, Spain
3 Institutio Catalana de Recerca i Estudis Avancats (ICREA), Barcelona, Spain

@ Correspondence: shiekhattar@wistar.org

Received: April 23, 2010,  Revised: July 1, 2010,  Accepted: August 13, 2010,
Published: September 30, 2010



NetworkEditors' Perspectives: "Long noncoding RNAs as Activators of Specific Gene Transcription".
Abstract:
Highlights:
Summary:
Introduction:
Figure 1. Identification of Novel Long ncRNAs in Human Annotated by GENCODE
Results:
   Figure 2. Long ncRNAs Display Responsiveness to Differentiation Signals in Human Keratinocytes
   Figure 3. Stimulation of Gene Expression by Activating RNAs
   Figure 4. Knockdown of ncRNA-a7 Specifically Targets Snai1 Expression
   Figure 5. Microarray Analysis of Snai1 and ncRNA-a7 Knockdown
   Figure 6. ncRNA-Activators Potentiate Transcription of a Reporter Gene
   Figure 7. RNA-Dependent Activation of a Reporter Gene by ncNRA-a7
Discussion:
Experimental Procedures;
Supplemental Information:
Acknowledgments:
References:
Additional References:
Conclusions from:   Euchromatin,   Embryomas,   Entropy,  EMT,   Enhancers..
Further Topics:




Abstract:

While the long noncoding RNAs (ncRNAs) constitute a large portion of the mammalian transcriptome, their biological functions has remained elusive. A few long ncRNAs that have been studied in any detail silence gene expression in processes such as X-inactivation and imprinting. We used a GENCODE annotation of the human genome to characterize over a thousand long ncRNAs that are expressed in multiple cell lines. Unexpectedly, we found an enhancer-like function for a set of these long ncRNAs in human cell lines. Depletion of a number of ncRNAs led to decreased expression of their neighboring protein-coding genes, including the master regulator of hematopoiesis, SCL (also called TAL1), Snai1 and Snai2. Using heterologous transcription assays we demonstrated a requirement for the ncRNAs in activation of gene expression. These results reveal an unanticipated role for a class of long ncRNAs in activation of critical regulators of development and differentiation.

    * Highlights
    * Long noncoding RNAs activate neighboring protein-coding genes
    * Activating long ncRNAs behave similarly to classically defined enhancer elements
    * Depletion of Snail1 or its adjacent ncRNA-7a show similar cellular migration defects

Summary:

While the long noncoding RNAs (ncRNAs) constitute a large portion of the mammalian transcriptome, their biological functions has remained elusive. A few long ncRNAs that have been studied in any detail silence gene expression in processes such as X-inactivation and imprinting. We used a GENCODE annotation of the human genome to characterize over a thousand long ncRNAs that are expressed in multiple cell lines. Unexpectedly, we found an enhancer-like function for a set of these long ncRNAs in human cell lines. Depletion of a number of ncRNAs led to decreased expression of their neighboring protein-coding genes, including the master regulator of hematopoiesis, SCL (also called TAL1), Snai1 and Snai2. Using heterologous transcription assays we demonstrated a requirement for the ncRNAs in activation of gene expression. These results reveal an unanticipated role for a class of long ncRNAs in activation of critical regulators of development and differentiation.

Supplemental Content:


INTRODUCTION:

Recent technological advances have allowed the analysis of the human and mouse transcriptomes with an unprecedented resolution. These experiments indicate that a major portion of the genome is being transcribed and that protein-coding sequences only account for a minority of cellular transcriptional output (Bertone et al., 2004; Birney et al., 2007; Cheng et al., 2005; Kapranov et al., 2007). Discovery of RNA interference (RNAi) (Fire et al., 1998) in C. elegans and the identification of a new class of small RNAs known as microRNAs (Lee et al., 1993; Wightman et al., 1993) led to a greater appreciation of RNA’s role in regulation of gene expression. MicroRNAs are endogenously expressed noncoding transcripts that silence gene expression by targeting specific mRNAs on the basis of sequence recognition (Carthew and Sontheimer, 2009). Over 1000 microRNA loci are estimated to be functional in humans, modulating roughly 30% of protein-coding genes (Berezikov and Plasterk, 2005).

While microRNAs represent a minority of the noncoding transcriptome, the tangle of long and short noncoding transcripts is much more intricate, and is likely to contain as yet unidentified classes of molecules forming transcriptional regulatory networks (Efroni et al., 2008; Kapranov et al., 2007). Long ncRNAs are transcripts longer than 100 nts which in most cases mirror the features of protein-coding genes without containing a functional open reading frame (ORF). Long ncRNAs have been implicated as principal players in imprinting and X-inactivation. The imprinting phenomenon dictates the repression of a particular
allele, depending on its paternal or maternal origin. Many clusters of imprinted genes contain ncRNAs, and some of them have been implicated in the transcriptional silencing (Yang and Kuroda, 2007). Similarly, the X chromosome inactivation relies on the expression of a long ncRNA named Xist, which is thought to recruit, in a cis-specific manner, protein complexes establishing repressive epigenetic marks that encompass the chromosome (Heard and Disteche, 2006). There is also a report indicating that a long ncRNA expressed from the HOXC locus may affect the expression of genes in theHOXDlocus which is located on a different chromosome (Rinn et al., 2007). More recently, a set of long ncRNAs has been identified in mouse, through the analysis of the chromatin signatures (Guttman et al., 2009). There has also been reports of divergent transcription of short RNAs flanking transcriptional start sites of the active promoters (Core et al., 2008; Preker et al., 2008; Seila et al., 2008). In search of a function for long ncRNAs, we used the GENCODE annotation (Harrow et al., 2006) of the human genome. To simplify our search we subtracted transcripts overlapping the protein-coding genes. Moreover, we filtered out the transcripts that may correspond to promoters of protein-coding genes and the transcripts that belong to known classes of ncRNAs. We identified 3019 putative long ncRNAs that display differential patterns of expression. Functional knockdown of multiple ncRNAs revealed their positive influence on the neighboring
protein-coding genes. Furthermore, detailed functional analysis of a long ncRNA adjacent to the Snai1 locus using reporter assays demonstrated a role for this ncRNA in an RNAdependent potentiation of gene expression. Our studies suggest---



Figure 1. Identification of Novel Long ncRNAs in Human Annotated by GENCODE

Figure 1. Identification of Novel Long ncRNAs in Human Annotated by GENCODE

(A) Analysis of coding potential using Gene ID for ancestral repeats (AR), long ncRNAs annotated
by GENCODE and protein-coding genes.

(B) Conservation of the genomic transcript sequences for AR, long ncRNAs, protein-coding genes, and

(C) of their promoters.

(D) Expression analysis of 3,019 long ncRNA in human fibroblasts, HeLa cells and primary human
keratinocytes, showing numbers for transcripts detected in each cell line and the overlaps between cell lines. All microarray experiments have been done in four replicates.

See also Figure S1 and Table S1 and Table S2.


---a role for a class of long ncRNAs in positive regulation of proteincoding genes.

RESULTS:

Noncoding RNAs Are Expressed and Respond to Cellular Differentiating Signals

To assign a function to uncharacterized human long ncRNAs, we identified unique long noncoding transcripts using the annotation of the human genome provided by the GENCODE (Harrow et al., 2006) and performed by human and vertebrate analysis and annotation (HAVANA) group at Sanger Institute. Such
genomic annotation is being produced in the framework of the ENCODE project (Birney et al., 2007). At the time of our analysis, the GENCODE annotation encompassed about one third of the human genome. Such an annotation relies on the human expert curation of all available experimental data on transcriptional
evidence, such as cloned cDNA sequences, spliced RNAs and ESTs mapped on to the human genome.

We focused on ncRNAs that do not overlap the protein-coding genes in order to simplify the interpretation of our functional analysis of ncRNAs. This included the subtraction of all transcripts mapping to exons, introns and the antisense transcripts overlapping the protein-coding genes. We also excluded transcripts
within 1 kb of the first and the last exons as to avoid promoter and 30-associated transcripts (Fejes-Toth et al., 2009; Kapranov et al., 2007), that display a complicated pattern of short transcripts (Core et al., 2008; Preker et al., 2008; Seila et al., 2008). Furthermore, we excluded all known noncoding transcripts from our list of putative long ncRNAs. This analysis resulted in 3019 ncRNAs, which are annotated by HAVANA to have no coding potential, expressed from 2286 unique loci (some loci display multiple alternative spliced transcripts) of the human genome (Experimental Procedures, Table S1 available online). The average size of the noncoding transcripts is about 800 nts with a range from 100 nts to 9100 nts. Interestingly, the long
ncRNAs display a simpler transcription unit than that of protein-coding genes (Figure S1A). Nearly 50% of our long ncRNAs contain a single intron in their primary transcript (Figure S1A). Moreover, analysis of their chromatin signatures indicated similarities with protein-coding genes. Transcriptionally active ncRNAs display histone H3K4 trimethylation at their 50-end (Figure S1B) and histone H3K36 trimethylation in the
body of the gene (Figure S1C).

Analysis of protein coding potential of the ncRNAs using GeneID (Blanco et al., 2007; Parra et al., 2000) shows ncRNAs coding potential comparable to that of ancestral repeats (Lunter et al., 2006), supporting the HAVANA annotation of these transcripts as noncoding (Figure 1A). Moreover, comparison of ncRNAs with protein-coding genes and control sequences corresponding to ancestral repeats (Lunter et al., 2006) reveals that ncRNA sequence conservation is lower than that of protein coding genes, but higher thanthat ofancestral repeats (Figure1B). Asimilar case is seen with the promoter regions (Figure 1C). These results are in concordance with previous observations in the mouse genome (Guttman et al., 2009; Ponting et al., 2009).

Next we used custom-made microarrays (Experimental Procedures) which were designed to include an average of six probes (nonrepetitive sequences) against each ncRNA transcript to detect their expression. We analyzed the expression pattern of ncRNAs using three different human cell lines (Figure 1D). Overall, we detected 1167 ncRNAs expressed in at least one of the three cell types and 576 transcripts common among the three cell types (Figure 1D). We validated the expression of 16 ncRNAs that mapped to the 1% of the human genome investigated by the original ENCODE study (Birney et al., 2007) using quantitative polymerase chain reaction (qPCR) in three different cell lines (Table S2). Furthermore, we could find evidence for expression of 80% of our noncoding transcripts in at least one human tissue in a recent high throughput sequencing of the human transcriptome (Wang et al., 2008).

To assess whether ncRNAs respond to cellular differentiating signals, we induced the differentiation of human primary keratinocytes using 12-O-tetradecanoylphobol 13-acetate (TPA). We monitored the expression of ncRNAs using custom microarrays. Expression of protein-coding genes was monitored using
conventional Agilent arrays containing nearly all human mRNAs. We prepared RNA from human primary keratinocytes before and following treatment with TPA. As shown in Figure 2A and Table S3, we could detect 687 ncRNAs in keratinocytes, where 104 (or 15.1%) respond to TPA treatment by over 1.5-fold. Similarly, 21.3% of protein coding-genes display a change in expression of over 1.5-fold (Figure 2B). While around half of the TPA-regulated protein-coding genes increase and a similar proportion decrease their expression following differentiation, 70% of the TPA-regulated ncRNAs increase their expression whereas only 30% show a decrease (Figures 2A and 2B). Furthermore, analysis of the protein-coding genes in the 500 kb window surrounding the TPA-regulated ncRNAs indicates a significant enrichment in genes involved in differentiation and morphogenesis (Figure 2C). An example of such change in expression of an important gene involved in extra-cellular matrix is shown in Figure 2D. Extracellular Matrix Protein 1 (ECM1) gene and an ncRNAadjacent to it displayed a 5 and 1.7 fold induction following TPA treatment, respectively. (Figure 2D, upper panel). qPCRanalysis shows the TPA-mediated induction of ECM1 and the ncRNA
as 14 and 4 fold, respectively (Figure 2D, bottom panel). Taken together, we found that many of the GENCODE annotated transcripts are expressed in multiple cell lines and that they display gene expression responsiveness to differentiation signals.

Noncoding RNAs Display a Transcriptional Activator Function

To assess the function of our set of long ncRNAs, we reasoned that similar to long ncRNAs function at the imprinting loci, our collection of ncRNAs may act to regulate their neighboring genes. To test this hypothesis, we used RNA interference to deplete a set of ncRNAs. We initially chose ncRNAs that showed
a differential expression following keratinocyte differentiation. However, to obtain a reproducible knockdown we had to use cell lines that are permissive to transfection by siRNAs. We used five different cell lines for our analyses in which the candidate ncRNAs display a detectable expression (Figure 3).

We validated the expression of our experimental set of ncRNAs and the absence of protein-coding potential using rapid amplification of 50 and 30 complementary DNA ends (50 and 30 RACE), PCR and in vitro translation (Figure S3). These experiments confirmed the expression of ncRNAs and showed that they do not yield a product in an in vitro translation assay (Figures S3A and S3B), supporting the noncoding annotation of our set of ncRNAs. In two cases, the ncRNAs adjacent to Snai2 and TAL1 loci, we found evidence of a longer ncRNA transcript than that annotated by HAVANA (Figure S3).

We began by examining small interfering RNAs (siRNAs) against the ncRNA next to ECM1 in order to assess its functional role following its depletion (for reasons that will follow, this class of RNA is designated as noncoding RNA-activating1 through 7, ncRNA-a1-7). HEK293 cells were used for these experiments because of the ease of functional knockdown and the detectable amounts of ncRNA-a1 and ECM1 in this cell line. We compared the results obtained using two siRNAs against ncRNA-a1 to data obtained following the transfection of two control siRNAs (for the visual simplicity only one siRNA is shown (Figure 3A), the values for both siRNAs can be seen in Table S4). The two siRNAs produced comparable results. We interrogated a 300 kb window around the ncRNA-a1 containing six protein-coding genes using
qPCR.

Surprisingly, unlike the silencing action of long ncRNAs in imprinting and X-inactivation, depletion of ncRNA-a1 adjacent to ECM1 resulted in a concomitant decrease in expression of the neighboring ECM1 gene (Figure 3A). This effect was specific, as we did not detect any change in the other protein-coding
genes surrounding ncRNA-a1 (Figure 3A). To ascertain that ncRNA-a1 is not a component of the ECM1 30 untranslated region, we used primer pairs spanning the ECM1 and ncRNA-a1 genes. We were not able to detect a transcript comprised of the two genes in HEK293 cells, supporting the contention that the two transcripts are independent transcriptional units (Figure S2A). Furthermore, published ChIP experiments (Euskirchen et al., 2007) show the presence of RNA polymerase II and trimethyl H3K4 peaks at the transcription start site of ncRNA-a1 in several cell lines, further attesting to an independent transcriptional
start site for ncRNA-a1. Moreover, knocking down the ECM1 gene did not affect the expression level of ncRNA-a1 or any of the other protein-coding genes analyzed in the locus, further supporting the independence of ECM1 transcript from that of ncRNA-a1 (Figure S2B).

Next we analyzed ncRNA-a2 flanking the histone demethylase JARID1B/KDAM 5B which also shows increased expression following keratinocyte differentiation. These experiments were performed in HeLa cells as they showed detectable expression of ncRNA-a2. Interestingly, while depletion of ncRNA-a2 did
not change JARID1B/KDAM 5B levels, the KLHL12, a gene known for its negative regulation of the Wnt-beta catenin pathway, on the opposite strand displayed a significant reduction (Figure 3B). Although the decrease in KLHL12 was small (about 20%), no other protein-coding gene in the locus displayed
a difference in expression (Figure 3B).

To extend our findings and to determine whether regulation of neighboring protein-coding genes is a common function of ncRNAs, we interrogated the ncRNA-a3 flanking the stem cell leukemia gene (SCL, also called TAL1). TAL1 is a basic helixloop-helix protein which serves as the master regulator of hematopoiesis (Lecuyer and Hoang, 2004). This locus contains two ncRNAs on different strands of DNA. We used MCF-7 cells to assess the depletion of ncRNA-a3, since the expression of ncRNA-a3 and TAL1 could be readily detected in these cells. However, neither PDZK1IP1 nor ncRNA-a4 could be detected
by qPCR in MCF-7 cells. Depletion of ncRNA-a3 resulted in a specific and potent reduction of TAL1 expression (Figure 3C). While depletion of ncRNA-a3 did not affect either STIL or CMPK1 genes, a significant reduction in CYP4A11 gene on the opposite strand of the DNA was detected (Figure 3C).

We next turned our attention to ncRNA-a4 which was not expressed
at a detectable level in MCF7 cells. We could reliably---




Figure 2. Long ncRNAs Display Responsiveness to Differentiation Signals in Human Primary Keratinocytes

Figure 2. Long ncRNAs Display Responsiveness to Differentiation Signals in Human Primary Keratinocytes

(A and B) Distribution of differentially expressed transcripts (dark colors) following TPA treatment for long ncRNAs (A), and mRNAs (B). Lighter colors show total number of transcripts, darker colors and percentage show number of differentially expressed transcripts. Bar-plots show number and fractions of transcripts induced (red) or repressed (green) at different fold-change cut-offs.

(C) Gene onthology analysis of genes flanking the differentially expressed long ncRNAs (red) compared to genes flanking random positions (black).

(D) Graphic representation of a locus with induction of the long ncRNA ncRNA-a1 and the adjacent ECM1 gene, with expression values from microarrays (upper panel) and qPCR quantification of transcripts (lower panel). Microarray experiments and qPCR validation are done in four replicates. Data shown are mean ± SD.

See also Figure S2 and Table S3.


---detect ncRNA-a4 in Jurkat cells. While we could not efficiently knockdown ncRNA-a3 in Jurkat cells, siRNAs specific to ncRNA-a4 reproducibly reduced its levels by about 50% (Figure 3D). Importantly, reduced levels of ncRNA-a4 resulted in a consistent and significant decrease in the level of the gene
CMPK1 which is over 150 kb downstream of ncRNA-a4




Figure 3. Stimulation of Gene Expression by Activating RNAs

Figure 3. Stimulation of Gene Expression by Activating RNAs

The thick black line representing each gene shows the span of the genomic region including exons and introns. The targeted activating RNAs are shown in red.

Bar-plots show RNA levels as determined by qPCR.

(A) ncRNA-a1 locus in HEK293 cells.
(B) ncRNA-a2 locus in HeLa cells.
(C) ncRNA-a3 locus in MCF-7 cells.
(D) ncRNA-a4 locus in Jurkat cells.
(E) ncRNA-a5 locus in HeLa cells.
(F) ncRNA-a6 locus in A549 cells.

All values are relative to GAPDH expression and relative to control siRNA transfected cells set to an average value of 1. The scale bar represents 100 kb and applies to all figure panels. Error bars show mean ± SEM of at least three independent experiments. *p < 0.05, **p < 0.01, ***p < 0.001
by two-tailed Student’s t test.

See also Figure S3 and Table S4. The results represent at least six independent experiments.


---(Figure 3D). We do not detect any changes in the other proteincoding genes surrounding ncRNA-a4. Next we depleted ncRNA-a5 which is adjacent to the E2F6 gene, an important component of a polycomb-like complex (Ogawa et al., 2002). Knockdown of ncRNA-a5 did not affect the E2F6 gene. However, depletion of ncRNA-a5 resulted in a specific reduction in ROCK2 expression levels in HeLa cells, which is located
upstream of ncRNA-a5 (Figure 3E).

Finally, we examined the Snai1 and Snai2 loci in A549 cells (Figure 3F and Figure 4). The Snail family of transcription factors are implicated in the differentiation of epithelia cells into mesenchymal cells (epithelial-mesenchymal transition) during embryonic development (Barrallo-Gimeno and Nieto, 2005; Savagner, 2001). Snai2 shows a significant reduction in expression when the adjacent ncRNA-a6 is depleted, an effect that is not seen on EFCAB1, the only other protein-coding gene within 300 kb of the ncRNA-a6 (Figure 3F). In total, we have examined 12 loci where we were able to efficiently knockdown the ncRNAs using siRNAs (Table S5). We were able to show that in 7 cases, the ncRNA acts to potentiate the expression of a protein-coding gene within 300 kb of the ncRNA. It is possible that the remaining ncRNAs which did not display a positive effect on the neighboring genes within the 300 kb window, exert their action over longer distances which was not assessed in our analysis. Taken together, our results indicate that a subset of ncRNAs has activating functions and therefore we have named them ncRNAactivator (ncRNA-a) followed by a number to distinguish each activating long ncRNA.

ncRNA-a7 Is a Regulator of Snai1

As mentioned above, Snai1 is a member of the Snail zinc-finger family, which comprises transcription factors with diverse functions in development and disease (Barrallo-Gimeno and Nieto, 2005; Nieto, 2002). The Snail gene family is conserved among species from Drosophila to human and has been shown to function
as mesodermal determinant genes (Barrallo-Gimeno and Nieto, 2005; Nieto, 2002). Snail genes are the regulators of cell adhesion, migration and epithelial-mesenchymal transition (EMT) (Barrallo-Gimeno and Nieto, 2005; Nieto, 2002). Analysis of the ncRNA close to the Snai1 gene provided us with an opportunity
to combine our gene expression analysis with analysis of changes in cellular migration. Knockdown of ncRNA-a7 resulted---




Figure 4. Knockdown of ncRNA-a7 Specifically Targets Snai1 Expression

Figure 4. Knockdown of ncRNA-a7 Specifically Targets Snai1 Expression

(A) As in Figure 3, the ncRNA-a7 locus is depicted showing effects on RNA levels for the surrounding
genes with and without knockdown of ncRNA-a7. The results represent mean ± SEM of at least six
independent experiments. **p < 0.01 by one-tailed Student’s t test.

(B) Migration assay of A549 cells with control (right panel) or ncRNA-a7 (left panel) siRNA transfections.

(C) Quantification of the data shown in (B). Experiments in (B) and (C) are done in three replicates
and are shown as mean ± SEM. ***p < 0.001 by two-tailed Student’s t test.

See also Figure S4 and Table S5.


---in a specific reduction in Snai1 levels (Figure 4A). The expression of the four other protein-coding genes in this locus does not change following the depletion of ncRNA-a7. Concomitantly, knockdown of ncRNA-a7 has a significant phenotypic effect in cell migration assays, reducing the number of migrating cells to about
10% of that of the control (Figures 4B and 4C), consistent with the phenotypic changes following the depletion of Snai1 (Figures 4B and 4C).

Since the knockdown of ncRNA-a7 or Snai1 had similar consequences on cellular migration, we assessed their depletion on gene expression in A549 cells using Agilent arrays. We could not detect the basal level of Snai1 on the array, while Snai1 was readily detectable using quantitative PCR. Interestingly, depletion of Snai1 or ncRNA-a7 resulted in similar changes in gene expression profiles (Figure 5A and Table S6). Not
only did we observe a similar trend in genes that were affected upon the knockdown of either gene but also a significant number of genes that were upregulated were in common in both treatments (Figures 5A and 5B). Since Snai1 is a known transcriptional repressor, depletion of Snai1 or ncRNA-a7 should result in an upregulation of Snai1 target genes. Indeed, a number of genes that were commonly upregulated were direct targets of Snai1 (Figure 5C, upper panel) (De Craene et al., 2005). Depletion of either ncRNA-a7 or Snai1
also resulted in downregulation of a set of genes with a partial overlap between the genes downregulated following the two treatments (Figure 5B). Interestingly, Aurora-kinase A a gene that is 6MB down-stream of ncRNA-a7 was specifically downregulated following the depletion of ncRNA-a7, suggesting a long
range effect for ncRNA-a7 (Figure 5C). Taken together, these results indicate that while the depletion of ncRNA-a7 partially mimic the gene expression profile observed following Snai1 depletion, there are a number of gene expression changes resulting from the ncRNA-a7 depletion that occur independently---




Figure 5. Microarray Analysis of Snai1 and ncRNA-a7 Knockdown

Figure 5. Microarray Analysis of Snai1 and ncRNA-a7 Knockdown

Snai1 or ncRNA-a7 were knocked down using siRNA in A549 cells and the isolated RNA analyzed
on microarrays in duplicate experiments.

(A) All genes differentially expressed (>1.5-fold or <0.6-fold compared to control) in either Snai1 or
ncRNA-a7 knockdown, or both, are shown clustered in a heat map according to expression profile. Numbers are log(2) transformed and color scale is shown below the heat map.

(B) Analysis of genes showing upregulation (>1.5 fold) or downregulation (<0.6 fold) in both Snai1 and ncRNA-a7 knockdown. Numbers represent number of genes regulated in the indicated condition.

(C and D) (C) Validation of microarray data by qPCR and

(D) analysis of the Snai1 locus and targets of Snai1 upon overexpression of ncRNA-a7.
ncRNA-a7 was overexpressed from a vector in A549 cells and expression of select genes were measured by qPCR. Y-axes show expression value relative to GAPDH of the indicated gene. Values are normalized to those of control siRNA transfected cells, set to 1. **p < 0.01, ***p < 0.001 by one-tailed Student’s t test.

See also Table S6.


---of changes in Snai1. Therefore, it is likely that depletion of ncRNA-a7 may have other effects on gene expression which may be mediated through other targets in trans. To specifically address whether ncRNA-a7 may exert its effects in trans, we assessed the gene expression changes in Snai1 locus as well as some of the targets that were changed by depletion of ncRNA-a7 or Snai1 following the overexpression of ncRNA-a7 (Figure 5D). Overall, we did not observe changes in gene expression for any of the ncRNA-a7 targets following its overexpression (Figure 5D, ncRNA-a7 was overexpressed 150 fold). While these results suggest that ncRNA-a7 exerts its local gene expression changes in cis, it is likely that other targets may be influenced in trans. Taken together, these experiments reveal a role for ncRNA-a in positive regulation of expression of neighboring protein-coding genes and show that this effect is not specific to any one locus and may represent a general function for ncRNAs in mammalian cells.

ncRNA Activation of Gene Expression of a Heterologous Reporter

Previous studies have shown that distal activating sequences/enhancers can stimulate transcription when placed adjacent to a heterologous promoter, a methodology widely used to validate potential enhancers (Banerji et al., 1983, 1981; Gillies et al., 1983; Heintzman et al., 2009; Kong et al., 1997). To functionally dissect the influence of the ncRNA activation on the expression of an adjacent gene, we constructed
vectors with inserts containing either ncRNA-a3 and -a4 from a bidirectional promoter, ncRNA-a5 or ncRNA-a7, and placed them downstream of Firefly luciferase driven by a thymidine kinase (TK)
promoter in a reporter vector (pGL3-TK-ncRNA-a), (Figure 6A). We included 1–1.5 kb upstream of the ncRNA-as to contain their endogenous promoters and 500 bps downstream in the reporter vector. We also produced a control vector (pGL3-TK-control) in which 4 kb of DNA without transcriptional potential was cloned down-stream of Firefly luciferase similar to the ncRNA activation reporters (Figure 6B). A vector containing Renilla luciferase was used to control for transfection efficiency. Importantly, inclusion of either of the three ncRNA-a inserts result in an enhancement of transcription ranging from 2- to 7-fold (Figures 6C–6E). This effect is specific as pGL3-TKcontrol vector do not enhance the basal TK promoter activity
(Figures 6C–6E). To demonstrate that the observed potentiation of gene expression is mediated through the action of ncRNA-a, we knocked down the ncRNA-a in question for each reporter construct using specific siRNAs (Figures 6C–6E). Interestingly while depletion of ncRNA-a7 and ncRNA-a5 completely abolished
the increased transcription, depletion of ncRNA-a3 and/or ncRNA-a4 resulted in a partial decrease in transcriptional enhancement (Figures 6C–6E). These results suggest that while ncRNA-a play a major role in transcriptional activation, other DNA elements in the cloned ncRNAa-3/4 region may also contribute to increased transcription.---




Figure 6. ncRNA-Activators Potentiate Transcription of a Reporter Gene

Figure 6. ncRNA-Activators Potentiate Transcription of a Reporter Gene

(A) ncRNA-a 3/4, 5 and 7 were cloned and inserted downstream of luciferase driven by a TK-promoter in a reporter plasmid as shown.

(B) Graphical representation of the inserts in the various vectors used. The pGL3-TK-Control vector contains an insert of approximately 4 kb containing no annotated evidence of transcription. The depicted inserts show exons and transcriptional direction of the ncRNA-a.

(C–E) Luciferase reporter assays. The Firefly luciferase vectors were co-transfected with a Renilla luciferase vector (pRL-TK) for transfection control.

(C) The vector containing ncRNA-a3 and ncRNA-a4 from a bidirectional promoter, with control siRNA or siRNAs toward either of the two ncRNA-a, or both.

(D) Reporter with ncRNA-5, and

(E) the reporter with the ncRNA-a7 inserted downstream of luciferase. X axes show relative Firefly (FL) to Renilla (RL) luciferase activity. Cotransfected siRNAs are indicated to the right of the bars.

All data shown are mean ± SE from six independent experiments. *p < 0.05, **p < 0.01, ***p < 0.001 by onetailed Student’s t tes


---Dissection of the ncRNA-a7 in a Reporter Construct

An important property of enhancing sequences is their orientation independence (Imperiale and Nevins, 1984; Khoury and Gruss, 1983; Kong et al., 1997).We designed reporter constructs (Figure 7A) in which the ncRNA-a7 sequence is reversed (pGL3-TK-ncRNA-a7-RV) in order to assess its orientation independence. The ncRNA-a7-RV construct displayed a similar transcriptional enhancing activity as the construct containing the---




Figure 7. RNA-Dependent Activation of a Reporter Gene by ncNRA-a7

Figure 7. RNA-Dependent Activation of a Reporter Gene by ncNRA-a7

(A) Properties of the ncRNA-a7 containing luciferase reporter vector.

(B, C, E, and F) Luciferase reporter assays. The Firefly luciferase vectors were cotransfected with
a Renilla luciferase vector (pRL-TK) for transfection control.

(D) Semiquantitative PCR of ncRNA-a7.

(B)Reporter experiments with the ncRNA-a7 insert reversed as indicated in the left panel.

(C) The TK-promoter driving luciferase expression was deleted from the construct and
expression values are shown relative to the pGL3-TK control plasmid as a reference.

(E) Truncated reporter constructs containing the ncRNA-a7 promoter and downstream sequences, but not the ncRNA-a7 sequence [pGL3-TK-delta(ncRNA-a7)], or one with a poly(A) signal in the beginning of the ncRNA-a7 to induce premature polyadenylation [pGL3-TK-ncRNA-a7-p(A)].

See also (D) for analysis of expression from these plasmids.

(F) Protein coding sequences were inserted in place of ncRNA-a7 downstream of the ncRNA-a7 promoter. Full-length GTSF1L or ID1 sequences are used. X axes show relative Firefly (FL) to Renilla (RL) luciferase activity. All data shown are mean ± SE from six independent experiments. ***p <
0.001 by one-tailed Student’s t test.


---ncRNA-a7 insert in its endogenous orientation with respect to the regulated gene (Figure 7B).

To show that luciferase expression requires a promoter and that ncRNA-7a cannot act as a proximal promoter, we deleted the TK promoter from the reporter vectors. As shown in Figure 7C, ncRNA-a7 cannot drive transcription of the Firefly luciferase in the absence of a proximal TK promoter. These experiments demonstrate that sequences corresponding to ncRNA-a7 transcription unit can function to activate expression of a heterologous promoter in an orientation-independent manner, but cannot act as a promoter itself.

To further verify that ncRNA-a7 is the active component of the transcriptional enhancement, we constructed two reporters in which ncRNA-a7 sequences are either deleted or shortened by placing a strong polyadenylation signal within the ncRNAa genomic sequence but close to the transcriptional start site,
to induce premature polyadenylation (Figures 7D and 7E). Both modifications result in loss of the increased gene expression (Figure 7E) compared to constructs where ncRNA-a7 is expressed. Finally, to assess whether the RNA corresponding to ncRNA-7a is critical for increased gene expression, we developed
constructs where DNA sequences corresponding to two different protein-coding genes were positioned in the place of ncRNA-a7 (Figure 7F), keeping the endogenous ncRNA-a7 promoter. Neither of these constructs displayed an increased gene expression compared to that of the control constructs
(Figure 7F). Taken together, these experiments demonstrate that the potentiation of gene expression is signaled by the ncRNA-a and is not merely the result of the transcription of the ncRNA.

DISCUSSION:

We used the annotation of the human genome performed by GENCODE to arrive at a collection of long ncRNAs that are expressed from loci independent of those of protein-coding genes or previously described nc RNAs. GENCODE annotation encompasses both protein-coding and noncoding transcripts and relies on experimental data obtained through the analysis of cDNAs, ESTs and spliced RNAs. Our collection of ~3,000 transcripts correspond to the manual curation of about a 1/3 of the human genome. Analysis of the GENCODE data indicates that nearly all of their noncoding annotated transcripts are spliced (Figure S1A).

Importantly, the median distance of an ncRNA transcript to a protein-coding gene is over a 100 kb making it an unlikely scenario for the ncRNA to be an extension of protein-coding transcripts (Figures S2C and S2D). Moreover, transcriptionally active ncRNAs display similar chromatin modifications seen with expressed protein-coding genes (Figures S1B and S1C). Furthermore, the analyzed ncRNAs display RNA pol(II), p300 and CBP occupancy at levels similar to those of the surrounding protein coding genes, consistent with their transcriptional independence (Figure S4). Although our analysis is focused on
understanding the function of a set of ncRNAs annotated by GENCODE, the human transcriptome includes other forms of ncRNAs with important regulatory functions that have not been included in our study. These include the antisense transcripts arising from protein-coding genes, precursors of microRNAs as well as a wealth of unspliced transcripts described in multiple studies (Guttman et al., 2009; Kapranov et al., 2007; Rinn et al., 2007).

Taken together, the novelty of our work lies in the following:

First, we show that at multiple loci of the human genome depletion of a long ncRNA leads to a specific decrease in the expression of neighboring protein-coding genes. Previous studies analyzing the function of long ncRNAs in X-inactivation or the imprinting phenomenon point to their role in silencing of gene
expression (Mattick, 2009).

Second, we show that the enhancement of gene expression by ncRNAs is not cell specific as we observe the effect in five different cell lines.

Third, this enhancement of gene expression is mediated through RNA, as depletion of such activating ncRNAs abrogate increased transcription of the neighboring genes.

Fourth, through the use of heterologous reporter assays, we suggest that activating ncRNAs mediate
this RNA-dependent transcriptional responsiveness in cis.

Fifth, we show that similar to classically defined distal activating sequences, ncRNA-mediated activation of gene expression is orientation independent.

Sixth, we present evidence that similar to defined activating sequences, ncRNAs cannot drive transcription in the absence of a proximal promoter.

Finally, we demonstrate that the activation of gene expression in the heterologous reporter system is mediated through RNA as multiple approaches depleting the RNA levels lead to abrogation of the
stimulatory response.

Therefore, we have uncovered a new biological function in positive regulation of gene expression for
a class of ncRNAs in human cells.

There are previous reports of individual ncRNAs having a positive effect on gene expression. The ~3.8 kb Evf-2 ncRNA was shown to form a complex with the homeodomain-containing protein Dlx2 and lead to transcriptional enhancement (Feng et al., 2006). Similarly, the ncRNA HSR1 (heat-shock RNA-1) forms a complex with HSF1 (heat-shock transcription factor 1), resulting in induction of heat-shock proteins during the cellular heat-shock response (Shamovsky et al., 2006) and an isoform of ncRNASRA(steroid receptor RNAactivator) functions to coactivate steroid receptor responsiveness (Lanz et al., 1999). Our findings that activating ncRNAs positively regulate gene expression extend these previous studies and demonstrate that the activation of gene expression by long ncRNA may be a general function of a class of long ncRNAs. Moreover, whether ncRNA effects seen in our study are mediated through association with specific
transcriptional activators is not known. However, this is a likely scenario given previous examples of an RNA-mediated responsiveness. Other possibilities include a formation of an RNA-DNA hybrid at the locus of the ncRNA or the protein-coding gene which may result in enhanced binding of the sequence specific DNA binding proteins or chromatin modifying complexes.

A recent study uncovers a set of bidirectional transcripts (termed eRNA) that are derived from sites in the human genome that show occupancy by CBP, RNA polymerase II and are decorated by monomethyl Histone H3 lysine 4 (H3K4) (Kim et al., 2010). Moreover, they show that the expression of such transcripts
is correlated with their nearest protein-coding genes. There are fundamental differences between their collection of ~2000 transcripts and our GENCODE set of transcripts. First, while all their eRNAs are bidirectional, only about 1% of our ncRNAs show evidence of bidirectionality (see the example shown in the TAL1 locus). Second, our analysis of the histone modifications of a subset of ncRNAs that are expressed in lymph (Barski et al., 2007) indicates the presence of H3K4 trimethylation at the transcriptional start sites and H3K36 trimethylation at the body of the gene (Figures S1B and S1C). This is in stark contrast to eRNA loci where there is an absence of H3K4 trimethyl marks and the predominant chromatin signature is the
monomethyl H3K4. Third, eRNAs are reported to be predominantly not polyadenylated. The majority of our collection of ncRNAs show evidence of polyadenylation as they were amplified using oligo-dT-primed reactions and furthermore 41% display the presence of a canonical polyadenylation site. Analysis
of the protein-coding transcripts revealed that a similar proportion (52%) to that of our ncRNAs contain the canonical polyadenylation sites. Finally, while we show that a set of our ncRNAs function to enhance gene expression, there is no evidence provided for eRNAs exerting a biological function. While we believe that eRNAs designate a different class of ncRNAs than ncRNA-a described in our study, it is temping to speculate that many of the ncRNA-a and their promoters may correspond to mammalian enhancers or polycomb/trithorax response elements (PRE/TREs). In such a scenario, binding of polycomb or trithorax
proteins to proximal promoters of ncRNA-a will regulate the expression of ncRNA-a which in turn impact the expression of the protein-coding gene at the distance.

Another set of recently published ncRNAs were termed long intervening noncoding RNA or lincRNAs (Guttman et al., 2009). The comparison of our ncRNAs and the lincRNAs show that about 13% of the ncRNAs defined by ENCODE overlap the broad regions encoding a set of recently identified human
lincRNAs (Khalil et al., 2009). The overlap between our ncRNAs and lincRNAs are even smaller (~4%) if one considers only the exons corresponding to lincRNAs. Importantly, the studies with lincRNAs did not reveal any transcriptional effects in neighboring genes (Khalil et al., 2009). Therefore, it is likely that lincRNAs describe a distinct set of ncRNAs compared to those annotated by GENCODE. Similar to the diverse functions for proteins, ncRNAs such as lincRNAs may play other roles in regulating gene expression.

The GENCODE annotation used in this study encompasses only a third of the human genome. Therefore, the number of ncRNAs in human cells is likely to grow and may equal or even surpass the number of protein-coding genes. Our considerations for selection of ncRNAs excluded all ncRNAs associated with
protein-coding genes and their promoters, as well as known ncRNAs. Therefore, the repertoire of the noncoding transcripts in human cells contains many more transcripts than those cataloged in this study. Specifically, there have been reports of pervasive amount of antisense transcription as well as transcription
mapping to promoter regions of protein-coding genes (Core et al., 2008; Denoeud et al., 2007; Kapranov et al., 2007; Preker et al., 2008; Seila et al., 2008). Whether such transcripts will have biological functions similar to those described for activating ncRNAs in our study is not known. However, it is clear that future
genome-wide genetic analysis of ncRNAs in mammalian cells will begin to shed light on different classes of the ncRNAs.

The precise mechanism by which our ncRNAs function to enhance gene expression is not known. We envision a mechanism by which ncRNAs by virtue of sequence or structural homology targets the neighboring protein-coding genes to bring about increased expression. Our experimental evidence using
a heterologous promoter point to the mechanism of action for activating ncRNAs operating in cis. However, genome-wide analysis following depletion of ncRNA-a7 suggested changes in gene expression that may not be related to the action of ncRNA-a7 on its local environment and may be a result of wider trans-mediated effects of ncRNA-a7. Such regulatory functions of ncRNAs could be achieved through an RNA-mediated recruitment of a transcriptional activator, displacement of a transcriptional repressor, recruitment of a basal transcription factor or a chromatin-remodeling factor. While we favor a transcriptional based mechanism for ncRNA activation, effects on RNA stability cannot be excluded. Taken together, the next few years will bring about new prospects for the long ncRNAs as central players in gene expression.

EXPERIMENTAL PROCEDURES:

Extracting Long ncRNA Data

The HAVANA annotation has been downloaded using the DAS server provided by the Sanger institute (version July,16th 2008). We removed all annotated biotypes or functional elements belonging to specific categories such as pseudogenes or protein-coding genes. We excluded all transcripts overlapping with known protein coding loci annotated by HAVANA, RefSeq or UCSC. Transcripts falling into a 1 kb window of any protein-coding gene were also removed. Finally, we excluded all transcripts covered by known noncoding RNAs such as miRNAs (miRbase version 11.0 April 2008).

To estimate the evolutionary constraints among mammalian sequences we constructed the cumulative distribution of PhastCons scores for ancestral repeats (ARs), RefSeq genes and long ncRNAs. The cumulative distributions of these transcripts or repeats are plotted using a log-scale on the y axis.

Cell Culture and siRNA Transfections

Human primary keratinocytes from four different biological donors were grown in Keratinocyte medium (KFSM, Invitrogen). Differentiation was induced by 2.5 ng/ml 12-O-tetradecanoylphorbol-13-acetate (TPA) during 48 hr.

HEK293, A549, HeLa, and MCF-7 cells were cultured in complete DMEM media (GIBCO) containing 10% FBS, and 13 Anti/Anti (GIBCO). Jurkat cells were cultured in complete RPMI media (GIBCO) containing 10% FBS and 13 Anti/Anti (GIBCO). Migration assays were performed as previously described(Gumireddy et al., 2009).

For transfections of 293, HeLa, A549, and MCF-7 cells we used Lipofectamine 2000 (Invitrogen) according to the manufacturer’s recommendations and an siRNA concentration of 50 nM. Jurkat cells were transfected using HiPerFect (QIAGEN) according to the manufacturer’s recommendations and an siRNA concentration of 100 nM.

RNA Purification, cDNA Synthesis, and Quantitative PCR

Cells were harvested and resuspended in TRIzol (Invitrogen) and RNA extracted according to the manufacturer’s protocol. cDNA synthesis was done using MultiScribe reverse transcriptase and random primers from Applied Biosystems. Quantitative PCR was done using SybrGreen reaction mix (Applied
Biosystems) and an HT7900 sequence detection system (Applied Biosystems). For all quantitative PCR reactions Gapdh was measured for an internal control and used to normalize the data.

Cloning of pGL3-TK Reporters and Luciferase Assay

pGL3-Basic was digested with BglII and HindIII and the TK promoter from pRL-TK was inserted into these sites. Inserts were amplified from genomic DNA and cloned into the BamHI and SalI sites 50 to the luciferase gene. Luciferase assays were performed in 96-well white plates using Dual-Glo (Promega) according to the manufacturer’s protocol.

Microarrays

Custom-made microarrays (Agilent) were designed based on the library of 3019 long ncRNA sequences, with on average six probes targeting each transcript. Human whole genome mRNA arrays were from Agilent (G4112F). Total RNA samples were converted to cDNA using oligo-dT primers. Labeling of the
cDNA and hybridization to the microarrays were performed according to Agilent standard dye swap protocols. Data analysis was done using the AFM 4.0 software. All microarrays were done in four biological replicates.

SUPPLEMENTAL INFORMATION:

Supplemental Information includes Extended Experimental Procedures, four figures, and six tables and can be found with this article online at doi:10.1016/j.cell.2010.09.001.

ACKNOWLEDGMENTS:

Thanks to the HAVANA team for use of their genome annotation. We also thank the CRG Genomic Facility and the Functional Genomics Core Facility at Wistar and UPenn for expertise in microarray analysis. We thank Dr. Ken Zaret for helpful discussions.

U.A.O. is supported by a grant from the Danish Research Council;
M.B. is supported by an HFSPO fellowship;
A.G. was supported by a fellowship from the American Italian Cancer Foundation;
R.G. was supported through Spanish ministry, GENCODE U54 HG004555-01, and NIH; and
R.S. was supported by a grant from NIH, GM 079091.

REFERENCES:

Banerji, J., Olson, L., and Schaffner, W. (1983). A lymphocyte-specific cellular
enhancer is located downstream of the joining region in immunoglobulin heavy
chain genes. Cell 33, 729–740.

Banerji, J., Rusconi, S., and Schaffner, W. (1981). Expression of a beta-globin
gene is enhanced by remote SV40 DNA sequences. Cell 27, 299–308.

Barrallo-Gimeno, A., and Nieto, M.A. (2005). The Snail genes as inducers of cell
movement and survival: implications in development and cancer. Development
132, 3151–3161.

Barski, A., Cuddapah, S., Cui, K., Roh, T.Y., Schones, D.E., Wang, Z., Wei, G.,
Chepelev, I., and Zhao, K. (2007). High-resolution profiling of histone methylations
in the human genome. Cell 129, 823–837.

Berezikov, E., and Plasterk, R.H. (2005). Camels and zebrafish, viruses and
cancer: a microRNA update. Hum. Mol. Genet 14, Spec No. 2, R183-190.

Bertone, P., Stolc, V., Royce, T.E., Rozowsky, J.S., Urban, A.E., Zhu, X., Rinn,
J.L., Tongprasit, W., Samanta, M., Weissman, S., et al. (2004). Global identifi-
cation of human transcribed sequences with genome tiling arrays. Science
306, 2242–2246.

Birney, E., Stamatoyannopoulos, J.A., Dutta, A., Guigo, R., Gingeras, T.R.,
Margulies, E.H., Weng, Z., Snyder, M., Dermitzakis, E.T., Thurman, R.E.,
et al. (2007). Identification and analysis of functional elements in 1% of the
human genome by the ENCODE pilot project. Nature 447, 799–816.

Blanco, E., Parra, G., and Guigo, R. (2007). Using geneid to identify genes.
Curr. Protoc. Bioinformatics Chapter 4, Unit 4 3.

Carthew, R.W., and Sontheimer, E.J. (2009). Origins and Mechanisms of miRNAs
and siRNAs. Cell 136, 642–655.

Cheng, J., Kapranov, P., Drenkow, J., Dike, S., Brubaker, S., Patel, S., Long, J.,
Stern, D., Tammana, H., Helt, G., et al. (2005). Transcriptional maps of 10
human chromosomes at 5-nucleotide resolution. Science 308, 1149–1154.

Core, L.J., Waterfall, J.J., and Lis, J.T. (2008). Nascent RNA sequencing
reveals widespread pausing and divergent initiation at human promoters.
Science 322, 1845–1848.

De Craene, B., Gilbert, B., Stove, C., Bruyneel, E., van Roy, F., and Berx, G.
(2005). The transcription factor snail induces tumor cell invasion through
modulation of the epithelial cell differentiation program. Cancer Res. 65,
6237–6244.

Denoeud, F., Kapranov, P., Ucla, C., Frankish, A., Castelo, R., Drenkow, J., Lagarde,
J., Alioto, T., Manzano, C., Chrast, J., et al. (2007). Prominent use of
distal 50 transcription start sites and discovery of a large number of additional
exons in ENCODE regions. Genome Res. 17, 746–759.

Efroni, S., Duttagupta, R., Cheng, J., Dehghani, H., Hoeppner, D.J., Dash, C.,
Bazett-Jones, D.P., Le Grice, S., McKay, R.D., Buetow, K.H., et al. (2008).
Global transcription in pluripotent embryonic stem cells. Cell Stem Cell 2,
437–447.

Euskirchen, G.M., Rozowsky, J.S., Wei, C.L., Lee, W.H., Zhang, Z.D., Hartman,
S., Emanuelsson, O., Stolc, V., Weissman, S., Gerstein, M.B., et al.
(2007). Mapping of transcription factor binding regions in mammalian cells
by ChIP: comparison of array- and sequencing-based technologies. Genome
Res. 17, 898–909.

Fejes-Toth, K., Sotirova, V., Sachidanandam, R., Assaf, G., Hannon, G.J., Kapranov,
P., Foissac, S., Willingham, A.T., Duttagupta, R., Dumais, E., and Gingeras,
T.R. (2009). Post-transcriptional processing generates a diversity of
50-modified long and short RNAs. Nature 457, 1028–1032.

Feng, J., Bi, C., Clark, B.S., Mady, R., Shah, P., and Kohtz, J.D. (2006). The Evf-
2 noncoding RNA is transcribed from the Dlx-5/6 ultraconserved region and
functions as a Dlx-2 transcriptional coactivator. Genes Dev. 20, 1470–1484.

Fire, A., Xu, S., Montgomery, M.K., Kostas, S.A., Driver, S.E., and Mello, C.C.
(1998). Potent and specific genetic interference by double-stranded RNA in
Caenorhabditis elegans. Nature 391, 806–811.

Gillies, S.D., Morrison, S.L., Oi, V.T., and Tonegawa, S. (1983). A tissuespecific
transcription enhancer element is located in the major intron of a rearranged
immunoglobulin heavy chain gene. Cell 33, 717–728.

Gumireddy, K., Li, A., Gimotty, P.A., Klein-Szanto, A.J., Showe, L.C., Katsaros,
D., Coukos, G., Zhang, L., and Huang, Q. (2009). KLF17 is a negative regulator
of epithelial-mesenchymal transition and metastasis in breast cancer. Nat. Cell
Biol. 11, 1297–1304.

Guttman, M., Amit, I., Garber, M., French, C., Lin, M.F., Feldser, D., Huarte, M.,
Zuk, O., Carey, B.W., Cassady, J.P., et al. (2009). Chromatin signature reveals
over a thousand highly conserved large non-coding RNAs in mammals. Nature
458, 223–227.

Harrow, J., Denoeud, F., Frankish, A., Reymond, A., Chen, C.K., Chrast, J.,
Lagarde, J., Gilbert, J.G., Storey, R., Swarbreck, D., et al. (2006). GENCODE:
producing a reference annotation for ENCODE. Genome Biol. 7 (Suppl 1),
S4 1–9.

Heard, E., and Disteche, C.M. (2006). Dosage compensation in mammals:
fine-tuning the expression of the X chromosome. Genes Dev. 20, 1848–1867.

Heintzman, N.D., Hon, G.C., Hawkins, R.D., Kheradpour, P., Stark, A., Harp,
L.F., Ye, Z., Lee, L.K., Stuart, R.K., Ching, C.W., et al. (2009). Histone modifi-
cations at human enhancers reflect global cell-type-specific gene expression.
Nature 459, 108–112.

Imperiale, M.J., and Nevins, J.R. (1984). Adenovirus 5 E2 transcription unit: an
E1A-inducible promoter with an essential element that functions independently
of position or orientation. Mol. Cell. Biol. 4, 875–882.

Kapranov, P., Cheng, J., Dike, S., Nix, D.A., Duttagupta, R., Willingham, A.T.,
Stadler, P.F., Hertel, J., Hackermuller, J., Hofacker, I.L., et al. (2007). RNA
maps reveal new RNA classes and a possible function for pervasive transcription.
Science 316, 1484–1488.

Khalil, A.M., Guttman, M., Huarte, M., Garber, M., Raj, A., Rivea Morales, D.,
Thomas, K., Presser, A., Bernstein, B.E., van Oudenaarden, A., et al. (2009).
Many human large intergenic noncoding RNAs associate with chromatin-modifying
complexes and affect gene expression. Proc. Natl. Acad. Sci. U S A.

Khoury, G., and Gruss, P. (1983). Enhancer elements. Cell 33, 313–314.

Kim, T.K., Hemberg, M., Gray, J.M., Costa, A.M., Bear, D.M., Wu, J., Harmin,
D.A., Laptewicz, M., Barbara-Haley, K., Kuersten, S., et al. (2010). Widespread
transcription at neuronal activity-regulated enhancers. Nature.

Kong, S., Bohl, D., Li, C., and Tuan, D. (1997). Transcription of the HS2
enhancer toward a cis-linked gene is independent of the orientation, position,
and distance of the enhancer relative to the gene. Mol. Cell. Biol. 17, 3955–
3965.

Lanz, R.B., McKenna, N.J., Onate, S.A., Albrecht, U., Wong, J., Tsai, S.Y., Tsai,
M.J., and O’Malley, B.W. (1999). A steroid receptor coactivator, SRA, functions
as an RNA and is present in an SRC-1 complex. Cell 97, 17–27.

Lecuyer, E., and Hoang, T. (2004). SCL: from the origin of hematopoiesis to
stem cells and leukemia. Exp. Hematol. 32, 11–24.

Lee, R.C., Feinbaum, R.L., and Ambros, V. (1993). The C. elegans heterochronic
gene lin-4 encodes small RNAs with antisense complementarity to
lin-14. Cell 75, 843–854.

Lunter, G., Ponting, C.P., and Hein, J. (2006). Genome-wide identification of
human functional DNA using a neutral indel model. PLoS Comput. Biol. 2, e5.

Mattick, J.S. (2009). The genetic signatures of noncoding RNAs. PLoS Genet.
5, e1000459.

Nieto, M.A. (2002). The snail superfamily of zinc-finger transcription factors.
Nat. Rev. Mol. Cell Biol. 3, 155–166.

Ogawa, H., Ishiguro, K., Gaubatz, S., Livingston, D.M., and Nakatani, Y. (2002).
A complex with chromatin modifiers that occupies E2F- and Myc-responsive
genes in G0 cells. Science 296, 1132–1136.

Parra, G., Blanco, E., and Guigo, R. (2000). GeneID in Drosophila. Genome
Res. 10, 511–515.

Ponting, C.P., Oliver, P.L., and Reik, W. (2009). Evolution and functions of long
noncoding RNAs. Cell 136, 629–641.

Preker, P., Nielsen, J., Kammler, S., Lykke-Andersen, S., Christensen, M.S.,
Mapendano, C.K., Schierup, M.H., and Jensen, T.H. (2008). RNA exosome
depletion reveals transcription upstream of active human promoters. Science
322, 1851–1854.

Rinn, J.L., Kertesz, M., Wang, J.K., Squazzo, S.L., Xu, X., Brugmann, S.A.,
Goodnough, L.H., Helms, J.A., Farnham, P.J., Segal, E., and Chang, H.Y.
(2007). Functional demarcation of active and silent chromatin domains in
human HOX loci by noncoding RNAs. Cell 129, 1311–1323.

Savagner, P. (2001). Leaving the neighborhood: molecular mechanisms
involved during epithelial-mesenchymal transition. Bioessays 23, 912–923.

Seila, A.C., Calabrese, J.M., Levine, S.S., Yeo, G.W., Rahl, P.B., Flynn, R.A.,
Young, R.A., and Sharp, P.A. (2008). Divergent transcription from active
promoters. Science 322, 1849–1851.

Shamovsky, I., Ivannikov, M., Kandel, E.S., Gershon, D., and Nudler, E. (2006).
RNA-mediated response to heat shock in mammalian cells. Nature 440,
556–560.

Wang, E.T., Sandberg, R., Luo, S., Khrebtukova, I., Zhang, L., Mayr, C., Kingsmore,
S.F., Schroth, G.P., and Burge, C.B. (2008). Alternative isoform regulation
in human tissue transcriptomes. Nature 456, 470–476.

Wightman, B., Ha, I., and Ruvkun, G. (1993). Posttranscriptional regulation of
the heterochronic gene lin-14 by lin-4 mediates temporal pattern formation in
C. elegans. Cell 75, 855–862.

Yang, P.K., and Kuroda, M.I. (2007). Noncoding RNAs and intranuclear positioning
in monoallelic gene expression. Cell 128, 777–786.




NetworkEditors' Perspectives: "Long noncoding RNAs as Activators of Specific Gene Transcription".

This exciting analysis by Ulf Ørom , Thomas Derrien , Malte Beringer , Kiranmai Gumireddy , Allessandro Gardini , Giovanni Bussotti , Fan Lai, Matthias Zytnicki , Cedric Notredame , Qihong Huang , Roderic Guigo , and Ramin Shiekhattar  reveals a large set of long noncoding RNAs that are capable of activating gene transcription at specific gene sites within human cells.

Such gene activators may play important roles in cell differentiation, maturation, and those gene networks that compose intranuclear  embryomas  within adult human neoplastic cells.

Additional References:

1. Frenster JH, and Hovsepian JA,
"Models of successive levels of resolution during individual gene transcription".

2. Frenster JH, and Hovsepian JA,
 "Micro RNAs and adult neoplasms of embryonic type".

3. Mishra PJ,  and Merlino G,
"MicroRNA reexpression as differentiation therapy in cancer".

4. Taulli R, Bersani F, Foglizzo V, Linari A, Vigna E, Ladanyi M, Tuschl T, and Ponzetto C,
"The muscle-specific microRNA miR-206 blocks human rhabdomyosarcoma growth in xenotransplanted mice by promoting myogenic differentiation".

5. Pique-Regi R, Degner JF, Pai AA, Gaffney DJ, Gilad Y, Pritchard JK,
"Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data".

6. Frenster JH, and Hovsepian JA,
"Reprogramming the human cancer cell nucleus".




Conclusions from Embryoma Genomics:

1. Each cell retains all of its embryonic genes for a lifetime.

2. Controls for embryonic genes are often absent in adults.

3. Uncontrolled embryonic genes can replicate wildly.

4.  Replicating genes participate in  intra-cellular competition.

5.  The basis for gene competition is selective transcription.

6.  MicroRNAs can reprogram embryomic transcription.

7.  Gene reprogramming can produce normal phenotypes.

8.  Normal phenotypes can by-pass chromosomal lesions.

9.  MicroRNA therapy may need to be permanent.

10. Transplantation of microRNAs could be preferred.

http://www.embryomas.net/




Conclusions from Euchromatin Thermodynamic Pathways.

1. Pathways within cell genomes involve a flow of information.

2. Information can flow by direct contact or by third parties.

3. Direct contact within whole genomes is difficult to regulate.

4. DNA-DNA direct contects are influenced by agents.

5. Nuclear agents include hydrophilic ionic and hydrophobic conforming ligands.

6. Third parties within genomes involve RNAs and proteins.

7.  RNAs and proteins are easy to regulate or reverse.

8.  Information can be shared, lost, or transformed.

9. System information can be hidden during system isolation.

10.  Local information can be permanently lost during system entropy.

http://www.cancerbiophysics.net/




Further Topics in:  Euchromatin,  active DNA, and  RNA  ribo-regulators:

Links to Current Research in Euchromatin:
Links to Euchromatin Activator RNA Reviews:
Links to Euchromatin Activator RNA Research:
Links to Ultrastructural Probes of DNase I-Sensitive Sites:
Links to RNA as a Therapeutic Agent:
Links to Hodgkin Lymphoma Immuno-Pathology:
Links to Activated T-Lymphocyte Immunotherapy:
Links to Medical Systems Biology:
Links to Selective Gene Transcription:
Links to RNA-Induced Epigenetics:
Links to RNA-Induced Embryogenesis:
Links to RNA and Biological Causality:
Links to Reprogramming and Neoplasia:

A Brief History of Activator RNA:

"Ultrastructural Probes of Active DNA Sites, and the RNA Activators of DNA".
(PowerPoint Presentation).


Top of Page - Euchromatin NetworkEuchromatin ResearchResearch in Quantitative Radiology


For Further Information and Feedback:

Jeannette A. Hovsepian, M.D.
E-mail: frensasc@ix.netcom.com
Phone:  +1 650 367 6483



euchromatin: "the most active portion of the genome within the cell nucleus".
embryoma:  "adult neoplasm expressing one or more embryo-exclusive genes".
entropy:  "maximum entropy defines the isolated reaction steady-state equilibrium".
EMT: "activated embryonic gene network driving cancer progression".
enhancers: "long noncoding RNAs capable of activating gene transcription".