Miao-Chih Tsai 1, Ohad Manor 2, Yue Wan 1, Nima Mosammaparast 3, Jordon K. Wang 1, Fei Lan, 3, 4, Yang Shi 3, Eran Segal 2, Howard Y. Chang, 1, *
1 Howard Hughes Medical Institute and Program in Epithelial
Biology, Stanford University School of Medicine, Stanford, CA 94305, USA.
2 Department of Computer Science and Applied Mathematics,
Weizmann Institute of Science, Rehovot 76100, Israel.
3 Department of Pathology, Harvard Medical School, and
Division of New Born Medicine, Department of Medicine, Children’s Hospital
Boston, Boston, MA 02138, USA.
4 Constellation Pharmaceuticals, 215 First Street, Suite
200, Cambridge, MA 02142, USA.
* To whom correspondence should be addressed. E-mail: howchang@stanford.edu
Received for publication 7 May 2010. Accepted for publication 28 June 2010.
Long intergenic noncoding RNAs (lincRNAs) regulate chromatin
states and epigenetic inheritance. Here, we show that the lincRNA
HOTAIR serves as a scaffold for at least two distinct histone modification
complexes. A 5' domain of HOTAIR binds Polycomb Repressive Complex 2 (PRC2)
while a 3' domain of HOTAIR binds the LSD1/CoREST/REST complex. The ability
to tether two distinct complexes enables RNA-mediated assembly of
PRC2 and LSD1, and coordinates targeting of PRC2 and LSD1 to chromatin
for coupled histone H3 lysine 27 methylation and lysine 4 demethylation.
Our results suggest that lincRNAs may serve as scaffolds by providing
binding surfaces to assemble select histone modification enzymes, and thereby
specify the pattern of histone modifications on target genes.
Introduction:
Long intergenic noncoding RNAs (lincRNAs) regulate dosage
compensation, imprinting, and developmental gene expression by establishing
chromatin domains in an allele-and cell-type specific manner [reviewed
by (1, 2). LincRNAs are intimately associated with chromatin
remodeling complexes (3–7), but molecular mechanisms
of their functions are still lacking. Post-translational modifications
of histones recruit DNA binding proteins and chromatin remodeling machinery,
and are often coupled for combinatorial control [reviewed by
(8). For instance in embryonic stem cells, many genes encoding developmental
regulators, such as the HOX, are transcriptionally silent but possess
bivalent histone H3 lysine 4 (H3K4) and lysine 27 (H3K27)
methylation, which are resolved into univalent H3K4 or H3K27 methylation
domains upon differentiation (9, 10). Here we show that
a lincRNA can coordinate histone modifications by binding to multiple histone
modification enzymes.
The lincRNA HOTAIR is transcribed from the HOXC locus and
targets Polycomb Repressive Complex 2 (PRC2, comprised of H3K27 methylase
EZH2, SUZ12, and EED) to silence HOXD and select genes on other
chromosomes (7, 11). The genomic regions
flanking HOXD are also bound by CoREST/REST repressor complexes
(12), which contain LSD1 (KDM1/BHC110), a demethylase that mediates enzymatic
demethylation of H3K4me2 (13) and that is required for
proper repression of Hox genes in Drosophila (14).
We therefore hypothesized that HOTAIR maycoordinately interact with both
PRC2 and LSD1. Immunoprecipitation (IP) of either endogenous LSD1
or FLAG-tagged LSD1 from primary foreskin fibroblasts or HeLa cells specifically
retrieved endogenous HOTAIR RNA with comparable enrichment to EZH2 IP,
the positive control (Fig. 1A and fig.
S1A). IP of three other chromatin proteins did not retrieve
HOTAIR (fig. S1A), and LSD1,
EZH2, or FLAG-LSD1 IP did not retrieve U1 RNA, a nuclear ncRNA that
served as a negative control. Purified biotinylated HOTAIR RNA, but not
GFP RNA or an antisense HOTAIR fragment, specifically retrieved EZH2, SUZ12,
and LSD1 from HeLa cell nuclear extract (Fig. 1B and
fig.
S1B). LSD1 forms a complex with CoREST (15), which
can bridge LSD1 to the neuronal gene silencer REST (16).
REST is believed to mediate silencing through two distinct effector arms:
one via LSD1-CoREST, and separately via the adaptor protein CDYL and the
H3K9 KMT G9a (17). HOTAIR specifically bound to CoREST
and REST, but not CDYL or G9a, nor to the putative PRC1 subunit YY1 (Fig.
1B). Further, biotinylated HOTAIR bound to purified PRC2 and LSD1 complexes
in vitro (Fig. 1C and fig.
S1C). These results suggest that HOTAIR directly interacts with PRC2
and LSD1 complexes.
Using a series of HOTAIR deletion mutants, the PRC2 binding activity mapped to nucleotides 1 to 300 of HOTAIR, while the LSD1 complex binding activity mapped to nucleotides 1500 to 2146 (Fig. 1D). Deletion mutants that retained nucleotides 1 to 300 bound EZH2 or SUZ12 with equal efficiency as full length HOTAIR, and deletion mutants that retained nucleotides 1500 to 2146 retained LSD1 binding activity. Thus, HOTAIR is a modular bifunctional RNA has distinct binding domains for PRC2 and LSD1 complexes. Computational analysis and RNA footprinting showed that the PRC2 and LSD1 binding domains of HOTAIR are likely to possess extensive but distinct secondary structures (fig. S2).
The presence of independent binding sites for PCR2 and LSD1 on HOTAIR suggests that HOTAIR may bridge PRC2 and LSD1 complexes. EZH2 IP retrieved LSD1, and conversely LSD1 IP retrieved EZH2 from foreskin fibroblasts (Fig. 2A). We estimate that less than 5% of the two complexes physically interact with each other, consistent with prior purification results that isolated PRC2 and CoRESTLSD1 as separate complexes (18, 19). RNAi of HOTAIR or RNase treatment of the IP abrogated the interaction between EZH2 and LSD1, suggesting that HOTAIR is required to bridge this interaction (Fig. 2A and fig. S3). Wild-type HeLa cells or HeLa cells stably expressing FLAG-LSD1 (FL-HeLa) expressed ~ten-fold less HOTAIR than foreskin fibroblasts and showed undetectable endogenous interaction between PRC2 and LSD1. Enforced expression of HOTAIR in FLHeLa cells to a level comparable to foreskin fibroblasts allowed robust interaction between PRC2 and LSD1 (Fig. 2B). Gel filtration chromatography confirmed that HOTAIR expression shifts PRC2 subunits into a higher molecular weight complex coincident with the LSD1 complex, suggesting the formation of a higher ordered complex comprised of HOTAIR, PRC2, and LSD1 complexes in HOTAIR overexpressing cells (fig. S4). Moreover, expression of each HOTAIR mutant that lacked the ability to bind either PRC2 or LSD1 in vitro failed to induce PRC2- LSD1 interaction in cells (Fig. 2C and fig. S3C).
HOTAIR-mediated bridging of PRC2 and LSD1 complexes also enables
their coordinate binding to target
genes on chromatin. HOTAIR is required for H3K27 methylation
and transcriptional silencing across the HOXD
locus (7). Therefore, we mapped PRC2 (as indicated
by SUZ12) and LSD1 occupancy across the HOX loci and on
promoters genome-wide by chromatin IP followed by microarray
analysis (ChIP-chip), in primary foreskin
fibroblasts after control RNAi or HOTAIR knock down (Fig.
3 and figs. S5 to S7). HOTAIR
knockdown decreased SUZ12 and LSD1 occupancy in a similar pattern across
HOXD (Fig. 3, A and B, and fig.
S6; R = 0.59, p < 10-9, t-test).
Coordinate loss of SUZ12 and LSD1 occupancy caused by HOTAIR knockdown
were concentrated in proximal promoters of HOXD genes (Fig.
3B). These regions correspondingly lost H3K27me3 and gained H3K4me2,
the respective histone methylation products of PRC2 and LSD1 complexes
(Fig. 3, A and C, and figs.
S6 and S7; R = 0.40, p < 10-9, t-test).
The loss of H3K27me3 occurred across broad domains
encompassing multiple HOXD genes and intergenic regions,
while the gain of H3K4me2 was concentrated near the transcriptional
start sites of HOXD genes (8). Multiple independent
siRNAs targeting HOTAIR gave the same results.
Examining human promoters genome-wide, ChIP-chip analysis
showed that PRC2 and LSD1 occupied 4740 and
2116 gene promoters, respectively (Fig. 3D).
Nearly one third of LSD1 occupied promoters, comprised of 721 genes,
were
also occupied by SUZ12, revealing a significant overlap (257
overlap expected by chance alone. p = 3.4 × 10-164,
hypergeometric distribution). Among these 721 genes cooccupied by SUZ12
and LSD1, the distances between the binding sites of SUZ12 and
LSD1 were predominantly less than 500 base pairs, which is the fragmentation
size of chromatin in our ChIP assay and the limit of resolution
(fig. S8A).
HOTAIR knockdown led to concordant loss of SUZ12 and
LSD1 occupancy in 289 of the 721 genes normally cooccupied by SUZ12
and LSD1 (almost 40%) (Fig. 3E and table
S1). Additional genes showed more exclusive loss of LSD1 occupancy
(33%)
or SUZ12 occupancy (16%), suggesting that HOTAIR may be involved
in other LSD1- or SUZ12-dependent pathways. ChIP followed by qPCR confirmed
the requirement of HOTAIR for PRC2 and LSD1 localization for all six genes
tested (fig. S8C). HOTAIR knockdown
did not change the chromatin occupancy by PRC2 and LSD1 at hundreds
of other genes, nor did it affect the protein or mRNA level
of the subunits of PRC2 or LSD1 complexes (Fig. 2A and
fig.
S9, A to C). The functional consequence of coordinate targeting of
PRC2 and LSD1 by HOTAIR is gene repression: genes co-occupied by
SUZ12-LSD1 in a HOTAIR-dependent manner are also significantly induced
upon HOTAIR knockdown as measured by microarray or qRT-PCR [p
< 0.05, Gene Set Enrichment Analysis
(20); Fig.
3F and fig. S8D]. These results
suggest that a single lincRNA—HOTAIR—may be required to target both
PRC2 and LSD1 to hundreds of genes across the
genome in order to coordinate histone modifications for gene
silencing.
Both PRC2 and LSD1 can bind multiple proteins that are thought
to provide DNA target specificity (16, 21).
A
possible consequence of the HOTAIR-mediated bridging is that PRC2
may be recruited to LSD1-CoREST-REST binding sites, and conversely,
LSD1 may be recruited to PRC2 binding sites. Prior genome-scale
mapping studies of PRC2 already identified the REST motif as one
of the most enriched DNA sequence motifs within PRC2 binding sites
but with no mechanistic explanation (22). We searched
for enriched sequence motifs in SUZ12 binding sites lost upon HOTAIR knock
down (“HOT-S sites” for short) and identified several enriched motifs
(23), including a motif that corresponds to the right
half of the canonical REST motif (p = 1.05 × 10-12;
Fig.
4A
and fig. S10). REST is
able to bind only one half site of the canonical REST motif (24),
and genes containing HOT-S sites are enriched for experimentally
measured REST occupancy (p < 1.27 × 10-16,
hypergeometric distribution; fig.
S9D and table S2) (24).
The most significantly enriched motif in LSD1 binding sites that are lost
upon HOTAIR knockdown (termed “HOT-L sites”) is a CG-rich motif
(p = 3.66 × 10-10; Fig. 4B and
fig.
S10), which is important for PRC2 binding (22, 25,
26). Thus, the enrichment of the CG-rich motif may reflect the HOTAIR-dependent
recruitment
of LSD1 complexes to PRC2 bound sites, which are often in CpG islands.
We examined the gain of SUZ12 and LSD1 occupancy on chromatin when HOTAIR
is overexpressed in primary lung fibroblasts, which
do not express
endogenous HOTAIR. HOTAIR overexpression caused ectopic occupancy
of LSD1 and SUZ12 that significantly overlapped (p < 7.31
× 10-95). Further, motif analysis of the ectopically gained
binding sites recovered an almost identical CG-rich motif (p
= 7.9 × 10-37; Fig. 4C), suggesting
that this motif is involved in HOTAIR target selection. Nonetheless,
the REST half site and the CG-rich motif are currently not sufficient for
de
novo prediction of all HOTAIR-dependent genes,
suggesting that additional
motifs, binding partners, and/or motif arrangements may be important.
In this report, we demonstrate that the lincRNA HOTAIR can link
a histone methylase and a demethylase by acting as a modular scaffold
(fig. S11). Other lincRNAs
may also contain multiple binding sites for distinct protein complexes
that direct specific combinations of histone modifications on target
gene chromatin. Some lincRNAs may be “tethers” that recruit
several chromatin modifications to their sites of synthesis (2),
while other lincRNAs can act on distantly located genes as “guides”
to affect their chromatin states (2). Based on
their dynamic patterns of expression (27), specific
lincRNAs can potentially direct complex patterns of chromatin states
at specific genes in a spatially and temporally organized manner
during development and disease states.
NetworkEditors' Perspectives: "Repression or de-repression
of target genes by long noncoding RNAs".
This exciting study by Miao-Chih Tsai, Ohad Manor, Yue Wan, Nima Mosammaparast, Jordon Wang, Fei Lan , Yang Shi, Eran Segal , and Howard Chang expands our views of long noncoding RNAs simultaneously binding to one or more genes, target promoters, other RNAs, and selected proteins, forming polyvalent complexes that can be alternatively repressed or de-repressed for gene activity and expression. A new form of tethered gene networks appears likely, with profound effects on new studies of gene regulation, epigenetics, and reprogramming.
3D Mapping of Competing Ligand Factors during Single Gene Transcription.
3D Mapping of Competing Ligand Factors during Single Gene Transcription.
Several layers of transcription factors are present during the selection and initiation phases of individual gene transcription. The DNA coding sequence within open-looped and/or paired euchromatin 10 nm microfibrils is exposed to a growing cluster of primary ligands to DNA . Primary ligands may encounter counter-ligands in their immediate vicinity, thus forming ligand-counter ligand complexes.
Hydrophobic ligands are bound preferably to single-stranded portions of DNA and/or RNA.
1. The first ligand layer consists of small RNA molecules, synthesized in upstream or downstream cis- or trans- enhancer loci. Such enhancer RNAs find their preferred binding sites in complementary RNA molecules within the promoter regions of the selected gene.
2. The second layer finds the sense-DNA template already covered by the first layer of ligands. This second layer is also antagonized by intruding antisense-DNA kiss sequences on the same or other chromosomes within the cell. As a consequence, second layer ligands, whether they are histones, RNAs, proteins, lipids, hormones, or drug molecules, may not gain or retain direct contact with the sense-DNA template during active gene transcription.
RNA Polymerase II and other necessary enzymes are recruited between the layers into the growing transcription clusters early in the initiation stage.
3, 4, and 5. The third, fourth and fifth layers include sequences for binding to other euchromatin, heterochromatin, and nuclear membrane sites, respectively.
John H. Frenster, and Jeannette A. Hovsepian,
"Models
of successive levels of resolution during individual gene transcription".
We thank members of the D. Herschlag lab for assistance with RNA footprinting; X. Tan, P. Khavari, and J. Wysocka for critical reading of the manuscript. Supported by California Institute for Regenerative Medicine (RN1-00529-1 to H.Y.C.), NIH (R01-HG004361 to H.Y.C. and E.S, R01-CA118487 to Y.S), Susan G. Komen Foundation (M.-C.T.), Azrieli Foundation (O.M.), National Science Foundation (J.K.W.), and A-STAR (Y.W.). E.S. is the incumbent of the Soretta and Henry Shapiro career development chair. Y.S. is cofounder and on the scientific advisory board of Constellation Pharmaceuticals. H.Y.C. is an Early Career Scientist of the Howard Hughes Medical Institute.
http://www.sciencemag.org/cgi/content/full/sci;science.1192002/DC1
Materials and Methods
Figs. S1 to S11
Tables S1 and S2
References and Notes
Fig. 1. 5' domain of HOTAIR binds PRC2 and 3' domain of HOTAIR binds LSD1.
(A) LSD1 IP specifically retrieves HOTAIR RNA. Data (mean ± SD, n = 3) is relative to mock-IP (IgG or FLAG). ND, not-detectable.
(B) In vitro transcribed (IVT), biotinylated HOTAIR retrieves EZH2, LSD1, CoREST, and REST, but not G9a, CDYL, or YY1.
(C) IVT biotinylated HOTAIR binds to purified PRC2 and LSD1 complexes.
PRC2_3m:
recombinant purified core PRC2 complex with 3 members (EZH2, SUZ12, EED).
PRC2_5m:
recombinant purified PRC2 complex with 5
members (+RbAP48, AEBP2). LSD1.com: Tandem affinity purified
protein complex associated with FLAG-HA-LSD1 from HeLa cells. Composition
of protein complexes are shown in fig.
S1C.
(D)The first 300 bp (lined boxes) of HOTAIR is necessary
and sufficient to bind PRC2; the last 646 bp (meshed boxes)
is necessary and sufficient to bind LSD1 complex. The profiles are established
by RNA pulldown
of HeLa extract; retrieved proteins are detected by immunoblotting.
Fig. 2. HOTAIR is necessary and sufficient for interaction between EZH2 and LSD1.
(A) In foreskin fibroblasts, EZH2 interacts with LSD1 (lanes 1, 4). Knockdown of HOTAIR (lanes 3, 6), but not GFP (lanes 2, 5), abolishes this interaction. HOTAIR levels (mean ± SD) are shown on the right.
(B) HOTAIR expression in FLAG-LSD1 HeLa cells induces EZH2 and LSD1 interaction (lanes 3 and 6).
(C) Full length HOTAIR induces EZH2 and LSD1 interaction (lanes 3, 10) but not HOTAIR mutants lacking either 5' or 3' domain (lanes 4 to 7 and 11 to 14). Presence of indicated RNA domains is confirmed by RT-PCR (bottom panel).
Fig. 3. HOTAIR coordinates localization of PRC2 and LSD1 genome-wide.
(A) Changes in mRNA and occupancy of H3K4me2, H3K27me3, LSD1, and SUZ12 across HOXD locus after RNAi of HOTAIR in foreskin fibroblasts. Yellow boxes indicate regions of notable correlation between gain of H3K4me2 and concordant loss of LSD1, H3K27me3, and SUZ12.
(B) The patterns of change in LSD1 (x axis) and SUZ12 occupancy (y axis) upon HOTAIR knockdown across the HOXD locus are significantly correlated (Pearson correlation, R = 0.59, p < 10-9, t-test). This correlation is concentrated in proximal promoters of HOXD genes (R =0.86).
(C) Positive correlation of changes in SUZ12 (x axis) and H3K27me3 occupancy (y axis) and negative correlation of LSD1 (x axis) and H3K4me2 occupancy (y axis).
(D) Venn diagram shows the genes occupied by SUZ12 (4740 genes), LSD1 (2116 genes), or both (721 genes).
(E) Heatmap of SUZ12 and LSD1 co-occupied genes (721 genes). Each column is an experiment; each row is a gene. HOTAIR knockdown led to concordant loss of SUZ12 and LSD1 occupancy. Chromatin occupancy is indicated in blue per the scale bar.
(F) HOTAIR knockdown leads to transcription de-repression of target genes. Mean ± SD of qRT-PCR data are shown.
Fig. 4. HOTAIR-dependent SUZ12 and LSD1 binding motifs
(A) SUZ12 occupancy sites lost upon HOTAIR knockdown (HOT-S sites)
are enriched for a DNA motif very
similar to the right half of canonical REST motif.
(B) LSD1 occupancy sites lost upon HOTAIR knockdown (HOT-L sites) are enriched for a CG-rich motif.
(C) A nearly identical CG-rich motif is enriched in LSD1/SUZ12 binding sites gained upon HOTAIR overexpression, suggesting that this motif is involved in HOTAIR target selection.
1. Wang JK, Tsai M-C, Poulin G, Adler AS, Chen S, Liu H, Shi Y and
Chang HY
"The histone demethylase UTX enables RB-dependent cell fate control".
http://genesdev.cshlp.org/content/24/4/327.long
2. Gupta RA, Shah N, Wang KC, Kim J, Horlings HM, Wong DJ, Tsai M-C,
Hung T, Argani P, Rinn JL, Wang Y, Brzoska P, Kong B, Li R, West RB, van
de Vijver MJ, Sukumar S, and Chang HY,
"Long non-coding
RNA HOTAIR reprograms chromatin state to promote cancer metastasis".
3. Frenster JH, and Hovsepian JA,
"Models of
successive levels of resolution during individual gene transcription".
Conclusions from Embryoma Genomics:
1. Each cell retains all of its embryonic genes for a lifetime.
2. Controls for embryonic genes are often absent in adults.
3. Uncontrolled embryonic genes can replicate wildly.
4. Replicating genes participate in intra-cellular competition.
5. The basis for gene competition is selective transcription.
6. MicroRNAs can reprogram embryomic transcription.
7. Gene reprogramming can produce normal phenotypes.
8. Normal phenotypes can by-pass chromosomal lesions.
9. MicroRNA therapy may need to be permanent.
10. Transplantation of microRNAs could be preferred.
1. Pathways within cell genomes involve a flow of information.
2. Information can flow by direct contact or by third parties.
3. Direct contact within whole genomes is difficult to regulate.
4. DNA-DNA direct contects are influenced by agents.
5. Nuclear agents include hydrophilic ionic and hydrophobic conforming ligands.
6. Third parties within genomes involve RNAs and proteins.
7. RNAs and proteins are easy to regulate or reverse.
8. Information can be shared, lost, or transformed.
9. System information can be hidden during system isolation.
10. Local information can be permanently lost during system entropy.
Links to Current
Research in Euchromatin:
Links to
Euchromatin Activator RNA Reviews:
Links to
Euchromatin Activator RNA Research:
Links to Ultrastructural
Probes of DNase I-Sensitive Sites:
Links to
RNA as a Therapeutic Agent:
Links to Hodgkin Lymphoma
Immuno-Pathology:
Links to Activated
T-Lymphocyte Immunotherapy:
Links to Medical
Systems Biology:
Links to Selective
Gene Transcription:
Links to RNA-Induced
Epigenetics:
Links to RNA-Induced
Embryogenesis:
Links to RNA and
Biological Causality:
Links to Reprogramming
and Neoplasia:
A Brief History of Activator RNA:
"Ultrastructural
Probes of Active DNA Sites, and the RNA Activators of DNA".
(PowerPoint Presentation).
Top of Page - Euchromatin
Network - Euchromatin
Research - Research
in Quantitative Radiology
For Further Information and Feedback:
Jeannette A. Hovsepian, M.D.
E-mail: frensasc@ix.netcom.com
Phone: +1 650 367 6483