ABSTRACT Recent advances in genome-wide techniques allowed the identification of thousands of non-coding RNAs with various sizes in eukaryotes, some of which have further been shown to serve important functions in man...ABSTRACT Recent advances in genome-wide techniques allowed the identification of thousands of non-coding RNAs with various sizes in eukaryotes, some of which have further been shown to serve important functions in many biologi- cal processes. However, in model plant Arabidopsis, novel intermediate-sized ncRNAs (im-ncRNAs) (50-300 nt) have very limited information. By using a modified isolation strategy combined with deep-sequencing technology, we identified 838 im-ncRNAs in Arabidopsis globally. More than half (58%) are new ncRNA species, mostly evolutionary divergent. Interestingly, annotated protein-coding genes with 5'-UTR-derived novel im-ncRNAs tend to be highly expressed. For intergenic im-ncRNAs, their average abundances were comparable to mRNAs in seedlings, but subsets exhibited signifi- cantly lower expression in senescing leaves. Further, intergenic im-ncRNAs were regulated by similar genetic and epige- netic mechanisms to those of protein-coding genes, and some showed developmentally regulated expression patterns. Large-scale reverse genetic screening showed that the down-regulation of a number of im-ncRNAs resulted in either obvious molecular changes or abnormal developmental phenotypes in vivo, indicating the functional importance of im-ncRNAs in plant growth and development. Together, our results demonstrate that novel Arabidopsis im-ncRNAs are developmentally regulated and functional components discovered in the transcriptome.展开更多
Accumulating evidence suggests that non-coding RNAs (ncRNAs) are both widespread and functionally important in many eukaryotic organisms. In this study, we employed a special size fractionation and cDNA library cons...Accumulating evidence suggests that non-coding RNAs (ncRNAs) are both widespread and functionally important in many eukaryotic organisms. In this study, we employed a special size fractionation and cDNA library construction method followed by 454 deep sequencing to systematically profile rice intermediate-size ncRNAs. Our analysis resulted in the identification of 1349 ncRNAs in total, including 754 novel ncRNAs of an unknown functional category. Chromosome distribution of all identified ncRNAs showed no strand bias, and displayed a pattern similar to that observed in protein-coding genes with few chromosome dependencies. More than half of the ncRNAs were centered around the plus-strand of the 5' and 3' termini of the coding regions. The majority of the novel ncRNAs were rice specific, while 78% of the small nucleolar RNAs (snoRNAs) were conserved. Tandem duplication drove the expansion of over half of the snoRNA gene families. Furthermore, 90% of the snoRNA candidates were shown to produce small RNAs between 20-30 nt, 80% of which were associated with ARGONAUT proteins generally, and AGOlb in particular. Overall, our findings provide a comprehensive view of an intermediate-size non-coding transcriptome in a monocot species, which will serve as a useful platform for an in-depth analysis of ncRNA functions.展开更多
Although only about 2%of the human genome has proved to be protein-coding genes,recent advances in genome wide analysis have revealed that the majority of the genome is transcribed,mainly from noncoding segments that ...Although only about 2%of the human genome has proved to be protein-coding genes,recent advances in genome wide analysis have revealed that the majority of the genome is transcribed,mainly from noncoding segments that were once considered"junk sequences"or"dark matters"(Liu et al.,2011a;Zhang et al.,2014b). In addition to the well-characterized housekeeping non- coding RNAs (ncRNAs) (tRNA, rRNA, small nuclear RNA and small nucleolar RNAs) and some small regulatory ncRNAs (microRNAs and small interfering RNAs), the transcriptome of mammals could also pervasively have been transcribed long noncoding RNAs (lncRNAs, at least 200 nt) (Rinn and Chang, 2012; Xie et al., 2012).展开更多
基金grants from the National Basic Research Program of China (973 Program),the National Natural Science Foundation of China,in part by the Peking-Tsinghua Center for Life Sciences and a grant from the Next-Generation BioGreen 21 Program,Rural Development Administration,Republic of Korea
文摘ABSTRACT Recent advances in genome-wide techniques allowed the identification of thousands of non-coding RNAs with various sizes in eukaryotes, some of which have further been shown to serve important functions in many biologi- cal processes. However, in model plant Arabidopsis, novel intermediate-sized ncRNAs (im-ncRNAs) (50-300 nt) have very limited information. By using a modified isolation strategy combined with deep-sequencing technology, we identified 838 im-ncRNAs in Arabidopsis globally. More than half (58%) are new ncRNA species, mostly evolutionary divergent. Interestingly, annotated protein-coding genes with 5'-UTR-derived novel im-ncRNAs tend to be highly expressed. For intergenic im-ncRNAs, their average abundances were comparable to mRNAs in seedlings, but subsets exhibited signifi- cantly lower expression in senescing leaves. Further, intergenic im-ncRNAs were regulated by similar genetic and epige- netic mechanisms to those of protein-coding genes, and some showed developmentally regulated expression patterns. Large-scale reverse genetic screening showed that the down-regulation of a number of im-ncRNAs resulted in either obvious molecular changes or abnormal developmental phenotypes in vivo, indicating the functional importance of im-ncRNAs in plant growth and development. Together, our results demonstrate that novel Arabidopsis im-ncRNAs are developmentally regulated and functional components discovered in the transcriptome.
基金This work was supported by grants from National Basic Research Program of China (973 Program) (2012CB910900) National Natural Science Foundation of China (31171156, U1031001)+1 种基金the Ministry of Science and Technology of China (2011CB100101, 2009DFB30030, 2008AA022301) and the Ministry of Agriculture of China (2008ZX08012-005, 2009ZX08012-021 B).We thank Dr. Ning Wei and Abigail Coplin for reading and commenting this manuscript. No conflict of interest declared.
文摘Accumulating evidence suggests that non-coding RNAs (ncRNAs) are both widespread and functionally important in many eukaryotic organisms. In this study, we employed a special size fractionation and cDNA library construction method followed by 454 deep sequencing to systematically profile rice intermediate-size ncRNAs. Our analysis resulted in the identification of 1349 ncRNAs in total, including 754 novel ncRNAs of an unknown functional category. Chromosome distribution of all identified ncRNAs showed no strand bias, and displayed a pattern similar to that observed in protein-coding genes with few chromosome dependencies. More than half of the ncRNAs were centered around the plus-strand of the 5' and 3' termini of the coding regions. The majority of the novel ncRNAs were rice specific, while 78% of the small nucleolar RNAs (snoRNAs) were conserved. Tandem duplication drove the expansion of over half of the snoRNA gene families. Furthermore, 90% of the snoRNA candidates were shown to produce small RNAs between 20-30 nt, 80% of which were associated with ARGONAUT proteins generally, and AGOlb in particular. Overall, our findings provide a comprehensive view of an intermediate-size non-coding transcriptome in a monocot species, which will serve as a useful platform for an in-depth analysis of ncRNA functions.
基金supported by the grants from the National Key Research and Development Plan (2016YFA0100702,2016YFC0902502)the National Key Basic Research Program (973 Program) (Nos.2013CB531304 and 2011CBA01104)+1 种基金the National Sciences Foundation of China (Nos. 31301152,31670789,31671316,31370789 and 30825023)CAMS Innovation Fund for Medical Sciences (CIFMS,2016-I2M-2-001,2016-I2M-1-001,2016-I2M-1-004)
文摘Although only about 2%of the human genome has proved to be protein-coding genes,recent advances in genome wide analysis have revealed that the majority of the genome is transcribed,mainly from noncoding segments that were once considered"junk sequences"or"dark matters"(Liu et al.,2011a;Zhang et al.,2014b). In addition to the well-characterized housekeeping non- coding RNAs (ncRNAs) (tRNA, rRNA, small nuclear RNA and small nucleolar RNAs) and some small regulatory ncRNAs (microRNAs and small interfering RNAs), the transcriptome of mammals could also pervasively have been transcribed long noncoding RNAs (lncRNAs, at least 200 nt) (Rinn and Chang, 2012; Xie et al., 2012).