Title |
Identification of a dinucleotide signature that discriminates coding from non-coding long RNAs
|
---|---|
Published in |
Frontiers in Genetics, September 2014
|
DOI | 10.3389/fgene.2014.00316 |
Pubmed ID | |
Authors |
Damien Ulveling, Marcel E. Dinger, Claire Francastel, Florent Hubé |
Abstract |
To date, the main criterion by which long ncRNAs (lncRNAs) are discriminated from mRNAs is based on the capacity of the transcripts to encode a protein. However, it becomes important to identify non-ORF-based sequence characteristics that can be used to parse between ncRNAs and mRNAs. In this study, we first established an extremely selective workflow to define a highly refined database of lncRNAs which was used for comparison with mRNAs. Then using this highly selective collection of lncRNAs, we found the CG dinucleotide frequencies were clearly distinct. In addition, we showed that the bias in CG dinucleotide frequency was conserved in human and mouse genomes. We propose that this sequence feature will serve as a useful classifier in transcript classification pipelines. We also suggest that our refined database of "bona fide" lncRNAs will be valuable for the discovery of other sequence characteristics distinct to lncRNAs. |
X Demographics
As of 1 July 2024, you may notice a temporary increase in the numbers of X profiles with Unknown location. Click here to learn more.
Geographical breakdown
Country | Count | As % |
---|---|---|
Australia | 1 | 25% |
United States | 1 | 25% |
France | 1 | 25% |
Unknown | 1 | 25% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 3 | 75% |
Members of the public | 1 | 25% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
India | 1 | 3% |
France | 1 | 3% |
Unknown | 30 | 94% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 6 | 19% |
Researcher | 6 | 19% |
Student > Bachelor | 5 | 16% |
Professor | 4 | 13% |
Student > Postgraduate | 2 | 6% |
Other | 6 | 19% |
Unknown | 3 | 9% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 13 | 41% |
Biochemistry, Genetics and Molecular Biology | 12 | 38% |
Engineering | 2 | 6% |
Computer Science | 1 | 3% |
Neuroscience | 1 | 3% |
Other | 1 | 3% |
Unknown | 2 | 6% |