Journal article
Emergence, Retention and Selection: A Trilogy of Origination for Functional De Novo Proteins from Ancestral LncRNAs in Primates
PLoS genetics, Vol.11(7), pp.e1005391-e1005391
07/2015
DOI: 10.1371/journal.pgen.1005391
PMCID: PMC4503675
PMID: 26177073
Abstract
While some human-specific protein-coding genes have been proposed to originate from ancestral lncRNAs, the transition process remains poorly understood. Here we identified 64 hominoid-specific de novo genes and report a mechanism for the origination of functional de novo proteins from ancestral lncRNAs with precise splicing structures and specific tissue expression profiles. Whole-genome sequencing of dozens of rhesus macaque animals revealed that these lncRNAs are generally not more selectively constrained than other lncRNA loci. The existence of these newly-originated de novo proteins is also not beyond anticipation under neutral expectation, as they generally have longer theoretical lifespan than their current age, due to their GC-rich sequence property enabling stable ORFs with lower chance of non-sense mutations. Interestingly, although the emergence and retention of these de novo genes are likely driven by neutral forces, population genetics study in 67 human individuals and 82 macaque animals revealed signatures of purifying selection on these genes specifically in human population, indicating a proportion of these newly-originated proteins are already functional in human. We thus propose a mechanism for creation of functional de novo proteins from ancestral lncRNAs during the primate evolution, which may contribute to human-specific genetic novelties by taking advantage of existed genomic contexts.
Details
- Title: Subtitle
- Emergence, Retention and Selection: A Trilogy of Origination for Functional De Novo Proteins from Ancestral LncRNAs in Primates
- Creators
- Jia-Yu ChenQing Sunny Shen - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, ChinaWei-Zhen Zhou - Center for Bioinformatics, National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing, ChinaJiguang Peng - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, ChinaBin Z He - FAS Center for Systems Biology & Howard Hughes Medical Institute, Harvard University, Cambridge, Massachusetts, United States of AmericaYumei Li - Peking UniversityChu-Jun Liu - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, ChinaXuke Luan - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China; Peking-Tsinghua Center for Life Sciences, Beijing, ChinaWanqiu Ding - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, ChinaShuxian Li - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, ChinaChunyan Chen - Chinese Academy of SciencesBertrand Chin-Ming Tan - Molecular Medicine Research Center, Chang Gung University, Tao-Yuan, TaiwanYong E Zhang - Chinese Academy of SciencesAibin He - Peking UniversityChuan-Yun Li - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Resource Type
- Journal article
- Publication Details
- PLoS genetics, Vol.11(7), pp.e1005391-e1005391
- DOI
- 10.1371/journal.pgen.1005391
- PMID
- 26177073
- PMCID
- PMC4503675
- NLM abbreviation
- PLoS Genet
- ISSN
- 1553-7390
- eISSN
- 1553-7404
- Publisher
- United States
- Grant note
- Howard Hughes Medical Institute
- Language
- English
- Date published
- 07/2015
- Academic Unit
- Biology
- Record Identifier
- 9983991998602771
Metrics
82 Record Views