The number of distinct functional classes of single-stranded RNAs (ssRNAs) and the number of sequences representing them are substantial and continue to increase. Organizing this data in an evolutionary context is essential, yet traditional comparative sequence analyses require that homologous sites can be identified. This prevents comparative analysis between sequences of different functional classes that share no site-to-site sequence similarity. Analysis within a single evolutionary lineage also limits evolutionary inference because shared ancestry confounds properties of molecular structure and function that are historically contingent with those that are imposed for biophysical reasons. Here, we apply a method of comparative analysis to ssRNAs that is not restricted to homologous sequences, and therefore enables comparison between distantly related or unrelated sequences, minimizing the effects of shared ancestry. This method is based on statistical similarities in nucleotide base composition among different functional classes of ssRNAs. In order to denote base composition unambiguously, we have calculated the fraction G+A and G+U content, in addition to the more commonly used fraction G+C content. These three parameters define RNA composition space, which we have visualized using interactive graphics software. We have examined the distribution of nucleotide composition from 15 distinct functional classes of ssRNAs from organisms spanning the universal phylogenetic tree and artificial ribozymes evolved in vitro. Surprisingly, these distributions are biased consistently in G+A and G+U content, both within and between functional classes, regardless of the more variable G+C content. Additionally, an analysis of the base composition of secondary structural elements indicates that paired and unpaired nucleotides, known to have different evolutionary rates, also have significantly different compositional biases. These universal compositional biases observed among ssRNAs sharing little or no sequence similarity suggest, contrary to current understanding, that base composition biases constitute a convergent adaptation among a wide variety of molecular functions.

译文

单链rna (ssrna) 的不同功能类别的数量以及代表它们的序列数量是大量的,并且还在继续增加。在进化背景下组织这些数据至关重要,但是传统的比较序列分析要求可以识别同源位点。这可以防止在没有站点到站点序列相似性的不同功能类的序列之间进行比较分析。单个进化谱系内的分析也限制了进化推断,因为共有的祖先混淆了分子结构和功能的特性,这些特性在历史上与出于生物物理原因而施加的特性有关。在这里,我们对ssrna应用了一种比较分析方法,该方法不限于同源序列,因此可以在远缘或无关序列之间进行比较,从而最大程度地减少了共享血统的影响。此方法基于不同功能类别的ssrna之间核苷酸碱基组成的统计相似性。为了明确表示碱基成分,除了更常用的分数G C含量外,我们还计算了分数G A和G U含量。这三个参数定义了RNA组成空间,我们已经使用交互式图形软件对其进行了可视化。我们已经检查了来自跨越通用系统进化树和体外进化的人工核酶的生物体的15种不同功能类ssrna的核苷酸组成分布。令人惊讶的是,无论功能类内部和功能类之间的G A和G U含量如何,这些分布在G A和G U含量上都是一致的,而与更可变的G C含量无关。此外,对二级结构元件的碱基组成的分析表明,已知具有不同进化速率的成对和不成对核苷酸也具有明显不同的组成偏差。在具有很少或没有序列相似性的ssrna中观察到的这些普遍的组成偏差表明,与目前的理解相反,碱基组成偏差构成了多种分子功能之间的收敛适应。

+1
+2
100研值 100研值 ¥99课程
检索文献一次
下载文献一次

去下载>

成功解锁2个技能,为你点赞

《SCI写作十大必备语法》
解决你的SCI语法难题!

技能熟练度+1

视频课《玩转文献检索》
让你成为检索达人!

恭喜完成新手挑战

手机微信扫一扫,添加好友领取

免费领《Endnote文献管理工具+教程》

微信扫码, 免费领取

手机登录

获取验证码
登录