Generalized F-statistics have been shown to perform satisfactorily in identifying differentially expressed genes from microarray data. For some complex diseases, however, a high proportion of the identified genes may still be false positives, because disease-related genes show only modest differential expression and microarrays carry systematic noise. The main purpose of this study is to develop statistical methods for Affymetrix microarray gene expression data that reduce the contribution of non-expressed genes to false positives. I propose two novel generalized F-statistics for identifying differentially expressed genes and a novel approach for estimating adjusting factors. The proposed methods systematically combine the filtering of non-expressed genes with the identification of differentially expressed genes. For comparison, the statistical methods discussed were applied to an experimental data set from a type 2 diabetes study. In both the two-sample and three-sample analyses, the proposed statistics showed improved control of false positives.
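
The general idea of combining an expression filter with a per-gene test can be sketched as follows. This is only an illustration under stated assumptions: it uses the ordinary one-way ANOVA F-statistic and a simple mean-intensity cutoff, not the generalized F-statistics or the adjusting-factor estimation proposed in the study, whose definitions are not given in this abstract. The function names and the `min_mean_intensity` parameter are hypothetical.

```python
import numpy as np


def one_way_f(groups):
    """Ordinary one-way ANOVA F-statistic, computed per gene.

    groups: list of arrays, each shaped (n_genes, n_samples_in_group).
    This is the textbook F-statistic, used here as a stand-in; it is
    NOT the generalized F-statistic proposed in the study.
    """
    k = len(groups)
    n_total = sum(g.shape[1] for g in groups)
    grand_mean = np.concatenate(groups, axis=1).mean(axis=1)
    # Between-group sum of squares, one value per gene.
    ss_between = sum(
        g.shape[1] * (g.mean(axis=1) - grand_mean) ** 2 for g in groups
    )
    # Within-group sum of squares, one value per gene.
    ss_within = sum(
        ((g - g.mean(axis=1, keepdims=True)) ** 2).sum(axis=1) for g in groups
    )
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


def filter_then_score(groups, min_mean_intensity):
    """Flag genes as expressed, then score only the expressed ones.

    min_mean_intensity is a hypothetical cutoff on the mean intensity
    across all samples; genes below it are treated as non-expressed
    and receive NaN instead of a score.
    """
    all_samples = np.concatenate(groups, axis=1)
    expressed = all_samples.mean(axis=1) >= min_mean_intensity
    scores = np.full(all_samples.shape[0], np.nan)
    scores[expressed] = one_way_f([g[expressed] for g in groups])
    return expressed, scores
```

Passing two arrays in `groups` corresponds to a two-sample analysis and three arrays to a three-sample analysis. Genes that fail the filter never enter the ranking, so they cannot appear as false positives.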
