基于深度学习方法的赖氨酸琥珀酰化位点表征和鉴定。-小狗文献

【Characterization and Identification of Lysine Succinylation Sites based on Deep Learning Method.].

【基于深度学习方法的赖氨酸琥珀酰化位点表征和鉴定。】 复制标题 收藏收藏

影响因子 :
发表时间：2019-11-07
来源期刊：Sci Rep

DOI：10.1038/s41598-019-52552-4 复制DOI
文章类型：杂志文章

作者列表：
下载文献

Succinylation is a type of protein post-translational modification (PTM), which can play important roles in a variety of cellular processes. Due to an increasing number of site-specific succinylated peptides obtained from high-throughput mass spectrometry (MS), various tools have been developed for computationally identifying succinylated sites on proteins. However, most of these tools predict succinylation sites based on traditional machine learning methods. Hence, this work aimed to carry out the succinylation site prediction based on a deep learning model. The abundance of MS-verified succinylated peptides enabled the investigation of substrate site specificity of succinylation sites through sequence-based attributes, such as position-specific amino acid composition, the composition of k-spaced amino acid pairs (CKSAAP), and position-specific scoring matrix (PSSM). Additionally, the maximal dependence decomposition (MDD) was adopted to detect the substrate signatures of lysine succinylation sites by dividing all succinylated sequences into several groups with conserved substrate motifs. According to the results of ten-fold cross-validation, the deep learning model trained using PSSM and informative CKSAAP attributes can reach the best predictive performance and also perform better than traditional machine-learning methods. Moreover, an independent testing dataset that truly did not exist in the training dataset was used to compare the proposed method with six existing prediction tools. The testing dataset comprised of 218 positive and 2621 negative instances, and the proposed model could yield a promising performance with 84.40% sensitivity, 86.99% specificity, 86.79% accuracy, and an MCC value of 0.489. Finally, the proposed method has been implemented as a web-based prediction tool (CNN-SuccSite), which is now freely accessible at http://csb.cse.yzu.edu.tw/CNN-SuccSite/ .

译文

琥珀酰化是蛋白质翻译后修饰 (PTM) 的一种，可以在多种细胞过程中发挥重要作用。由于从高通量质谱 (MS) 获得的位点特异性琥珀酰化肽的数量不断增加，因此已开发出各种工具来计算识别蛋白质上的琥珀酰化位点。但是，大多数这些工具都基于传统的机器学习方法来预测琥珀酰化位点。因此，这项工作旨在基于深度学习模型进行琥珀酰化位点预测。MS验证的琥珀酰化肽的丰度使得能够通过基于序列的属性 (例如位置特异性氨基酸组成，k间隔氨基酸对 (CKSAAP) 的组成和位置特异性评分矩阵 (PSSM)) 研究琥珀酰化位点的底物位点特异性。此外，通过将所有琥珀酰化序列分为几个具有保守的底物基序的组，采用最大依赖性分解 (MDD) 来检测赖氨酸琥珀酰化位点的底物特征。根据十倍交叉验证的结果，使用PSSM和信息性CKSAAP属性训练的深度学习模型可以达到最佳的预测性能，并且性能也优于传统的机器学习方法。此外，使用训练数据集中真正不存在的独立测试数据集将所提出的方法与六种现有的预测工具进行比较。测试数据集由218阳性和2621阴性实例组成，所提出的模型可以产生具有84.40% 灵敏度、86.99% 特异性、86.79% 准确性和0.489 MCC值的有希望的性能。最后，所提出的方法已实现为基于web的预测工具 (cnn-succsite)，现在可以在http://csb.cse.yzu.edu.tw/cnn-succsite/上自由访问。

【Characterization and Identification of Lysine Succinylation Sites based on Deep Learning Method.].

手机登录