高级检索
当前位置: 首页 > 详情页

Performance evaluation of computational methods for splice-disrupting variants and improving the performance using the machine learning-based framework

文献详情

资源类型:
WOS体系:
Pubmed体系:

收录情况: ◇ SCIE

单位: [1]Huazhong Univ Sci & Technol, Tongji Hosp, Tongji Med Coll, Dept Internal Med,Div Cardiol, Wuhan 430030, Peoples R China [2]Hubei Key Lab Genet & Mol Mech Cardiol Disorders, Wuhan 430030, Peoples R China [3]Genet Diagnost Ctr, Wuhan, Peoples R China [4]Internal Med Dept, Wuhan, Peoples R China
出处:
ISSN:

关键词: splice-disrupting variant alternative splicing prediction method machine learning performance evaluation genetic diagnostic

摘要:
A critical challenge in genetic diagnostics is the assessment of genetic variants associated with diseases, specifically variants that fall out with canonical splice sites, by altering alternative splicing. Several computational methods have been developed to prioritize variants effect on splicing; however, performance evaluation of these methods is hampered by the lack of large-scale benchmark datasets. In this study, we employed a splicing-region-specific strategy to evaluate the performance of prediction methods based on eight independent datasets. Under most conditions, we found that dbscSNV-ADA performed better in the exonic region, S-CAP performed better in the core donor and acceptor regions, S-CAP and SpliceAI performed better in the extended acceptor region and MMSplice performed better in identifying variants that caused exon skipping. However, it should be noted that the performances of prediction methods varied widely under different datasets and splicing regions, and none of these methods showed the best overall performance with all datasets. To address this, we developed a new method, machine learning-based classification of splice sites variants (MLCsplice), to predict variants effect on splicing based on individual methods. We demonstrated that MLCsplice achieved stable and superior prediction performance compared with any individual method. To facilitate the identification of the splicing effect of variants, we provided precomputed MLCsplice scores for all possible splice sites variants across human protein-coding genes (http://39.105.51.3:8090/MLCsplice/) . We believe that the performance of different individual methods under eight benchmark datasets will provide tentative guidance for appropriate method selection to prioritize candidate splice-disrupting variants, thereby increasing the genetic diagnostic yield.

基金:
语种:
被引次数:
WOS:
PubmedID:
中科院(CAS)分区:
出版当年[2021]版
大类 | 2 区 生物学
小类 | 2 区 生化研究方法 2 区 数学与计算生物学
最新[2025]版:
大类 | 2 区 生物学
小类 | 1 区 数学与计算生物学 2 区 生化研究方法
JCR分区:
出版当年[2020]版:
Q1 BIOCHEMICAL RESEARCH METHODS Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY
最新[2023]版:
Q1 BIOCHEMICAL RESEARCH METHODS Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

影响因子: 最新[2023版] 最新五年平均 出版当年[2020版] 出版当年五年平均 出版前一年[2019版] 出版后一年[2021版]

第一作者:
第一作者单位: [1]Huazhong Univ Sci & Technol, Tongji Hosp, Tongji Med Coll, Dept Internal Med,Div Cardiol, Wuhan 430030, Peoples R China [2]Hubei Key Lab Genet & Mol Mech Cardiol Disorders, Wuhan 430030, Peoples R China
通讯作者:
通讯机构: [1]Huazhong Univ Sci & Technol, Tongji Hosp, Tongji Med Coll, Dept Internal Med,Div Cardiol, Wuhan 430030, Peoples R China [2]Hubei Key Lab Genet & Mol Mech Cardiol Disorders, Wuhan 430030, Peoples R China [3]Genet Diagnost Ctr, Wuhan, Peoples R China [4]Internal Med Dept, Wuhan, Peoples R China
推荐引用方式(GB/T 7714):
APA:
MLA:

资源点击量:426 今日访问量:2 总访问量:410 更新日期:2025-04-01 建议使用谷歌、火狐浏览器 常见问题

版权所有:重庆聚合科技有限公司 渝ICP备12007440号-3 地址:重庆市两江新区泰山大道西段8号坤恩国际商务中心16层(401121)