Motivation: Epstein-Barr virus (EBV) is one of the most prevalent DNA oncogenic viruses. The integration of EBV into the host genome has been reported to play an important role in cancer development. The preference of EBV integration showed strong dependence on the local genomic environment, which enables the prediction of EBV integration sites. Results: An attention-based deep learning model, DeepEBV, was developed to predict EBV integration sites by learning local genomic features automatically. First, DeepEBV was trained and tested using the data from the dsVIS database. The results showed that DeepEBV with EBV integration sequences plus Repeat peaks and 2-fold data augmentation performed the best on the training dataset. Furthermore, the performance of the model was validated in an independent dataset. In addition, the motifs of DNA-binding proteins could influence the selection preference of viral insertional mutagenesis. Furthermore, the results showed that DeepEBV can predict EBV integration hotspot genes accurately. In summary, DeepEBV is a robust, accurate and explainable deep learning model, providing novel insights into EBV integration preferences and mechanisms.
基金:
National Science and Technology Major Project of the Ministry of science and technology of China [2018ZX10301402, 2018YFC2001600]; National Natural Science Foundation of China [81761148025, 82001919, 81871473]; Guangzhou Science and Technology Programme [201704020093]; National Ten Thousands Plan for Young Top Talents; Key-Area Research and Development Program of Guangdong Province [2019B03035001]; General Program of Natural Science Foundation of Guang-dong Province of China [2021A1515012438]; National Postdoctoral Program for Innovative Talent [BX20200398]; China Postdoctoral Science Foundation [2020M672995]; Guangdong Basic and Applied Basic Research Foundation [2020A1515110170]; Characteristic Innovation Research Project of University Teachers [2020SWYY07]
语种:
外文
被引次数:
WOS:
PubmedID:
中科院(CAS)分区:
出版当年[2020]版:
大类|2 区生物
小类|1 区数学与计算生物学2 区生化研究方法2 区生物工程与应用微生物
最新[2025]版:
大类|3 区生物学
小类|3 区生化研究方法3 区生物工程与应用微生物3 区数学与计算生物学
JCR分区:
出版当年[2019]版:
Q1MATHEMATICAL & COMPUTATIONAL BIOLOGYQ1BIOCHEMICAL RESEARCH METHODSQ1BIOTECHNOLOGY & APPLIED MICROBIOLOGY
最新[2023]版:
Q1BIOCHEMICAL RESEARCH METHODSQ1BIOTECHNOLOGY & APPLIED MICROBIOLOGYQ1MATHEMATICAL & COMPUTATIONAL BIOLOGY
第一作者单位:[1]Minist Educ, Key Lab Brain Cognit & Educ Sci, Guangzhou, Peoples R China[2]South China Normal Univ, Inst Brain Res & Rehabil, Guangzhou 510631, Peoples R China
通讯作者:
通讯机构:[3]Sun Yat Sen Univ, Affiliated Hosp 1, Dept Gynaecol Oncol, Guangzhou 510080, Guangdong, Peoples R China[12]Huazhong Univ Sci & Technol, Tongji Med Coll, Tongji Hosp, Dept Obstet & Gynaecol, Wuhan 430030, Hubei, Peoples R China
推荐引用方式(GB/T 7714):
Liang Jiuxing,Cui Zifeng,Wu Canbiao,et al.DeepEBV: a deep learning model to predict Epstein-Barr virus (EBV) integration sites[J].BIOINFORMATICS.2021,37(20):3405-3411.doi:10.1093/bioinformatics/btab388.
APA:
Liang, Jiuxing,Cui, Zifeng,Wu, Canbiao,Yu, Yao,Tian, Rui...&Hu, Zheng.(2021).DeepEBV: a deep learning model to predict Epstein-Barr virus (EBV) integration sites.BIOINFORMATICS,37,(20)
MLA:
Liang, Jiuxing,et al."DeepEBV: a deep learning model to predict Epstein-Barr virus (EBV) integration sites".BIOINFORMATICS 37..20(2021):3405-3411