From initial human papillomavirus (HPV) infection and precursor stages, the development of cervical cancer takes decades. High-sensitivity HPV DNA testing is currently recommended as primary screening method for cervical cancer, whereas better triage methodologies are encouraged to provide accurate risk management for HPV-positive women. Given that virus-driven genomic variation accumulates during cervical carcinogenesis, we designed a 39 Mb custom capture panel targeting 17 HPV types and 522 mutant genes related to cervical cancer. Using capture-based next-generation sequencing, HPV integration status, somatic mutation and copy number variation were analyzed on 34 paired samples, including 10 cases of HPV infection (HPV+), 10 cases of cervical intraepithelial neoplasia (CIN) grade and 14 cases of CIN2+ (CIN2: n = 1; CIN2-3: n = 3; CIN3: n = 9; squamous cell carcinoma: n = 1). Finally, the machine learning algorithm (Random Forest) was applied to build the risk stratification model for cervical precursor lesions based on CIN2+ enriched biomarkers. Generally, HPV integration events (11 in HPV+, 25 in CIN1 and 56 in CIN2+), non-synonymous mutations (2 in CIN1, 12 in CIN2+) and copy number variations (19.1 in HPV+, 29.4 in CIN1 and 127 in CIN2+) increased from HPV+ to CIN2+. Interestingly, 'common' deletion of mitochondrial chromosome was significantly observed in CIN2+ (P = 0.009). Together, CIN2+ enriched biomarkers, classified as HPV information, mutation, amplification, deletion and mitochondrial change, successfully predicted CIN2+ with average accuracy probability score of 0.814, and amplification and deletion ranked as the most important features. Our custom capture sequencing combined with machine learning method effectively stratified the risk of cervical lesions and provided valuable integrated triage strategies.
基金:
National Science and Technology Major Project of the Ministry of Science and Technology of China [2018ZX10301402]; Nature and Science Foundation of China [81761148025]; Guangzhou Science and Technology Programme [201704020093]; Fundamental Research Funds for the Central Universities [17ykzd15]; National Supercomputer Center In Guangzhou
第一作者单位:[1]Sun Yat Sen Univ, Precis Med Inst, Dept Obstet & Gynecol, Zhongshan 2nd Rd, Guangzhou, Guangdong, Peoples R China
通讯作者:
通讯机构:[1]Sun Yat Sen Univ, Precis Med Inst, Dept Obstet & Gynecol, Zhongshan 2nd Rd, Guangzhou, Guangdong, Peoples R China[4]Huazhong Univ Sci & Technol, Tongji Med Coll, Tongji Hosp, Dept Obstet & Gynecol, Wuhan 430030, Hubei, Peoples R China
推荐引用方式(GB/T 7714):
Tian Rui,Cui Zifeng,He Dan,et al.Risk stratification of cervical lesions using capture sequencing and machine learning method based on HPV and human integrated genomic profiles[J].CARCINOGENESIS.2019,40(10):1220-1228.doi:10.1093/carcin/bgz094.
APA:
Tian, Rui,Cui, Zifeng,He, Dan,Tian, Xun,Gao, Qinglei...&Hu, Zheng.(2019).Risk stratification of cervical lesions using capture sequencing and machine learning method based on HPV and human integrated genomic profiles.CARCINOGENESIS,40,(10)
MLA:
Tian, Rui,et al."Risk stratification of cervical lesions using capture sequencing and machine learning method based on HPV and human integrated genomic profiles".CARCINOGENESIS 40..10(2019):1220-1228