Authors: 耿彧 (GENG Yu) [1]; 杨蓉蓉 (YANG Rongrong) [1]; 张静 (ZHANG Jing) [2]
Affiliations: [1] School of Health Management, Jinzhou Medical University, Jinzhou 121001, Liaoning, China; [2] School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an 710049, Shaanxi, China
Source: Journal of Southern Medical University (《南方医科大学学报》), 2020, No. 10, pp. 1493-1499 (7 pages)
Funding: Natural Science Foundation of Liaoning Province (20180550161, 20180550855, 2019-ZD-0604).
Abstract:
Objective: Haplotype amplification of germline variants is thought to indicate potential selective advantage and susceptibility to clonal expansion, and has become an important signature for identifying cancer susceptibility genes. We propose an improved association method for rare variants that fully accounts for haplotype amplification status.
Methods: Haplotype amplification status was estimated from variant allelic frequencies. A permutation test on the variant allelic frequencies was first used to cluster the candidate variants into groups. A likelihood clustering method was then applied to establish the neighborhood system of the hidden Markov random field model. A filtering pipeline that combines a Wilson interval filter with a false discovery rate controller was introduced to further refine the candidate variants. Finally, the candidate set and the haplotype amplification status were collapsed into weighted virtual sites for association testing.
Results: In simulation experiments across a series of datasets, the type I error rate was compared at different minor allele frequencies and remained stably within 2%, suggesting good robustness of the algorithm. When the proposed method was compared with 5 other published association approaches on type I and type II error rates, both error rates of the proposed method remained within 2%, demonstrating a clear advantage and good statistical power.
Conclusion: The proposed method can accurately identify tumor susceptibility variants in haplotype amplification regions with good robustness and stability, and may provide decision support for clinical diagnosis.
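The abstract names two standard statistical building blocks for its filtering step, a Wilson interval and false discovery rate control, without giving details. The sketch below shows the textbook form of each: the Wilson score interval for a binomial proportion and the Benjamini-Hochberg procedure. It is an illustration only. The function names, the default `z` and `alpha` values, and the choice of Benjamini-Hochberg as the false discovery rate procedure are assumptions, and the way the paper combines the two filters is not stated in this record.

```python
import math


def wilson_interval(successes, trials, z=1.96):
    """Wilson score interval for a binomial proportion.

    Hypothetical helper: here `successes` could be the number of reads
    carrying the variant allele and `trials` the read depth at a site.
    z=1.96 corresponds to an approximate 95% interval.
    """
    if trials <= 0:
        raise ValueError("trials must be positive")
    p_hat = successes / trials
    denom = 1 + z * z / trials
    center = (p_hat + z * z / (2 * trials)) / denom
    half_width = (
        z
        * math.sqrt(p_hat * (1 - p_hat) / trials + z * z / (4 * trials * trials))
        / denom
    )
    return center - half_width, center + half_width


def benjamini_hochberg(p_values, alpha=0.05):
    """Benjamini-Hochberg step-up procedure.

    Returns one boolean per input p-value, True where the hypothesis is
    rejected at false discovery rate `alpha`. Assumed stand-in for the
    "false discovery rate controller" mentioned in the abstract.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff = 0  # largest rank k with p_(k) <= (k / m) * alpha
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff = rank
    rejected = [False] * m
    for idx in order[:cutoff]:
        rejected[idx] = True
    return rejected


if __name__ == "__main__":
    # Made-up numbers for illustration only, not data from the paper.
    low, high = wilson_interval(successes=5, trials=10)
    print(f"Wilson interval: ({low:.4f}, {high:.4f})")
    print(benjamini_hochberg([0.001, 0.04, 0.03, 0.5]))
```

The Wilson interval is often preferred over the simple normal approximation when the read depth is small or the frequency is near 0 or 1, which is the usual situation for rare variants.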