基于荷斯坦牛群体基因组数据填充软件的准确性比较(Minimac 3与Beagle 5.1)  被引量:2

Comparison of Software(Minimac 3 and Beagle 5.1)for Genomic Imputation Using Holstein Cow Population

在线阅读下载全文

作  者:罗汉鹏 窦金焕[1] 安涛[1] 陈少侃 王雅春[1] LUO Hanpeng;DOU Jinhuan;AN Tao;CHEN Shaokan;WANG Yachun(College of Animal Science and Technology,China Agricultural University,Beijing 100193,China;Beijing Sunlon Livestock Development Co.,Ltd.,Beijing 100029, China)

机构地区:[1]中国农业大学动物科学技术学院,北京100193 [2]北京首农畜牧发展有限公司,北京100029

出  处:《中国畜牧兽医》2021年第5期1664-1671,共8页China Animal Husbandry & Veterinary Medicine

基  金:现代农业(奶牛)产业技术体系建设专项资金(CARS-36);长江学者和创新团队发展计划(IRT_15R62);农业品种改良提升专项(2130135)。

摘  要:为探究基因组数据填充软件准确性的影响因素和展示填充具体过程,本研究使用两款主要填充软件Beagle 5.1和Minimac 3对奶牛基因组50K芯片数据进行填充至150K,使用个体的填充结果和真实数据进行填充一致性计算,比较两软件的填充准确性和一致性的差异及其主要影响因素。研究结果表明,Minimac 3软件需要使用其他软件进行基因定向后再进行填充,而Beagle 5.1软件可同时进行基因定向和基因组填充。Beagle 5.1与Minimac 3软件填充一致性的相关系数为0.98;Beagle 5.1软件平均填充的准确性(r^(2))为0.9841,一致性为0.9914,填充准确性与一致性的相关系数为0.39;Minimac 3软件平均填充的准确性为0.9782,一致性为0.9911,填充准确性(r^(2))和一致性的相关系数为0.36。由于软件计算填充准确性原理问题,填充的准确性(r^(2))受最小等位基因影响较大。填充的一致性在最小等位基因频率和位点杂合度上升时均呈下降趋势,当位点杂合度>0.6时显著下降(填充一致性低于0.8),但Beagle 5.1软件的填充效果在相同的最小等位基因频率和杂合度下均优于Minimac 3软件。本研究发现填充准确性(r^(2))受填充位点的杂合度影响较大,而Beagle 5.1软件进行基因组数据填充的准确性更高,基因组数据填充后使用填充一致性作为填充准确性的判断标准可避免删除过多有效填充位点。The aims of current study were to show the process of genomic imputation and investigate the factors affecting accuracy of genomic imputation.The data of 50K panel imputed to 150K for dairy cattle was used to compare accuracy and concordance of imputation for two software(Beagle 5.1 and Minimac 3).Concordance was calculated by cross validation of individuals with imputed data and real data.The target population for imputation should be phased by Minimac 3 and the function of Beagle 5.1 including pre-phasing and imputation.The correlation of concordance between Minimac 3 and Beagle 5.1 were 0.98.For Beagle 5.1,the average of imputation accuracy(r^(2))and concordance were 0.9841 and 0.9914,respectively,and the correlation between imputation accuracy and concordance was 0.39.For Minimac 3,the average of imputation accuracy(r^(2))and concordance were 0.9782 and 0.9911,respectively,and the correlation between imputation accuracy(r^(2))and concordance was 0.36.Imputation accuracy(r^(2))was associated with minor allele frequency due to the formula for calculating accuracy from the software.With the increasing of minor allele frequency and heterozygosity for makers,the concordance of imputation was decreased.There was a steep decline when heterozygosity was higher than 0.6(concordance of imputation was lower than 0.8).However,the accuracy of Beagle 5.1 software was better than that of Minimac 3 software under the same minor allele frequency and heterozygosity of imputed site.The accuracy of imputation(r^(2))was mainly affected by heterozygosity of SNPs and Beagle 5.1 had better performance on imputation accuracy than that of Minimac 3.Using concordance as the accuracy of imputation to select SNPs could avoid losing useful makers for further study.

关 键 词:荷斯坦牛 群体 基因组填充 准确性 

分 类 号:S813.3[农业科学—畜牧学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象