基因预测算法中阈值的傅里叶质谱分析  

Analysis on Threshold Used in Gene Prediction Algorithm Based on Fourier Spectrum

在线阅读下载全文

作  者:刘平[1] 马玉韬[1] 孙学宏[1] 张成[1] 杜勇[2] 

机构地区:[1]宁夏大学物理电气信息学院/宁夏沙漠信息智能感知重点实验室,银川750021 [2]宁夏医科大学总医院小儿外科,银川750004

出  处:《湖北农业科学》2014年第6期1432-1435,共4页Hubei Agricultural Sciences

基  金:国家自然科学基金项目(81260100);宁夏自然科学基金项目(NZ13028);宁夏高校科学研究重点项目(NGY2013003)

摘  要:蛋白质编码区预测中阈值选择对预测结果的影响不容忽视。研究提出以归一化的功率谱密度作为判别DNA序列编码区和非编码区的阈值,以FIR(Finite impulse response,FIR)窄通带滤波器NPBF(Narrow pass band filter,NPBF)作为编码区预测算法核心,采用DNA序列集HMR195和ALLSEQ作为测试集,以碱基层的近似相关系数(Approximate correlation,AC)为预测准确率测度指标,对所提出方法与现有方法的预测结果做了比较。结果表明,采用新阈值得到的预测准确率最高,算法简单直观。Threshold selection of protein coding regions prediction algorithm has important influence on the prediction accuracy.In this paper,a new threshold and normalized value of power spectrum density was proposed to differentiate protein coding regions and non-coding regions.Using the FIR (Finite impulse response) NPBF (Narrow pass-band filter) as the kernel of the prediction algorithm and taking the DNA sequences data sets HMR195 and ALLSEQ as the test sets,the prediction results of the NPBF algorithm with new threshold was compared with those of the same algorithm using other two thresholds.The results were discussed with the AC(Approximate correlation) used as a base level prediction accuracy measure.It was indicated that the proposed threshold was the best choice for higher A C and less amount of computation.

关 键 词:蛋白质编码区预测 窄通带滤波器 归一化的功率谱密度值 信噪比 近似相关系数 

分 类 号:TP391.9[自动化与计算机技术—计算机应用技术] TN713[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象