PSO_BFA优化词袋模型及蛋白质亚细胞定位预测  被引量:2

PSO_BFA Optimized Bag of Words Model and Prediction of Protein Subcellular Localization

在线阅读下载全文

作  者:胡雪娇 陈行健 赵南 薛卫 HU Xuejiao;CHEN Xingjian;ZHAO Nan;XUE Wei(School of Information Science and Technology,Nanjing Agricultural University,Nanjing 210095,China)

机构地区:[1]南京农业大学信息科学技术学院

出  处:《计算机工程与应用》2020年第1期165-171,共7页Computer Engineering and Applications

基  金:中央高校基本科研业务费专项资金(No.KYZ201668)

摘  要:提出了一种基于PSO_BFA优化的词袋模型。传统词袋模型有两个重要参数:窗口大小d和字典大小k。结合粒子群算法和细菌觅食算法产生新的PSO_BFA混合优化算法,在PSO进行局部搜索时,加入BFA的复制和迁移行为,得到PSO_BFA的最优解即为窗口大小和字典大小的最佳组合。将优化词袋模型与蛋白质序列的氨基酸组成算法和伪氨基酸组成算法结合,获得蛋白质序列的词袋特征。实验结果证明,基于PSO_BFA优化的词袋模型能有效提高蛋白质亚细胞定位预测的精度。A bag of words model based on PSO_BFA optimization is proposed.The traditional bag of words model has two important parameters,the window size d and the dictionary size k,respectively.By combining particle swarm optimization and bacterial foraging algorithm,a new integrated optimization algorithm called PSO_BFA is proposed.During the process of local search in PSO,the replication and migration behavior of BFA are added to obtain the best solution of the new PSO_BFA,which is the best combination of window size and dictionary size.Then the optimized BOW model combined with amino acid composition and pseudo amino acid composition is applied to extract the feature vectors of the protein sequences.The experimental results show that the BOW model optimized by PSO_BFA can effectively improve the accuracy of protein subcellular location prediction.

关 键 词:词袋模型 粒子群算法 细菌觅食 亚细胞定位预测 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象