检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
出 处:《航天医学与医学工程》2013年第5期367-370,共4页Space Medicine & Medical Engineering
基 金:浙江省自然科学基金资助(Y1100219)
摘 要:目的研究一种高效的基因特征提取方法,以尽可能地克服传统噪声基因剔除法中阈值设置主观性带来的信息丢失问题。方法收集Golub等发布的急性白血病基因表达谱公共数据库中的数据。相对宽松地剔除噪声基因,适当增加被选基因数量,进而利用二维主元分析法(2D—PCA)技术进行二次基因特征提取,并采用基于机器支持向量机(SVM)的分类形式。结果文中方法可获得90个二次特征和100.00%的分类精度;与直接利用一次特征进行分类相比,分类精度可提高2.78~8.35%。结论通过适当增加被选基因数量提取高效且维数相对较低的特征是可行的。Objective To study a gene feature extraction method with high efficiency so as to overcome the problem of effective information lost due to the subjective threshold setting during noise gene elimination in conventional methods. Methods The data for the analysis were taken from public Leukemia dataset published by Golub etal. More selected genes were introduced properly by relaxing the constraints of threshold setting during the process of gene noise elimination. Two-Dimensional Principal Component Analysis (2D-PCA) tech nique was applied to the selected genes to extract secondary features. Support vector machine (SVM) based classifier was used for the classification. Results Ninety secondary features could be extracted using the pro-posed approach. Its classification accuracy was 100% and the overall classification accuracy could be in creased by 2.78 - 8.35 percent as compared with the elementary feature based classification. Conclusion It is feasible to extract more effect features with lower dimensions by introducing more selected genes properly. Key words: gene ; feature extraction ; secondary feature ; support vector machine ; classification
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.15