一种SRBCT亚型识别与特征基因选取方法  被引量:1

Approach to subtype recognition and selection of informative genes for SRBCT

在线阅读下载全文

作  者:何爱香[1] 朱云华[2] 安凯[2] 

机构地区:[1]山东工商学院信电学院,山东烟台264005 [2]中国空间技术研究院513所,山东烟台264000

出  处:《计算机工程与应用》2007年第3期223-226,共4页Computer Engineering and Applications

摘  要:基于基因表达谱提出了一种选取特征基因并使用多类支持向量机(MSVM)进行肿瘤亚型识别的方法。就小圆蓝细胞瘤(SRBCT)的亚型识别问题,以组间和组内平方和比率(BSS/WSS)作为衡量基因分类重要性的标准,据此选择基因构造若干MSVM模型,由分类错误率确定了含25个基因的特征集合,并利用基于相关距离的冗余分析方法去除冗余,得到15个特征基因。基于该特征子集构造的MSVM在测试集上取得100%的预测准确率。与相关文献的比较表明了该方法的有效性和可行性。An approach to tumor molecular classification based on their gene expression profiles is presented.A new measure known as between-groups to within-groups sums of squares ratio(BSS/WSS) is used as the criterion of screening predictive genes for SRBCT subtype recognition.The 152 genes are chosen by this criterion and form the feature set whose subsets will be used to create MSVM models to identify the subtypes.The trained MSVM based on the top 25 genes ranked by BSS/WSS is able to achieve 100% accuracy on the training and blind test dataset.Then this subset is analyzed by the dissimilarity distance to remove its redundancy.As a result,the 15 genes are retained with the same accuracy as the subset of 25 genes and are regarded as the final subset.Comparison with other methods demonstrates efficiency and feasibility of the method and the predictive models proposed in this work.

关 键 词:多类支持向量机 基因表达谱 特征选取 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象