Colorectal Cancer Gene Selection Based on a Fuzzy Discernibility Matrix  Cited by: 2

Colon characteristic gene selection based on fuzzy discernibility matrix


Authors: Li Teng; Yang Tian; Dai Jianhua; Chen Ling [3] (College of Logistics and Transportation, Central South University of Forestry and Technology, Changsha, 410004, China; Hunan Provincial Science and Technology Project Foundation, Hunan Normal University, Changsha, 410081, China; Xiangya Hospital of Central South University, Changsha, 410008, China; College of Systems Engineering, University of Defense Science and Technology, Changsha, 410073, China)

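Affiliations: [1] College of Logistics and Transportation, Central South University of Forestry and Technology, Changsha 410004 [2] Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Hunan Normal University, Changsha 410081 [3] Xiangya Hospital, Central South University, Changsha 410008 [4] College of Systems Engineering, National University of Defense Technology, Changsha 410073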

Source: Journal of Nanjing University (Natural Science), 2019, No. 4, pp. 633-643 (11 pages)

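Funding: China Postdoctoral Science Foundation (2017T100795); Natural Science Foundation of Hunan Province (2017JJ2408); Key Research and Development Program of Hunan Province (2018SK2129)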

Abstract: Poorly differentiated tumors are difficult to detect through conventional histopathological diagnosis, whereas gene testing can accurately screen out the disease-causing genes of a specific tumor, so gene selection is a key issue in tumor classification and clinical treatment. Tumor gene expression data have few samples and high dimensionality (typically thousands of genes), and existing gene selection algorithms still leave room for improvement in classification accuracy and computational efficiency. Building on fuzzy rough set theory, this paper fuzzifies the discernibility matrix and designs an attribute reduction algorithm based on the resulting fuzzy discernibility matrix. Compared with the classical discernibility matrix, the fuzzy discernibility matrix reflects how strongly each attribute distinguishes a given pair of objects, so attributes with a higher degree of discernibility can be selected to obtain better classification. Numerical experiments show that the method improves classification accuracy on tumor gene data and reduces computation time. Using a kNN classifier for feature gene selection on the colorectal cancer dataset (Colon Microarray), five key genes related to colorectal cancer were screened from 2000 feature genes, with a classification accuracy of 88.06%.
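The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes that the degree to which an attribute discerns two objects is the absolute difference of their min-max normalised values, and that attributes are chosen by a simple greedy rule; both choices, and the function names `fuzzy_discernibility` and `greedy_select`, are hypothetical.

```python
import numpy as np

def fuzzy_discernibility(X, y):
    """Return D of shape (n_pairs, n_attrs): D[p, a] is the degree (in [0, 1])
    to which attribute a separates the p-th pair of objects whose labels differ.
    Assumption: degree = |difference of min-max normalised values|."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    # Min-max normalise each attribute so that all degrees lie in [0, 1].
    span = X.max(axis=0) - X.min(axis=0)
    span[span == 0] = 1.0  # a constant attribute discerns nothing
    Xn = (X - X.min(axis=0)) / span
    # Only pairs from different decision classes need to be discerned.
    i, j = np.triu_indices(len(y), k=1)
    mask = y[i] != y[j]
    return np.abs(Xn[i[mask]] - Xn[j[mask]])

def greedy_select(D, n_select):
    """Greedily pick attributes that most increase the total of the
    per-pair best discernibility degree (an illustrative heuristic)."""
    covered = np.zeros(D.shape[0])  # best degree achieved so far per pair
    selected = []
    for _ in range(n_select):
        gain = np.maximum(D, covered[:, None]).sum(axis=0) - covered.sum()
        if selected:
            gain[selected] = -np.inf  # never pick the same attribute twice
        best = int(np.argmax(gain))
        if gain[best] <= 0:
            break  # no remaining attribute improves any pair
        selected.append(best)
        covered = np.maximum(covered, D[:, best])
    return selected

# Toy data (made up for illustration): 4 samples, 3 "genes", 2 classes.
X = [[0.1, 5.0, 0.3],
     [0.2, 5.1, 0.9],
     [0.9, 5.0, 0.2],
     [1.0, 5.1, 0.8]]
y = [0, 0, 1, 1]

D = fuzzy_discernibility(X, y)
selected = greedy_select(D, 1)
print(selected)  # attribute 0 separates the two classes best in this toy data
```

A classical discernibility matrix would record only whether an attribute differs for a pair; keeping a graded degree lets the selection prefer attributes that separate classes by a wide margin. The selected columns could then be passed to any kNN classifier for evaluation.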

Keywords: fuzzy rough set; rough set; fuzzy discernibility matrix; gene selection

Classification code: TP311 [Automation and Computer Technology: Computer Software and Theory]

 
