检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]广西师范学院数学与统计科学学院,广西南宁530023
出 处:《广西师范学院学报(自然科学版)》2016年第4期36-41,共6页Journal of Guangxi Teachers Education University(Natural Science Edition)
基 金:混合与缺失数据统计分析广西高校重点实验室科学基金开放项目(GXMMSL201407);广西高校科学技术研究项目(KY2015YB190)
摘 要:如何从肿瘤基因表达谱成千上万个基因中提取与疾病相关的信息基因,已成为肿瘤分类问题的研究核心.该文介绍了Lasso回归,以公开的结肠癌数据集为分析对象,采用信噪比指标对基因排序过滤无关基因.然后利用R软件中求解Lasso的程序包msgps和glmnet进行降维,从而选择出信息基因.与前人研究结果比较,说明了Lasso回归的有效性.该文的算法利用R软件实现,代码公开,操作简单.How to extract informative gene associated with tumor from thousands of gene in tumor gene expression profile, has been the core of the research of tumor classification. In this paper, the lasso regression is introduced firstly. The open colon cancer data set is taken as the analysis object, and a large number of genes is filtered by ranking the value of signal noise ratio. Then the packages of msgps and glmnet of R software solving Lasso regression are respectively used to reduce dimensionality of the gene data filtered, and the informative gene is gained. Compared with previous research results, the results of this paper is effective based on Lasso regression and R software. The algorithm of this paper is implemented with R language, in which the code is open and easy to operate.
分 类 号:O212.4[理学—概率论与数理统计]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.229