Traditional Chinese Medicine Syndrome Classification of Chronic Gastritis Based on the Deep Forest Algorithm  Cited by: 16

Syndrome Classification of Chronic Gastritis Based on Multi-grained Cascade Forest


Authors: YAN Jianjun[1]; LIU Zhangpeng; LIU Guoping[2]; GUO Rui[2,3]; WANG Yiqin; FU Jingjing[2]; QIAN Peng[2]

Affiliations: [1] School of Mechanical and Power Engineering, East China University of Science and Technology, Shanghai 200237, China; [2] Laboratory of Information Access and Synthesis of Traditional Chinese Medicine Four Diagnosis, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China; [3] Institute of Interdisciplinary Research Complex, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China

Source: Journal of East China University of Science and Technology (Natural Science Edition), 2019, No. 4, pp. 593-599 (7 pages)

Funding: National Natural Science Foundation of China (81270050, 81302913, 30901897, 81173199)

Abstract: To address the complexity and non-linearity of traditional Chinese medicine (TCM) inquiry, the deep forest algorithm (gcForest) is used to build a TCM inquiry syndrome classification model for chronic gastritis. gcForest is applied to chronic gastritis inquiry data to build the classification model, which is compared with models built by two deep learning algorithms (DBN and DBM) and five multi-label learning algorithms (ML-KNN, BSVM, ECC, RankSVM, and LIFT). The experimental results show that the model outperforms the other algorithms on both multi-label evaluation metrics and the classification accuracy of individual syndromes, that it effectively solves the TCM inquiry syndrome classification problem for chronic gastritis, and that it can provide a reference for research on quantitative syndrome diagnosis of chronic gastritis.

The standardization and objectification of traditional Chinese medicine (TCM) inquiry have become active topics in machine learning. However, TCM inquiry data contain complex relations between symptoms and syndromes, as well as among the symptoms themselves, so most machine learning algorithms cannot deal effectively with their complexity and non-linearity. In this paper, we propose a syndrome classification model for chronic gastritis (CG) based on multi-grained cascade forest (gcForest). TCM inquiry is a typical multi-label learning problem: a patient may have two or more syndromes at the same time. First, we convert the multi-label problem into binary classification problems via a transformation method. Then, a classification model is built for each syndrome with the gcForest algorithm. gcForest is a novel decision tree ensemble method based on deep learning and is composed of two parts, cascade forest and multi-grained scanning. The proposed algorithm is compared with two deep learning algorithms, Deep Belief Nets (DBN) and Deep Boltzmann Machine (DBM), and five multi-label algorithms, ML-KNN, BSVM, ECC, RankSVM, and LIFT. The experimental results show that, overall, the proposed model outperforms these algorithms on multi-label metrics and on the classification accuracy of each syndrome. The overall accuracy reaches 0.834, and the classification precision of the 6 syndromes is 0.906, 0.818, 0.764, 0.966, 0.840, and 0.912, respectively. We also analyze the effect of the hyperparameters on model performance, and the results verify the model's robustness. gcForest processes data in a hierarchical and abstract manner, which is consistent with TCM syndrome diagnosis. Therefore, gcForest can effectively solve the TCM inquiry syndrome classification of CG and provide a reference for research on the quantitative diagnosis of CG.
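The transformation step described in the abstract can be illustrated with a minimal sketch. It assumes the transformation is binary relevance (one independent yes/no problem per syndrome), which the abstract does not state explicitly. The `NearestNeighbour` class is a hypothetical stand-in for the per-syndrome classifier and is not gcForest; the label names and toy data are likewise invented for illustration.

```python
from typing import Dict, List, Sequence, Set


def to_binary_targets(label_sets: Sequence[Set[str]],
                      labels: Sequence[str]) -> Dict[str, List[int]]:
    """Binary relevance: build one 0/1 target vector per syndrome label."""
    return {lab: [int(lab in s) for s in label_sets] for lab in labels}


class NearestNeighbour:
    """Stand-in binary classifier (1-NN on Hamming distance), NOT gcForest."""

    def fit(self, X, y):
        self.X, self.y = list(X), list(y)
        return self

    def predict(self, X):
        def dist(a, b):
            return sum(u != v for u, v in zip(a, b))
        return [self.y[min(range(len(self.X)), key=lambda i: dist(row, self.X[i]))]
                for row in X]


def fit_binary_relevance(X, label_sets, labels, make_clf=NearestNeighbour):
    """Train one independent binary model per syndrome."""
    targets = to_binary_targets(label_sets, labels)
    return {lab: make_clf().fit(X, targets[lab]) for lab in labels}


def predict_label_sets(models, X):
    """Merge the per-syndrome 0/1 predictions back into one label set per patient."""
    per_label = {lab: m.predict(X) for lab, m in models.items()}
    return [{lab for lab in models if per_label[lab][i] == 1}
            for i in range(len(X))]


def per_label_accuracy(true_sets, pred_sets, labels):
    """Fraction of patients for whom each syndrome is predicted correctly."""
    n = len(true_sets)
    return {lab: sum((lab in t) == (lab in p) for t, p in zip(true_sets, pred_sets)) / n
            for lab in labels}


# Hypothetical toy data: rows are patients, columns are binary symptom indicators.
LABELS = ["syndrome_A", "syndrome_B"]
X_train = [[1, 0, 1], [0, 1, 0], [1, 1, 1], [0, 0, 0]]
Y_train = [{"syndrome_A"}, {"syndrome_B"}, {"syndrome_A", "syndrome_B"}, set()]

models = fit_binary_relevance(X_train, Y_train, LABELS)
predicted = predict_label_sets(models, [[1, 1, 1], [0, 1, 0]])
```

Passing a different factory as `make_clf` would swap in another per-syndrome classifier without changing the transformation itself, which is the design point of treating each syndrome as its own binary problem.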

Keywords: syndrome classification; deep forest; deep learning; chronic gastritis; traditional Chinese medicine

Classification number: R241 [Medicine and Health: TCM Diagnostics]

 
