Identification of tobacco leaves from small production regions based on near-infrared spectral dimension transformation and convolutional neural network  Cited by: 1


Authors: JU Lei[1], GAO Yang, ZHANG Xin[1], GE Jiong[1], YUE Baohua, SHU Ruxin

Affiliations: [1] Technology Center, Shanghai Tobacco Group Co., Ltd., Shanghai 201315, China; [2] Department of Chemistry, School of Science, Shanghai University, Shanghai 200444, China

Source: Tobacco Science & Technology (《烟草科技》), 2024, No. 7, pp. 8-13 (6 pages)

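Funding: Key R&D Project of China National Tobacco Corporation, "Research, Development and Application of Core Algorithms for the Digital Design of Multi-Brand, Multi-Specification Cigarette Products" (110202202001); Science and Technology Project of Shanghai Tobacco Group Co., Ltd., "Research on the Architecture of an Integrated Digital Design Platform for Cigarette Products" (K2024-1-030Z).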

Abstract: This study aims to improve the accuracy of identifying tobacco leaves from small production regions and to address the poor class-prediction performance of near-infrared (NIR) spectroscopy when the sample set is large, the samples are highly similar, and the number of classes is high. A total of 4625 tobacco leaf samples were collected from eight small production regions in Yunnan Province, and the one-dimensional NIR spectral data were reconstructed as two-dimensional image data. A convolutional neural network (CNN) was used to build a classification model for tobacco leaves from these regions, and its performance was compared with that of other machine-learning algorithms. The results showed that: 1) Conventional machine-learning algorithms such as principal component analysis (PCA) and support vector machine (SVM) performed only moderately when classifying tobacco leaves from multiple adjacent regions; the overall accuracies of the SVM model on the training and test sets were 78.86% and 69.08%, respectively. 2) The CNN achieved training and test accuracies of 97.41% and 92.54%, respectively, which were 18.55 and 23.46 percentage points higher than those of the SVM. Transforming the dimension of the NIR spectral data and combining it with a CNN allows more sample feature information to be extracted and can be applied effectively to the classification and identification of tobacco leaves from small production regions.

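Keywords: tobacco leaf; small production region; near-infrared spectroscopy; spectral dimension transformation; convolutional neural network; classification and identification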

Classification code: TS411 [Agricultural Sciences: Tobacco Industry]
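
The abstract describes reconstructing one-dimensional NIR spectra as two-dimensional images before feeding them to a CNN, but it does not say how the transformation is performed. The sketch below shows one simple possibility, offered purely as an assumption: min-max scale the spectrum, zero-pad it, and fold it row by row into a near-square grid. The function name `spectrum_to_image` and the `width` parameter are hypothetical and do not come from the paper.

```python
import math


def spectrum_to_image(spectrum, width=None):
    """Fold a 1-D spectrum into a 2-D grid (row-major order).

    Assumption: the paper's actual transformation is not given in the
    abstract; this simple fold with zero-padding is only illustrative.
    """
    n = len(spectrum)
    if n == 0:
        raise ValueError("spectrum must not be empty")

    # Default to a near-square image: width = ceil(sqrt(n)).
    if width is None:
        width = math.ceil(math.sqrt(n))
    height = math.ceil(n / width)

    # Min-max scale to [0, 1] so values behave like pixel intensities.
    lo, hi = min(spectrum), max(spectrum)
    span = (hi - lo) or 1.0  # avoid division by zero for flat spectra
    scaled = [(v - lo) / span for v in spectrum]

    # Zero-pad the tail so the data fills the last row completely.
    padded = scaled + [0.0] * (height * width - n)

    # Slice the padded sequence into rows of equal length.
    return [padded[r * width:(r + 1) * width] for r in range(height)]


# Example: a toy 10-point "spectrum" becomes a 3 x 4 grid.
image = spectrum_to_image(list(range(10)))
print(len(image), "rows x", len(image[0]), "columns")
```

Each resulting grid could then serve as a single-channel input image for a CNN classifier with eight output classes, one per production region. Other encodings of a spectrum as an image exist, and the choice here should not be read as the authors' method.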

 
