基于KPCA与KLPP及Wilks统计量的留兰香三维荧光数据特征提取与鉴别分析  

Spearmint three-dimensional fluorescence data feature extraction and identification based on KPCA and KLPP and Wilks statistics

在线阅读下载全文

作  者:殷勇[1] 徐非凡 于慧春[1] 袁云霞[1] YIN Yong;XU Feifan;YU Huichun;YUAN Yunxia(College of Food and Bioengineering,Henan University of Science and Technology,Luoyang 471000,China)

机构地区:[1]河南科技大学食品与生物工程学院,洛阳471000

出  处:《农业工程学报》2024年第19期272-280,共9页Transactions of the Chinese Society of Agricultural Engineering

基  金:国家重点研发计划项目(2017YFC1600802)。

摘  要:为实现留兰香产地的快速鉴别,该研究提出了一种核主成分分析(kernel principal component analysis,KPCA)与核局部保持投影(kernel locality preserving projections,KLPP)及WilksΛ统计量序贯融合的特征波长提取策略,在此基础上鉴别5个产地的留兰香。首先,在采集5个产地300个留兰香样本的三维荧光数据后,运用三角形内插值法去除原始光谱中的瑞利散射和拉曼散射,并运用SG(Savitzky-Golay)对数据进行平滑预处理。然后,对预处理后的荧光光谱数据分别利用KPCA、KPCA+KLPP、KPCA+WilksΛ统计量、 KPCA+KLPP+WilksΛ统计量4种方法提取特征激发波长和特征发射波长。接着,按特征激发波长从小到大顺序将其对应的特征发射波长光谱值首尾相连转换成行向量;4种方法从300个样本中各得到1个300行的特征波长光谱值矩阵。再者,运用Fisher判别分析(fisher discriminant analysis,FDA)对特征波长光谱值矩阵进行数据可分性融合,生成可分性FD(fisher discriminant)变量。选取前4个累计判别能力达到99%的FD变量作为鉴别模型的输入向量。最后,用支持向量机(support vector machine,SVM)算法分析4个FD变量,分别得到对应于4种特征提取波长方法的FDA+SVM鉴别结果,其正确率分别为92.00%、96.00%、94.67%、100%。结果表明,所提出的KPCA+KLPP+WilksΛ统计量序贯融合的特征波长提取策略能够有效减少三维荧光光谱数据的冗余,并能表征原始荧光数据的信息特征,实现了5种留兰香产地的正确鉴别。该研究可为后续利用三维荧光光谱开展留兰香重要组分量化分析提供一定的基础。This study aims to rapidly and accurately identify the origin of spearmint.A feature wavelength extraction was proposed to combine the sequential fusion of Kernel Principal Component Analysis(KPCA)with Kernel Locality Preserving Projections(KLPP)and Wilks'Λstatistics.Effective discrimination was realized among the five origins of spearmint.Firstly,the three-dimensional fluorescence data was collected from the 300 spearmint samples in five locations.The interferences of Rayleigh and Raman scattering were removed from the original spectra by triangular interpolation.Savitzky-Golay(SG)was used to smooth and then eliminate the noise.Secondly,the preprocessed three-dimensional fluorescence spectrum matrix was merged into the data matrix,according to the emission wavelength.KPCA,KPCA+KLPP,KPCA+WilksΛstatistics,and KPCA+KLPP+WilksΛstatistics were used to extract the feature wavelengths of the three-dimensional fluorescence spectrum matrix after preprocessing.Among them,KPCA and KPCA+WilksΛstatistics were used to extract 40 wavelengths of feature emission and eight wavelengths of feature excitation,respectively.KPCA+KLPP and KPCA+KLPP+WilksΛstatistics were to extract 50 wavelengths of feature emission and 10 wavelengths of feature excitation,respectively.Thirdly,the spectral values of feature emission wavelength were converted into row vectors,according to the wavelength order of feature excitation.Four 300-row spectral matrices of feature wavelength were then obtained from the 300 samples.Furthermore,Fisher discriminant analysis(FDA)was applied to the spectral matrix of feature wavelengths for data separable fusion for the separable fisher discriminant(FD)variables.The first four FD variables were selected with a cumulative discriminant ability of 99%as the input vectors of the discriminant model for the spearmint origins.Finally,the support vector machine(SVM)was used to analyze the four FD variables.The FDA+SVM identification model showed that there was a four-feature wavelength extraction.The correct rates for t

关 键 词:荧光光谱 判别分析 模型 留兰香 核主成分分析 

分 类 号:TS207.3[轻工技术与工程—食品科学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象