PCA和SPA的近红外光谱识别白菜种子品种研究  被引量:15

Discrimination of Varieties of Cabbage with Near Infrared Spectra Based on Principal Component Analysis and Successive Projections Algorithm

在线阅读下载全文

作  者:罗微[1] 杜焱喆[1] 章海亮[1] LUO Wei DU Yan-zhe ZHANG Hai-liang(East China Jiaotong University, Nanchang 330013, China)

机构地区:[1]华东交通大学,江西南昌330013

出  处:《光谱学与光谱分析》2016年第11期3536-3541,共6页Spectroscopy and Spectral Analysis

基  金:国家自然科学基金项目(61565005)资助

摘  要:为了实现对不同品种白菜种子的快速无损鉴别,应用近红外光谱技术获取白菜种子的光谱反射率,首先采用变量标准化校正和多元散射校正对原始光谱进行预处理;其次,采用主成分分析法(PCA)对光谱数据进行聚类分析,从定性分析的角度得到三种不同白菜种子的特征差异,并采用连续投影算法(SPA)选取特征波长;最后,分别基于全波段光谱、PCA分析得到的前3个主成分变量以及SPA算法选取的特征波长,建立了最小二乘支持向量机(LS-SVM)和偏最小二乘判别(PLS-DA)模型进行白菜种子不同品种的鉴别。从主成分PC1、PC2得分图中可以看出,主成分1和2对不同种类白菜种子具有很好的聚类作用。基于特征波长建立的PLS-DA和LS-SVM模型的判别结果优于基于主成分变量建立的模型,其中基于特征波长建立的LS-SVM模型识别效果最优,建模集和预测集的品种识别率均达到100%。结果表明,通过SPA算法选取的6个特征波长变量能够很好的反映光谱信息,提出的SPA算法结合LS-SVM预测模型能获得满意的分类结果,为白菜种子品种的识别提供了一种新方法。The varieties of cabbage seeds directly affect the yield and quality of cabbage,in order to rapidly and nondestructively identify the varieties of cabbage seeds,near infrared spectra technique were applied in this study and reflectance spectrum of the cabbage seeds was obtained.Firstly,to excavate the effective information in the spectral data and improve signal to noise ratio,the raw spectra was pre-processed with the method of standard normal variate(SNV)and multiplicative scatter correction(MSC).Secondly,principal component analysis(PCA)was used to analyze the clustering of cabbage samples,then the characteristic differentia of three cabbage varieties was obtained through qualitative analysis.Six Effective wavelengths were selected by successive projections algorithm(SPA).Finally,the full spectra variable,the first three principal components(PCs)using PCA and selected effective wavelengths using SPA were respectively set as inputs of the partial least squares discriminant analysis(PLS-DA)and least-squares support vector machine(LS-SVM)models for the classification of cabbage seeds.As can be seen from the two dimensional plot drawn with the scores of PC1 and PC2(the first two principle components),PC1 and PC2had a good clustering effect for different kinds of cabbage seeds.LS-SVM models performed better than PLS-DA models,the correct rates of discrimination were 100% achieved with LS-SVM models.PLS-DA and LS-SVM models built based on the selectedwavelengths performed better than the models built based on the first three principal components,moreover,the SPA-LS-SVM model obtained the best results among all models,with 100% discrimination accuracy for both the calibration set and the prediction set.The overall results show that SPA can extract wavelengths,and the LS-SVM model combined with SPA can obtain optimal classification results.So the present paper could offer an alternate approach for the rapid discrimination of cabbage seeds variety.

关 键 词:近红外光谱 主成分分析 连续投影算法 偏最小二乘鉴别 最小二乘支持向量机 

分 类 号:TP731[自动化与计算机技术—检测技术与自动化装置]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象