Funding: Supported by the National Natural Science Foundation of China (42071420), the Major Special Project for 2025 Scientific, Technological Innovation (Major Scientific and Technological Task Project in Ningbo City) (2021Z048), and the National Key Research and Development Program of China (2019YFE0125300).
Abstract: Spectroscopy can be used to detect crop characteristics. A goal of crop spectrum analysis is to extract effective features from spectral data for establishing a detection model. An ideal spectral feature set should have high sensitivity to target parameters but low information redundancy among features. However, feature-selection methods that satisfy both requirements are lacking. To address this issue, this study developed a novel method, the continuous wavelet projections algorithm (CWPA), which combines the advantages of continuous wavelet analysis (CWA) and the successive projections algorithm (SPA) to generate an optimal spectral feature set for crop detection. Three datasets collected for crop stress detection and retrieval of biochemical properties were used to validate the CWPA under both classification and regression scenarios. The CWPA generated a feature set with fewer features while achieving accuracy comparable to or even higher than that of CWA and SPA. With only two to three features identified by the CWPA, an overall accuracy of 98% was achieved in classifying tea plant stresses, and high coefficients of determination were obtained in retrieving corn leaf chlorophyll content (R^2 = 0.8521) and equivalent water thickness (R^2 = 0.9508). The mechanism of the CWPA ensures that the algorithm discovers the most sensitive features while retaining complementarity among features. Its ability to reduce the data dimension suggests its potential for crop monitoring and phenotyping with hyperspectral data.
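The low-redundancy idea behind SPA can be illustrated with a short sketch. This is not the CWPA and not the authors' code; it is a minimal version of the greedy projection step commonly described for SPA, assuming a samples-by-variables matrix `X` and a caller-supplied starting column (`start` is an illustrative parameter; full SPA implementations usually also search over starting columns and validate the subset size).

```python
import numpy as np

def spa_select(X, n_select, start=0):
    """Greedy SPA-style selection of n_select column indices from X (samples x variables)."""
    P = np.asarray(X, dtype=float).copy()
    selected = [start]
    for _ in range(n_select - 1):
        v = P[:, selected[-1]]
        # Project every column onto the orthogonal complement of the last pick.
        # (Assumes v is non-zero, i.e. n_select does not exceed the rank of X.)
        P = P - np.outer(v, v @ P) / (v @ v)
        norms = np.linalg.norm(P, axis=0)
        norms[selected] = -1.0  # never re-pick an already chosen column
        # The column with the largest residual norm is least collinear with the picks so far.
        selected.append(int(np.argmax(norms)))
    return selected

# Toy data: column 1 is exactly collinear with column 0, so it is skipped.
X_demo = np.array([[1.0, 2.0, 0.0, 0.0],
                   [0.0, 0.0, 1.0, 0.0],
                   [0.0, 0.0, 0.0, 0.5]])
print(spa_select(X_demo, 3, start=0))  # -> [0, 2, 3]
```

In the toy matrix the second column carries no information beyond the first, which is why the projection step drives its residual norm to zero and the selection moves on to the remaining columns.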
Funding: Supported by the ‘Pioneer’ and ‘Leading Goose’ R&D Program of Zhejiang (Grant No. 2023C02018), the Zhejiang Provincial Natural Science Foundation of China (Grant No. LTGN23D010002), the National Natural Science Foundation of China (Grant No. 42371385), the Funds of the Natural Science Foundation of Hangzhou (Grant No. 2024SZRYBD010001), and the Nanxun Scholars Program of ZJWEU (Grant No. RC2022010755).
Abstract: Plant diseases are a major threat that can severely impact agricultural and forestry production and can disrupt ecosystem functions and health. With its ability to capture continuous narrow-band spectra, hyperspectral technology has become a crucial tool for monitoring crop diseases by remote sensing. However, existing continuous wavelet analysis (CWA) methods suffer from feature redundancy, while the continuous wavelet projection algorithm (CWPA), an optimization approach for feature selection, has not been fully validated for monitoring plant diseases. This study used rice bacterial leaf blight (BLB) as an example to evaluate the performance of four wavelet basis functions (Gaussian2, Mexican hat, Meyer, and Morlet) within the CWA and CWPA frameworks. Classification models were constructed using the k-nearest neighbors (KNN), random forest (RF), and Naïve Bayes (NB) algorithms. The results showed the following: (1) Compared to traditional CWA, CWPA significantly reduced the number of required features. Under the CWPA framework, almost all the model combinations achieved maximum classification accuracy with only one feature, whereas the CWA framework required three to seven features. (2) The choice of wavelet basis function markedly affected model performance. Of the four functions tested, the Meyer wavelet demonstrated the best overall performance in both the CWPA and CWA frameworks. (3) Under the CWPA framework, the Meyer-KNN and Meyer-NB combinations achieved the highest overall accuracy of 93.75% using just one feature. In contrast, under the CWA framework, the CWA-RF combination achieved comparable accuracy (93.75%) but required six features. This study verified the technical advantages of CWPA for monitoring crop diseases, identified an optimal wavelet basis function selection scheme, and provided reliable technical support for precisely monitoring BLB in rice (Oryza sativa). Moreover, the proposed methodological framework offers a scalable approach for the early diagnosis and assessment of plant stress, which can improve the accuracy and timeliness of plant stress monitoring.
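To make the role of the wavelet basis function concrete, the following sketch computes continuous wavelet coefficients of a one-dimensional spectrum at a single scale with a Mexican hat wavelet. It is an illustration under stated assumptions, not the study's implementation: the kernel is truncated at five scales on each side, the scale is expressed in band indices, and the names `mexican_hat` and `cwt_at_scale` are hypothetical.

```python
import numpy as np

def mexican_hat(t):
    """Unnormalized Mexican hat wavelet (negated second derivative of a Gaussian)."""
    return (1.0 - t**2) * np.exp(-0.5 * t**2)

def cwt_at_scale(spectrum, scale):
    """Wavelet coefficients of a 1-D spectrum at one scale (scale given in band units)."""
    spectrum = np.asarray(spectrum, dtype=float)
    half = int(np.ceil(5 * scale))            # truncate the kernel at +/- 5 scales
    t = np.arange(-half, half + 1) / scale
    kernel = mexican_hat(t) / np.sqrt(scale)
    # The kernel is symmetric, so convolution and correlation coincide here.
    # The spectrum must be longer than the kernel for mode="same" to keep its length.
    return np.convolve(spectrum, kernel, mode="same")

# Synthetic spectrum: one Gaussian feature centred on band 100.
bands = np.arange(201)
spectrum_demo = np.exp(-0.5 * ((bands - 100) / 4.0) ** 2)
coeffs_demo = cwt_at_scale(spectrum_demo, scale=4)
print(int(np.argmax(coeffs_demo)))            # band where the response is strongest
```

Repeating the call over several scales yields a scale-by-band coefficient map, and each entry of that map is a candidate wavelet feature of the kind a feature-selection step would then rank.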
Funding: Supported financially by the China State Forestry Administration “948” projects (2015-4-52) and the Heilongjiang Natural Science Foundation (C2017005).
Abstract: The identification of timber properties is important for safe application. Near-infrared spectroscopy (NIRS) is widely used because of its simplicity, efficiency, and positive environmental attributes. However, in practice, weak signals must be extracted from complex, overlapping, and changing information. This study focused on the stability of NIR modeling. Orthogonal partial least squares (OPLS) and the successive projections algorithm (SPA) were used to eliminate noise and extract effective spectra, and an ensemble learning method, MIX-PLS, was applied to establish the model. The elastic modulus of timber was taken as an example, and 201 wood samples of three species, Xylosma congesta (Lour.) Merr., Acer pictum subsp. mono, and Betula pendula, were divided into three groups to investigate modelling performance. The results show that OPLS can preprocess the near-infrared spectral information according to the target property in the presence of systematic error and reduce errors to a minimum. SPA finally selected 13 spectral bands, simplifying the NIR spectral data and improving model accuracy. For the Mix Partial Least Squares (MIX-PLS) model, the Pearson correlation coefficients of calibration (Rc) and prediction (Rp) were 0.95 and 0.90, and the root mean square errors of calibration (RMSEC) and prediction (RMSEP) were 2.075 and 6.001, respectively, which shows that the model has good generalization ability.
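The reported Rc/Rp and RMSEC/RMSEP values are the same two statistics evaluated on the calibration and prediction sets. A minimal sketch of both follows, assuming the plain definitions (Pearson correlation between measured and predicted values, and root mean square error with an n denominator); some chemometrics software uses a degrees-of-freedom correction for RMSEC, which is not shown here.

```python
import math

def pearson_r(y_true, y_pred):
    """Pearson correlation coefficient between measured and predicted values."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((a - mean_t) * (b - mean_p) for a, b in zip(y_true, y_pred))
    ss_t = sum((a - mean_t) ** 2 for a in y_true)
    ss_p = sum((b - mean_p) ** 2 for b in y_pred)
    return cov / math.sqrt(ss_t * ss_p)

def rmse(y_true, y_pred):
    """Root mean square error (n in the denominator)."""
    n = len(y_true)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / n)

# Rc/RMSEC use calibration samples; Rp/RMSEP use held-out prediction samples.
measured = [1.0, 2.0, 3.0, 4.0]
predicted = [2.0, 4.0, 6.0, 8.0]
print(pearson_r(measured, predicted), rmse(measured, predicted))
```

The toy values show why both statistics are reported together: the predictions are perfectly correlated with the measurements yet carry a large error, so a high R alone does not establish accuracy.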
Funding: Supported by the Science and Technology Development Plan Project of Jilin Provincial Department of Science and Technology (No. 20220203112S) and the Jilin Provincial Department of Education Science and Technology Research Project (No. JJKH20210039KJ).
Abstract: In this study, eight different varieties of maize seeds were used as the research objects. A total of 81 combined preprocessing schemes were applied to the original spectra. Through comparison, Savitzky-Golay (SG)-multivariate scattering correction (MSC)-maximum-minimum normalization (MN) was identified as the optimal preprocessing technique. Competitive adaptive reweighted sampling (CARS), the successive projections algorithm (SPA), and their combinations were employed to extract feature wavelengths. Classification models based on back propagation (BP), support vector machine (SVM), random forest (RF), and partial least squares (PLS) were established using full-band data and feature wavelengths. Among all models, the (CARS-SPA)-BP model achieved the highest accuracy of 98.44%. This study offers novel insights and methodologies for the rapid and accurate identification of corn seeds as well as other crop seeds.
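Two of the three steps in the SG-MSC-MN chain can be sketched compactly. The code below is an assumption-laden illustration, not the study's pipeline: MSC is implemented in its usual form (regress each spectrum on the mean spectrum, then remove the fitted offset and slope), the maximum-minimum normalization is assumed to be applied per spectrum, and the Savitzky-Golay smoothing step is omitted (SciPy's `scipy.signal.savgol_filter` is one common choice for it).

```python
import numpy as np

def msc(spectra, reference=None):
    """Scatter correction: fit each spectrum as intercept + slope * reference, then undo both."""
    spectra = np.asarray(spectra, dtype=float)
    ref = spectra.mean(axis=0) if reference is None else np.asarray(reference, dtype=float)
    corrected = np.empty_like(spectra)
    for i, s in enumerate(spectra):
        slope, intercept = np.polyfit(ref, s, deg=1)
        corrected[i] = (s - intercept) / slope
    return corrected

def min_max(spectra):
    """Scale each spectrum (row) to the range [0, 1] along the wavelength axis."""
    spectra = np.asarray(spectra, dtype=float)
    lo = spectra.min(axis=1, keepdims=True)
    hi = spectra.max(axis=1, keepdims=True)
    return (spectra - lo) / (hi - lo)

# Three copies of one base spectrum with different gain and offset (pure scatter effects).
base = np.array([1.0, 2.0, 4.0, 3.0])
raw_demo = np.vstack([base, 2.0 * base + 1.0, 0.5 * base - 0.2])
processed_demo = min_max(msc(raw_demo))
print(processed_demo)
```

Because the three toy spectra differ only by gain and offset, MSC maps them onto the same curve, which is the behaviour the correction is meant to provide before wavelength selection and classification.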
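Abstract: Acidity is one of the important indices for evaluating the quality of Shatangju mandarin. To eliminate the effect of collinearity among spectral variables and to reduce the number of modeling variables so as to speed up calibration, this paper applied the successive projections algorithm (SPA) to optimize a near-infrared (NIR) nondestructive detection model for the total acid content of Shatangju mandarin. The NIR spectra were corrected with a junction-point correction method, abnormal samples were removed using studentized residual plots and model regression plots, and the sample set was partitioned with the SPXY (sample set partitioning based on joint x-y distances) method. SPA was then used for variable selection, the prediction performance of models built on the SPA-selected variables was compared with that of a full-spectrum PLS model, and the influence of the peel on the prediction accuracy of the total acid model was analyzed. The results showed that, using only 9 of the 2001 variables, the SPA-MLR and SPA-PLS models for acidity measured on whole fruit achieved prediction accuracy comparable to that of the full-variable PLS model, with prediction correlation coefficients Rp of 0.829470, 0.837095, and 0.857299, respectively. For acidity measured on peeled fruit (flesh only), 13 variables were selected, and the Rp values of the SPA-MLR and SPA-PLS models were 0.819430 and 0.825277, respectively, both higher than that of the full-spectrum PLS model (0.780146). SPA therefore improved the prediction accuracy of the model for acidity measured on peeled fruit.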
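The SPXY partitioning step named in this abstract can be sketched as follows. This is a minimal illustration of the commonly described procedure (Kennard-Stone selection on the sum of max-normalized distances in x and in y), not the paper's code; the function name `spxy_split` and its arguments are hypothetical, and the full pairwise distance matrix makes it suitable only for small sample sets.

```python
import numpy as np

def spxy_split(X, y, n_calibration):
    """Return (calibration indices, remaining indices) chosen by joint x-y distances."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float).reshape(len(X), -1)
    dx = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    dy = np.linalg.norm(y[:, None, :] - y[None, :, :], axis=2)
    d = dx / dx.max() + dy / dy.max()          # joint, max-normalized distance
    # Start from the two samples that are farthest apart.
    i, j = np.unravel_index(np.argmax(d), d.shape)
    selected = [int(i), int(j)]
    while len(selected) < n_calibration:
        remaining = [k for k in range(len(X)) if k not in selected]
        # For each remaining sample, distance to its nearest selected sample.
        nearest = d[np.ix_(remaining, selected)].min(axis=1)
        # Add the sample that is farthest from everything selected so far.
        selected.append(remaining[int(np.argmax(nearest))])
    remaining = [k for k in range(len(X)) if k not in selected]
    return selected, remaining

# Four toy samples with one spectral variable and one reference value each.
X_toy = np.array([[0.0], [1.0], [2.0], [10.0]])
y_toy = np.array([0.0, 1.0, 2.0, 10.0])
print(spxy_split(X_toy, y_toy, n_calibration=3))  # -> ([0, 3, 2], [1])
```

The calibration set is filled from the extremes inward, so it spans the range of both the spectra and the reference values, leaving interior samples for prediction.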