摘要
The classification of parboiled rice into types can be optimized through the use of machine learning(ML)algorithms,resulting in greater speed and accuracy in data processing.The objectives of this study were:(i)to investigate the spectral behavior of different types of parboiled rice(Types 1–5 and Off-type);(ii)to identify the most effective ML algorithm for classifying parboiled rice types;(iii)to determine the best kernel configuration and preprocessing methods for spectral data;and(iv)to recommend a protocol for implementing this technique in the rice storage industry.Samples were selected based on the maximum defect limits tolerated for each type,according to the Technical Rice Regulation.Spectral data were acquired using a spectroradiometer in the range of 350–2500 nm and subsequently processed with different methods,including baseline correction,standard normal variate,multiplicative scattering correction,combinations of these techniques with Savitzky-Golay smoothing,and the application of the first derivative of Savitzky-Golay smoothing.The data were analyzed using six different ML algorithms:Artificial Neural Network,Decision Tree,Logistic Regression,REPTree,Random Forest,and Support Vector Machine.Rice types were treated as output variables,while spectral features served as input variables.Logistic Regression and Support Vector Machine algorithms showed the best classification performance,with accuracy rates above 97%,F-scores around 0.98,and Kappa values exceeding 0.97.Spectral preprocessing did not yield substantial improvements and incurred high computational costs;therefore,using raw data was a viable and efficient alternative.For practical implementation in the rice storage industry,we recommend acquiring a VNIR-SWIR(visible near-infrared and shortwave infrared)hyperspectral sensor(350–2500 nm)and developing a classification model based on the Support Vector Machine algorithm with a linear kernel trained on representative local samples.Additionally,we recommend implementing an automated real-time classification system,a representative sample collection protocol,and detailed reporting for inventory and logistics optimization.
基金
supported by the Coordination for the Improvement of Higher Education Personnel,Brazil(Grant No.001)
the National Council for Scientific Technological and Development,Brazil(Grant No.304966/2023-1)
the Research Support Foundation of the State of Rio Grande do Sul,Brazil(Grant No.24/2551-0001150-1)。