Machine learning improve the discrimination of raw cotton from different countries

下载PDF

导出

摘要 Background The geo-traceability of cotton is crucial for ensuring the quality and integrity of cotton brands. However, effective methods for achieving this traceability are currently lacking. This study investigates the potential of explainable machine learning for the geo-traceability of raw cotton.Results The findings indicate that principal component analysis(PCA) exhibits limited effectiveness in tracing cotton origins. In contrast, partial least squares discriminant analysis(PLS-DA) demonstrates superior classification performance, identifying seven discriminating variables: Na, Mn, Ba, Rb, Al, As, and Pb. The use of decision tree(DT), support vector machine(SVM), and random forest(RF) models for origin discrimination yielded accuracies of 90%, 87%, and 97%, respectively. Notably, the light gradient boosting machine(Light GBM) model achieved perfect performance metrics, with accuracy, precision, and recall rate all reaching 100% on the test set. The output of the Light GBM model was further evaluated using the SHapley Additive ex Planation(SHAP) technique, which highlighted differences in the elemental composition of raw cotton from various countries. Specifically, the elements Pb, Ni, Na, Al, As, Ba, and Rb significantly influenced the model's predictions.Conclusion These findings suggest that explainable machine learning techniques can provide insights into the complex relationships between geographic information and raw cotton. Consequently, these methodologies enhances the precision and reliability of geographic traceability for raw cotton.

作者 WANG Tian XU Shuangjiao WEI Jingyan WANG Ming DU Weidong TIAN Xinquan MA Lei

机构地区 State Key Laboratory of Cotton Bio-breeding and Integrated Utilization Technology Center of Qingdao Customs Fiber Quality Monitoring Center of Xinjiang Uygur Autonomous Region

出处《Journal of Cotton Research》 2025年第3期444-456,共13页 棉花研究(英文)

基金 supported by Agricultural Science and Technology Innovation Program of Chinese Academy of Agricultural Science。

关键词 Raw cotton Mineral elements Machine learning Shapley value

分类号 TS111.9 [轻工技术与工程—纺织材料与纺织品设计] TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献3

1胡翔宇,郄梦洁,赵姗姗,马宇轩,王明林,赵燕.基于矿物元素和稳定同位素技术不同产地陈皮鉴别研究[J].食品安全质量检测学报,2023,14(20):46-55. 被引量：5
2田新权,付小琼,时萌,方丹,徐双娇,马磊.棉纤维DNA的提取及其在品种溯源中的尝试[J].棉花学报,2019,31(2):156-162. 被引量：7
3王静,刘超子,王铭,连素梅,高欣.同位素质谱及近红外光谱技术在棉花产地溯源中应用进展[J].中国棉花,2023,50(12):42-46. 被引量：3

二级参考文献70

1赵汝婷,杨曙明,赵燕.利用稳定同位素进行农产品溯源研究进展[J].核农学报,2020,34(S01):120-128. 被引量：16
2王立新,李云伏,常利芳,黄岚,李宏博,葛玲玲,刘丽华,姚骥,赵昌平.建立小麦品种DNA指纹的方法研究[J].作物学报,2007,33(10):1738-1740. 被引量：67
3况夏.炮制对陈皮中橙皮甙含量的影响[J].中药材,1998,21(7):346-346. 被引量：9
4陈庆全,张玉山.籼型水稻SSR标记遗传连锁图谱的构建及偏分离分析[J].分子植物育种,2009,7(4):685-689. 被引量：22
5程立方,崔秀君,程敬伦,贾丽萍.远红外、微波、热风干燥陈皮的对比实验研究[J].中国中药杂志,1998,23(8):472-473. 被引量：15
6王其献,程曙光,张腾,张冬枫.陈皮不同炮制品挥发油、浸出物的研究[J].安徽中医学院学报,1998,17(6):49-50. 被引量：6
7刘涵.中国棉花进口格局与展望[J].农业展望,2011,7(4):39-43. 被引量：5
8张小娟,何团结,陆徐忠,路曦结,倪金龙,马琳,杨剑波.陆地棉SSR核心引物筛选及95份骨干种质的遗传多样性分析[J].棉花学报,2011,23(6):529-536. 被引量：19
9熊宗伟,王雪姣,顾生浩,毛丽丽,张立祯,周治国.中国棉花纤维品质检验和评价的研究进展[J].棉花学报,2012,24(5):451-460. 被引量：25
10郭念欣,蔡佳良,姬生国.近红外光谱技术在陈皮道地性分析中的应用[J].中国药房,2013,24(15):1394-1396. 被引量：29

共引文献11

1邵亚林,司俊波,常玮,石家恋,丁勇.五种兜兰属植物基因组DNA提取方法比较[J].分子植物育种,2020,18(15):4965-4974. 被引量：17
2闫学春,栾培贤,何立川.通过显微介导将鳜总DNA片段导入镜鲤基因组的分子验证[J].水产学杂志,2020,33(4):1-6. 被引量：3
3袁俊杰,马新华,田琼,魏霜,卢乃会,杨卓瑜,陈文,龙阳.杂草疫情分析在大豆产地溯源中的辅助性应用研究[J].大豆科学,2021,40(1):106-111.
4杨梦琼,杨盈悦,梅光明,张小军,黄丽英.基于近中红外光谱技术鉴别4种大黄鱼产地[J].食品安全质量检测学报,2024,15(5):121-129. 被引量：3
5习佳林,郭阳,李安,陶湛文,赵杰,于寒冰.基于多元素和稳定同位素技术的桃产地溯源[J].食品安全质量检测学报,2024,15(9):62-68. 被引量：7
6耿向阳,郑丽莎,赵涛,周成凤,刘俊.基于近红外技术的主要进口国棉纤维快速识别[J].棉纺织技术,2024,52(10):79-82. 被引量：2
7朱仁愿,孙谏,王行智,陈婷,邱国玉.兰州百合中22种无机元素与种植区域相关性的研究[J].华西药学杂志,2024,39(5):567-574.
8王波,蒋志青,鲍军方,刘津玮.棉纤维产地溯源技术及研究进展[J].纺织学报,2024,45(11):244-250. 被引量：4
9于寒冰,郭阳,李安,赵杰,郑君杰,习佳林.基于稳定同位素和元素技术的草莓产地溯源[J].食品安全质量检测学报,2025,16(3):162-168. 被引量：3
10吴限鑫,彭天舒,李丽娜,林秋君,郭春景,王建忠.植物源农产品产地溯源技术的特征及应用进展[J].现代食品科技,2025,41(9):400-411. 被引量：2

1Kangqi Wang,Ziqi Wu,Man Zhang,Xueyao Lu,Jinsheng Lai,Meiling Zhang,Yi Wang.Metal ion transport in maize:survival in a variable stress environment Author links open overlay panel[J].Journal of Genetics and Genomics,2025,52(3):297-306.
2彭雄新.基于卷积神经网络的无人机烟叶遥感图像智能识别研究[J].电子设计工程,2025,33(21):150-155.
3Charles Hunt WALNE,Jagman DHILLON,Krishna N REDDY,Kambham Raja REDDY.Developing functional relationships of corn growth and developmental responses to nitrogen nutrition for modeling[J].Frontiers of Earth Science,2025,19(2):198-212.
4Kai-Xuan Huo,Yong-Chang Song,Yu Meng,Zi-Qiang Wang,Ming Yu,Bo-Wu Zhang,Jing-Ye Li.A green route to covalently fluorescent whitening cotton fabric for excellent washing durability and skin safety via electron beam irradiation[J].Nuclear Science and Techniques,2025,36(8):98-110.
5江虹,何世昊,高帅杰,李志元,秦勇.电感耦合等离子体质谱法测定两色金鸡菊中18种元素及其安全性评价[J].食品安全质量检测学报,2025,16(21):230-238. 被引量：1
6Yu Mao,Liangping Tu,Zhenyang Xu,Yue Jiang,Mingyu Zheng.Galaxy Morphology Classification Based on DenseNet-SE4 Algorithm[J].Research in Astronomy and Astrophysics,2025,25(8):100-118.
7胡晓燕,苏俊宇,沈涛,杨绍兵,王元忠.基于近红外光谱技术结合深度学习快速鉴别滇黄精干燥方法和产地[J].中草药,2025,56(18):6761-6772.
8Yu Zeng,Fuqiang Lai,Haijie Zhang,Yi Jiang,Junwei Pu,Tongtong Luo,Xiaoxia Zhao.An intelligent recognition method of deep shale gas reservoir laminaset based on laminaset clustering and R-L-M algorithm[J].Artificial Intelligence in Geosciences,2025,6(1):97-112.
9Yichao Zhang,Yanru Guo,Dandan Zhao,Manying Liu,Yange Zhang,Zhi Zheng.Advances in bismuth halide perovskite solar cells[J].Nano Research,2025,18(10):263-289.
10Tamara Zhukabayeva,Vasily Desnitsky,Assel Abdildayeva.Wireless Sensor Network Modeling and Analysis for Attack Detection[J].Computer Modeling in Engineering & Sciences,2025,144(8):2591-2625.

Journal of Cotton Research

2025年第3期

浏览历史

内容加载中请稍等...

Machine learning improve the discrimination of raw cotton from different countries

参考文献3

二级参考文献70

共引文献11

相关作者

相关机构

相关主题

浏览历史