The development of computer vision-based rice phenotyping techniques is crucial for precision field manage-ment and accelerated breeding,which facilitate continuously advancing rice production.Among phenotyping tasks,...The development of computer vision-based rice phenotyping techniques is crucial for precision field manage-ment and accelerated breeding,which facilitate continuously advancing rice production.Among phenotyping tasks,distinguishing image components is a key prerequisite for characterizing plant growth and development at the organ scale,enabling deeper insights into ecophysiological processes.However,owing to the fine structure of rice organs and complex illumination within the canopy,this task remains highly challenging,underscoring the need for a high-quality training dataset.Such datasets are scarce,both because of a lack of large,representative collections of rice field images and because of the time-intensive nature of the annotation.To address this gap,we created the first comprehensive multiclass rice semantic segmentation dataset,RiceSEG.We gathered nearly 50,000 high-resolution,ground-based images from five major rice-growing countries(China,Japan,India,the Philippines,and Tanzania),encompassing more than 6000 genotypes across all growth stages.From these original images,3078 representative samples were selected and annotated with six classes(background,green vegetation,senescent vegetation,panicle,weeds,and duckweed)to form the RiceSEG dataset.Notably,the subdataset from China spans all major genotypes and rice-growing environments from northeastern to southern regions.Both state-of-the-art convolutional neural networks and transformer-based semantic segmentation models were used as baselines.While these models perform reasonably well in segmenting background and green vegetation,they face difficulties during the reproductive stage,when canopy structures are more complex and when multiple classes are involved.These findings highlight the importance of our dataset for developing specialized segmentation models for rice and other crops.The RiceSEG dataset is publicly available at .展开更多
基金This work was supported by the National Key R&D Program of China(No.2022YFE0116200 and No.2022YFD2300700)the Young Scientists Fund of the National Natural Science Foundation of China(No.42201437 and No.32201893)+4 种基金the PhD Scientific Research and Innovation Foundation of The Education Department of Hainan Province Joint Project of Sanya Yazhou Bay Science and Technology City(No.HSPHDSRF-2024-09-001)the Hainan Provincial Natural Science Foundation of China(No.325QN370)the“JBGS”Project of Seed Industry Revitalization in Jiangsu Province(JBGS[2021]007)the Japan Society for the Promotion of Science(No.22KK0083 and No.JP25H01110)the Sarabetsu Village“Endowed Chair for Field Phenomics”project in Hokkaido,Japan.
文摘The development of computer vision-based rice phenotyping techniques is crucial for precision field manage-ment and accelerated breeding,which facilitate continuously advancing rice production.Among phenotyping tasks,distinguishing image components is a key prerequisite for characterizing plant growth and development at the organ scale,enabling deeper insights into ecophysiological processes.However,owing to the fine structure of rice organs and complex illumination within the canopy,this task remains highly challenging,underscoring the need for a high-quality training dataset.Such datasets are scarce,both because of a lack of large,representative collections of rice field images and because of the time-intensive nature of the annotation.To address this gap,we created the first comprehensive multiclass rice semantic segmentation dataset,RiceSEG.We gathered nearly 50,000 high-resolution,ground-based images from five major rice-growing countries(China,Japan,India,the Philippines,and Tanzania),encompassing more than 6000 genotypes across all growth stages.From these original images,3078 representative samples were selected and annotated with six classes(background,green vegetation,senescent vegetation,panicle,weeds,and duckweed)to form the RiceSEG dataset.Notably,the subdataset from China spans all major genotypes and rice-growing environments from northeastern to southern regions.Both state-of-the-art convolutional neural networks and transformer-based semantic segmentation models were used as baselines.While these models perform reasonably well in segmenting background and green vegetation,they face difficulties during the reproductive stage,when canopy structures are more complex and when multiple classes are involved.These findings highlight the importance of our dataset for developing specialized segmentation models for rice and other crops.The RiceSEG dataset is publicly available at .