Due to the illumination,complex background,and occlusion of the litchi fruits,the accurate detection of litchi in the field is extremely challenging.In order to solve the problem of the low recognition rate of litchi-...Due to the illumination,complex background,and occlusion of the litchi fruits,the accurate detection of litchi in the field is extremely challenging.In order to solve the problem of the low recognition rate of litchi-picking robots in field conditions,this study was inspired by the ideas of ResNet and dense convolution and proposed an improved feature-extraction network model named“YOLOv3_Litchi”,combining dense connections and residuals for the detection of litchis.Firstly,based on the traditional YOLOv3 deep convolution neural network and regression detection,the idea of residuals was to be put into the feature-extraction network to effectively avoid the problem of decreasing detection accuracy due to the excessive depths of the network layers.Secondly,under the premise of a good receptive field and high detection accuracy,the large convolution kernel was replaced by a small convolution kernel in the shallow layer of the network,thereby effectively reducing the model parameters.Finally,the idea of feature pyramid was used to design the network to identify the small target litchi to ensure that the shallow features were not lost and simultaneously reduced the model parameters.Experimental results show that the improved YOLOv3_Litchi model achieved better results than the classic YOLOv3_DarkNet-53 model and the YOLOv3_Tiny model.The mean average precision(mAP)score was 97.07%,which was higher than the 95.18%mAP of the YOLOv3_DarkNet-53 model and the 94.48%mAP of the YOLOv3_Tiny model.The frame frequency was 58 fps,which was higher than 29 fps of the YOLOv3_DarkNet-53 model.Compared with the classic Faster R-CNN model with the feature-extraction network VGG16,the mAP was increased by 1%,and the FPS advantage was obvious.Compared with the classic single shot multibox detector(SSD)model,both the accuracy and the running efficiency were improved.The results show that the improved YOLOv3_Litchi model had stronger robustness,higher detection accuracy,and less computational complexity for the identification of litchi in the field conditions,which should be helpful for litchi orchard precision management.展开更多
基金This work was financially supported by the National Natural Science Foundation of China(Grant No.32071912,No.61863011,No.31701325,No.31571568,No.31570180)the Guangzhou Science and Technology Project(Grant No.202002020016,No.202102080337)+4 种基金the Natural Science Foundation of Guangdong Province(Grant No.2018A030313330,No.2020A1515010793)the Second Batch of Industry-Education Cooperation Collaborative Projects in 2019,Ministry of Education(Grant No.201902062040)the Guangzhou Key Laboratory of Intelligent Agriculture(Grant No.201902010081)the Project of Rural Revitalization Strategy in Guangdong Province(Grant No.2020KJ261)the Applied Science and Technology Special Fund Project,Meizhou,China(Grant No.2019B0201005).
文摘Due to the illumination,complex background,and occlusion of the litchi fruits,the accurate detection of litchi in the field is extremely challenging.In order to solve the problem of the low recognition rate of litchi-picking robots in field conditions,this study was inspired by the ideas of ResNet and dense convolution and proposed an improved feature-extraction network model named“YOLOv3_Litchi”,combining dense connections and residuals for the detection of litchis.Firstly,based on the traditional YOLOv3 deep convolution neural network and regression detection,the idea of residuals was to be put into the feature-extraction network to effectively avoid the problem of decreasing detection accuracy due to the excessive depths of the network layers.Secondly,under the premise of a good receptive field and high detection accuracy,the large convolution kernel was replaced by a small convolution kernel in the shallow layer of the network,thereby effectively reducing the model parameters.Finally,the idea of feature pyramid was used to design the network to identify the small target litchi to ensure that the shallow features were not lost and simultaneously reduced the model parameters.Experimental results show that the improved YOLOv3_Litchi model achieved better results than the classic YOLOv3_DarkNet-53 model and the YOLOv3_Tiny model.The mean average precision(mAP)score was 97.07%,which was higher than the 95.18%mAP of the YOLOv3_DarkNet-53 model and the 94.48%mAP of the YOLOv3_Tiny model.The frame frequency was 58 fps,which was higher than 29 fps of the YOLOv3_DarkNet-53 model.Compared with the classic Faster R-CNN model with the feature-extraction network VGG16,the mAP was increased by 1%,and the FPS advantage was obvious.Compared with the classic single shot multibox detector(SSD)model,both the accuracy and the running efficiency were improved.The results show that the improved YOLOv3_Litchi model had stronger robustness,higher detection accuracy,and less computational complexity for the identification of litchi in the field conditions,which should be helpful for litchi orchard precision management.