Detection efficiency plays an increasingly important role in object detection tasks.One-stage methods are widely adopted in real life because of their high efficiency especially in some real-time detection tasks such ...Detection efficiency plays an increasingly important role in object detection tasks.One-stage methods are widely adopted in real life because of their high efficiency especially in some real-time detection tasks such as face recognition and self-driving cars.RetinaMask achieves significant progress in the field of one-stage detectors by adding a semantic segmentation branch,but it has limitation in detecting multi-scale objects.To solve this problem,this paper proposes RetinaMask with Gate(RMG)model,consisting of four main modules.It develops RetinaMask with a gate mechanism,which extracts and combines features at different levels more effectively according to the size of objects.It firstly extracted multi-level features from input image by ResNet.Secondly,it constructed a fused feature pyramid through feature pyramid network,then gate mechanism was employed to adaptively enhance and integrate features at various scales with the respect to the size of object.Finally,three prediction heads were added for classification,localization and mask prediction,driving the model to learn with mask prediction.The predictions of all levels were integrated during the post-processing.The augment network shows better performance in object detection without the increase of computation cost and inference time,especially for small objects.展开更多
基金the National Natural Science Foundation of China under Grant No.61672181。
文摘Detection efficiency plays an increasingly important role in object detection tasks.One-stage methods are widely adopted in real life because of their high efficiency especially in some real-time detection tasks such as face recognition and self-driving cars.RetinaMask achieves significant progress in the field of one-stage detectors by adding a semantic segmentation branch,but it has limitation in detecting multi-scale objects.To solve this problem,this paper proposes RetinaMask with Gate(RMG)model,consisting of four main modules.It develops RetinaMask with a gate mechanism,which extracts and combines features at different levels more effectively according to the size of objects.It firstly extracted multi-level features from input image by ResNet.Secondly,it constructed a fused feature pyramid through feature pyramid network,then gate mechanism was employed to adaptively enhance and integrate features at various scales with the respect to the size of object.Finally,three prediction heads were added for classification,localization and mask prediction,driving the model to learn with mask prediction.The predictions of all levels were integrated during the post-processing.The augment network shows better performance in object detection without the increase of computation cost and inference time,especially for small objects.