Abstract: With the continuous development of artificial intelligence and computer vision technology, numerous deep learning-based lane line detection methods have emerged. DeepLabv3+, a classic semantic segmentation model, has been widely applied to lane line detection. However, the accuracy of lane line segmentation is often compromised by changes in lighting conditions, occlusions, and wear on the lane lines. In addition, DeepLabv3+ has high memory consumption and is difficult to deploy on embedded platforms. To address these issues, this paper proposes a lane line detection method for complex road scenes based on DeepLabv3+ and MobileNetV4 (MNv4). First, the lightweight MNv4 is adopted as the backbone network, and the standard convolutions in ASPP are replaced with depthwise separable convolutions. Second, a polarization attention mechanism is introduced after the ASPP module to enhance the model’s generalization capability. Finally, the Simple Linear Iterative Clustering (SLIC) superpixel segmentation algorithm is employed to preserve lane line edge information. MNv4-DeepLabv3+ was tested on the TuSimple and CULane datasets. On the TuSimple dataset, the Mean Intersection over Union (MIoU) and Mean Pixel Accuracy (mPA) improved by 1.01% and 7.49%, respectively. On the CULane dataset, MIoU and mPA increased by 3.33% and 7.74%, respectively. The number of parameters decreased from 54.84 M to 3.19 M. The experimental results show that MNv4-DeepLabv3+ substantially reduces the parameter count while improving segmentation accuracy.
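To illustrate why replacing standard convolutions with depthwise separable ones shrinks a model, the following is a minimal sketch of the weight-count arithmetic. The channel sizes and kernel size are hypothetical values chosen for illustration only; they are not taken from the paper, and the function names are likewise assumptions.

```python
def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weight count of a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Weight count of a depthwise k x k conv followed by a 1 x 1 pointwise conv (bias omitted)."""
    depthwise = c_in * k * k   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes channels
    return depthwise + pointwise


# Hypothetical layer shape (assumption, not from the paper).
c_in, c_out, k = 320, 256, 3

std = standard_conv_params(c_in, c_out, k)
sep = depthwise_separable_params(c_in, c_out, k)
ratio = sep / std  # equals 1/c_out + 1/k**2

print(f"standard:  {std} weights")
print(f"separable: {sep} weights")
print(f"ratio:     {ratio:.4f}")
```

For a 3 x 3 kernel the ratio is roughly 1/9 plus a small 1/c_out term, so a layer of this shape keeps only about 11.5% of its weights. This accounts for one layer only; the overall reduction reported in the abstract also depends on the backbone change.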
Funding: Supported by China Academy of Railway Sciences Corporation Limited (No. 2021YJ127).
Abstract: Artificial intelligence, and deep learning in particular, has advanced the study of facial expression recognition, since facial expressions carry rich emotional information and matter in many naturalistic situations. To pursue high recognition accuracy, deep learning models are generally designed to be very deep, which typically limits their real-time performance. Starting from MobileNetV3, a lightweight model with good accuracy, this study adds a basic ResNet module to each of its existing modules, along with an SSH (Single Stage Headless Face Detector) context module to expand the model’s receptive field. The enhanced model, named Res-MobileNetV3, alleviates poor real-time performance and reduces the size of large network models, processing information at up to 33 frames per second. Although the improved model is slightly less accurate than the current state-of-the-art method on publicly available facial expression datasets, it offers a good balance of accuracy, real-time performance, model size, and model complexity in practical applications.