Funding: Supported by the National Natural Science Foundation of China (No. 61601174), the Postdoctoral Research Foundation of Heilongjiang Province (No. LBH-Q17150), and the Science and Technology Innovative Research Team in Higher Educational Institutions of Heilongjiang Province (No. 2012TD007).
Abstract: To balance speed and accuracy in the semantic segmentation of urban street images for autonomous driving, we propose an improved U-Net network. First, to improve the model's representation capability, the network is divided into three parts: a shallow layer, an intermediate layer, and a deep layer. Each part uses an attention mechanism suited to its feature extraction characteristics: a spatial attention module in the shallow network, a dual attention module in the intermediate network, and a channel attention module in the deep network. In addition, standard convolution is replaced by depthwise separable convolution in all three parts, which greatly reduces the number of network parameters and increases the network's operating speed. Experimental results on three datasets show that the improved U-Net model achieves better segmentation accuracy and speed on street images. The average mean intersection over union (MIoU) is 68.8%, an increase of 9.2%, and the computation time is about 38 ms/frame. The model can segment 27 frames per second, which meets the real-time and accuracy requirements for semantic segmentation of urban street images.
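The parameter saving from depthwise separable convolution can be illustrated with a simple weight count. This is a minimal sketch, not the authors' code: the function names, the 3 x 3 kernel, and the 64 -> 128 channel widths are assumptions chosen for illustration, and bias terms are ignored.

```python
def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a k x k depthwise conv followed by a 1 x 1 pointwise conv."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes channels
    return depthwise + pointwise


if __name__ == "__main__":
    # Assumed example layer: 3 x 3 kernel, 64 input and 128 output channels.
    std = standard_conv_params(64, 128, 3)
    sep = depthwise_separable_params(64, 128, 3)
    print(std, sep, round(sep / std, 3))
```

For any layer the ratio of the two counts is 1/c_out + 1/k^2, so with a 3 x 3 kernel the separable form needs a little more than one ninth of the weights. The actual saving in the paper's network depends on its layer widths, which the abstract does not give.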
Funding: Supported by the China National Oil and Gas Major Project (2016ZX05010-003) and the PetroChina Science and Technology Major Project (2019B1210, 2021DJ1201).
Abstract: To address the limitations of the convolutional neural network–principal component analysis (CNN-PCA) method in finely describing and generalizing complex reservoir geological features, this study proposes a 3D attention U-Net network that does not rely on a trained C3D video motion analysis model to extract the style of a 3D model. The network is used to restore the details of the geologic model that are lost in PCA dimension reduction. It was applied to a complex river channel sandstone reservoir to test its effectiveness. The results show that, compared with the CNN-PCA method, the 3D attention U-Net network better restores the details of the geological model lost in PCA dimension reduction, better reflects the fluid flow features of the original geologic model, and improves history matching results.
Funding: Supported in part by the National Natural Science Foundation of China (61701403, 82122033, 81871379), the National Key Research and Development Program of China (2016YFC0103804, 2019YFC1521103, 2020YFC1523301, 2019YFC-1521102), Key R&D Projects in Shaanxi Province (2019ZDLSF07-02, 2019ZDLGY10-01), Key R&D Projects in Qinghai Province (2020-SF-143), the China Post-doctoral Science Foundation (2018M643719), and the Young Talent Support Program of the Shaanxi Association for Science and Technology (20190107).
Abstract: Brown adipose tissue (BAT) is a type of adipose tissue involved in thermoregulatory thermogenesis, metaboloregulatory thermogenesis, and secretion. Current studies have shown that BAT activity is negatively correlated with adult body weight, and BAT is considered a target tissue for the treatment of obesity and other metabolic diseases. BAT activity also differs with age and gender. Clinically, BAT segmentation based on PET/CT data is a reliable method for brown fat research. However, most current BAT segmentation methods rely on the experience of doctors. In this paper, an improved U-Net network, ICA-Unet, is proposed to achieve automatic and precise segmentation of BAT. First, the traditional 2D convolution layer in the encoder is replaced with a depth-wise over-parameterized convolutional (Do-Conv) layer. Second, a channel attention block is introduced between the two convolution layers. Finally, an image information entropy (IIE) block is added to the skip connections to strengthen edge features. The method is evaluated on a dataset of PET/CT images from 368 patients. The results show strong agreement between the automatic segmentation of BAT and manual annotation by experts. The average Dice similarity coefficient (DSC) is 0.9057, and the average Hausdorff distance is 7.2810. The experimental results suggest that the proposed method achieves efficient and accurate automatic BAT segmentation and satisfies the clinical requirements for BAT segmentation.
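The two reported metrics can be sketched on toy 2D masks. This is an illustrative implementation under stated assumptions, not the authors' evaluation code: masks are represented as sets of (row, column) pixel coordinates, all function names are hypothetical, and distances are in pixel units because the abstract does not state the unit of its Hausdorff distance.

```python
import math
from typing import Iterable, Set, Tuple

Point = Tuple[int, int]


def dice_coefficient(pred: Set[Point], truth: Set[Point]) -> float:
    """Dice = 2|A & B| / (|A| + |B|); taken as 1.0 when both masks are empty."""
    if not pred and not truth:
        return 1.0
    return 2.0 * len(pred & truth) / (len(pred) + len(truth))


def directed_hausdorff(a: Iterable[Point], b: Iterable[Point]) -> float:
    """Largest distance from a point in a to its nearest point in b."""
    b = list(b)
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff_distance(a: Set[Point], b: Set[Point]) -> float:
    """Symmetric Hausdorff distance between two non-empty point sets."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


if __name__ == "__main__":
    # Two 2 x 2 squares that overlap in one column (assumed toy data).
    pred = {(0, 0), (0, 1), (1, 0), (1, 1)}
    truth = {(0, 1), (1, 1), (0, 2), (1, 2)}
    print(dice_coefficient(pred, truth), hausdorff_distance(pred, truth))
```

Dice rewards overlap, while the Hausdorff distance penalizes the single worst boundary mismatch, which is why the two are commonly reported together. The brute-force nearest-point search here is quadratic and only suitable for small examples.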
Funding: Supported by the National Key Research and Development Program in the 14th Five-Year Plan (2021YFA1200700), the National Natural Science Foundation of China (62535018, 62431025, 62561160113), and the Natural Science Foundation of Shanghai (23ZR1473400).
Abstract: Near-infrared image sensors are widely used in fields such as material identification, machine vision, and autonomous driving. Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with silicon-based readout circuits in a single step. Based on this, we propose a photodiode with an n-i-p structure that removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors, thus reducing manufacturing costs. Because the noise in images captured by quantum dot image sensors is complex, traditional denoising and non-uniformity correction methods often do not achieve optimal results. To handle the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector images, a network architecture has been developed that incorporates multiple key modules. The network combines channel attention and spatial attention mechanisms, dynamically adjusting the importance of feature maps to enhance its ability to distinguish noise from detail. A residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion. In addition, a pyramid pooling module captures information at different scales, improving the network's multi-scale feature representation. Working together, these modules allow the network to better handle mixed noise and image non-uniformity. Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks.
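How a pyramid pooling module captures information at several scales can be shown with a small sketch. This is an assumed simplification, not the network described in the abstract: it pools a single-channel 2D map, the bin sizes (1, 2, 4) and function names are hypothetical, and the convolution, upsampling, and concatenation steps that normally follow the pooling are omitted.

```python
import math
from typing import List, Sequence

Grid = List[List[float]]


def adaptive_avg_pool(feature: Grid, bins: int) -> Grid:
    """Average-pool a 2D map down to a bins x bins grid of cells."""
    h, w = len(feature), len(feature[0])
    out = []
    for i in range(bins):
        # Row range covered by output cell i (start floored, end ceiled).
        r0, r1 = (i * h) // bins, math.ceil((i + 1) * h / bins)
        row = []
        for j in range(bins):
            c0, c1 = (j * w) // bins, math.ceil((j + 1) * w / bins)
            cell = [feature[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            row.append(sum(cell) / len(cell))
        out.append(row)
    return out


def pyramid_pool(feature: Grid, bin_sizes: Sequence[int] = (1, 2, 4)) -> List[Grid]:
    """Pool the same map at several grid sizes, coarsest first."""
    return [adaptive_avg_pool(feature, b) for b in bin_sizes]


if __name__ == "__main__":
    # Assumed toy input: a 4 x 4 map holding the values 0..15.
    feature = [[float(4 * r + c) for c in range(4)] for r in range(4)]
    for level in pyramid_pool(feature):
        print(level)
```

The 1 x 1 level summarizes the whole image, while finer levels keep progressively more local structure, which is the kind of context that could help separate image-wide stripe patterns from local detail.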