期刊文献+
共找到9篇文章
< 1 >
每页显示 20 50 100
A Fine-Grained Image Classification Model Based on Hybrid Attention and Pyramidal Convolution
1
作者 Sifeng Wang Shengxiang Li +3 位作者 Anran Li Zhaoan Dong Guangshun Li Chao Yan 《Tsinghua Science and Technology》 2025年第3期1283-1293,共11页
Finding more specific subcategories within a larger category is the goal of fine-grained image classification(FGIC),and the key is to find local discriminative regions of visual features.Most existing methods use trad... Finding more specific subcategories within a larger category is the goal of fine-grained image classification(FGIC),and the key is to find local discriminative regions of visual features.Most existing methods use traditional convolutional operations to achieve fine-grained image classification.However,traditional convolution cannot extract multi-scale features of an image and existing methods are susceptible to interference from image background information.Therefore,to address the above problems,this paper proposes an FGIC model(Attention-PCNN)based on hybrid attention mechanism and pyramidal convolution.The model feeds the multi-scale features extracted by the pyramidal convolutional neural network into two branches capturing global and local information respectively.In particular,a hybrid attention mechanism is added to the branch capturing global information in order to reduce the interference of image background information and make the model pay more attention to the target region with fine-grained features.In addition,the mutual-channel loss(MC-LOSS)is introduced in the local information branch to capture fine-grained features.We evaluated the model on three publicly available datasets CUB-200-2011,Stanford Cars,FGVCAircraft,etc.Compared to the state-of-the-art methods,the results show that Attention-PCNN performs better. 展开更多
关键词 fine-grained image classification pyramidal convolution hybrid attention
原文传递
A Novel Self-Supervised Learning Network for Binocular Disparity Estimation 被引量:1
2
作者 Jiawei Tian Yu Zhou +5 位作者 Xiaobing Chen Salman A.AlQahtani Hongrong Chen Bo Yang Siyu Lu Wenfeng Zheng 《Computer Modeling in Engineering & Sciences》 SCIE EI 2025年第1期209-229,共21页
Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination,hindering accurate three-dimensional lesion reconstruction by surgical robots.This st... Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination,hindering accurate three-dimensional lesion reconstruction by surgical robots.This study proposes a novel end-to-end disparity estimation model to address these challenges.Our approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions,integrating multi-scale image information to enhance robustness against lighting interferences.This study introduces a Pseudo-Siamese structure-based disparity regression model that simplifies left-right image comparison,improving accuracy and efficiency.The model was evaluated using a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot,comprising simulated silicone heart sequences and real heart video data.Experimental results demonstrate significant improvement in the network’s resistance to lighting interference without substantially increasing parameters.Moreover,the model exhibited faster convergence during training,contributing to overall performance enhancement.This study advances endoscopic image processing accuracy and has potential implications for surgical robot applications in complex environments. 展开更多
关键词 Parallax estimation parallax regression model self-supervised learning Pseudo-Siamese neural network pyramid dilated convolution binocular disparity estimation
在线阅读 下载PDF
Enhancing Classroom Behavior Recognition with Lightweight Multi-Scale Feature Fusion
3
作者 Chuanchuan Wang Ahmad Sufril Azlan Mohamed +3 位作者 Xiao Yang Hao Zhang Xiang Li Mohd Halim Bin Mohd Noor 《Computers, Materials & Continua》 2025年第10期855-874,共20页
Classroom behavior recognition is a hot research topic,which plays a vital role in assessing and improving the quality of classroom teaching.However,existing classroom behavior recognition methods have challenges for ... Classroom behavior recognition is a hot research topic,which plays a vital role in assessing and improving the quality of classroom teaching.However,existing classroom behavior recognition methods have challenges for high recognition accuracy with datasets with problems such as scenes with blurred pictures,and inconsistent objects.To address this challenge,we proposed an effective,lightweight object detector method called the RFNet model(YOLO-FR).The YOLO-FR is a lightweight and effective model.Specifically,for efficient multi-scale feature extraction,effective feature pyramid shared convolutional(FPSC)was designed to improve the feature extract performance by leveraging convolutional layers with varying dilation rates from the input image in the backbone.Secondly,to address the problem of multi-scale variability in the scene,we design the Rep Ghost fusion Cross Stage Partial and Efficient Layer Aggregation Network(RGCSPELAN)to improve the network performance further and reduce the amount of computation and the number of parameters.In addition,by conducting experimental valuation on the SCB dataset3 and STBD-08 dataset.Experimental results indicate that,compared to the baseline model,the RFNet model has increased mean accuracy precision(mAP@50)from 69.6%to 71.0%on the SCB dataset3 and from 91.8%to 93.1%on the STBD-08 dataset.The RFNet approach has effectiveness precision at 68.6%,surpassing the baseline method(YOLOv11)at 3.3%and archieve the minimal size(4.9 M)on the SCB dataset3.Finally,comparing it with other algorithms,it accurately detects student behavior in complex classroom environments results confirmed that RFNet is well-suited for real-time and efficiently recognizing classroom behaviors. 展开更多
关键词 Classroom action recognition YOLO-FR feature pyramid shared convolutional rep ghost cross stage partial efficient layer aggregation network(RGCSPELAN)
在线阅读 下载PDF
Automatic Segmentation Method for Cone-Beam Computed Tomography Image of the Bone Graft Region within Maxillary Sinus Based on the Atrous Spatial Pyramid Convolution Network 被引量:1
4
作者 XU Jiangchang HE Shamin +2 位作者 YU Dedong WU Yiqun CHEN Xiaojun 《Journal of Shanghai Jiaotong university(Science)》 EI 2021年第3期298-305,共8页
Sinus floor elevation with a lateral window approach requires bone graft(BG)to ensure sufficient bone mass,and it is necessary to measure and analyse the BG region for follow-up of postoperative patients.However,the B... Sinus floor elevation with a lateral window approach requires bone graft(BG)to ensure sufficient bone mass,and it is necessary to measure and analyse the BG region for follow-up of postoperative patients.However,the BG region from cone-beam computed tomography(CBCT)images is connected to the margin of the maxillary sinus,and its boundary is blurred.Common segmentation methods are usually performed manually by experienced doctors,and are complicated by challenges such as low efficiency and low precision.In this study,an auto-segmentation approach was applied to the BG region within the maxillary sinus based on an atrous spatial pyramid convolution(ASPC)network.The ASPC module was adopted using residual connections to compose multiple atrous convolutions,which could extract more features on multiple scales.Subsequently,a segmentation network of the BG region with multiple ASPC modules was established,which effectively improved the segmentation performance.Although the training data were insufficient,our networks still achieved good auto-segmentation results,with a dice coefficient(Dice)of 87.13%,an Intersection over Union(Iou)of 78.01%,and a sensitivity of 95.02%.Compared with other methods,our method achieved a better segmentation effect,and effectively reduced the misjudgement of segmentation.Our method can thus be used to implement automatic segmentation of the BG region and improve doctors’work efficiency,which is of great importance for developing preliminary studies on the measurement of postoperative BG within the maxillary sinus. 展开更多
关键词 atrous spatial pyramid convolution(ASPC) bone graft(BG)region medical image segmentation residual connection
原文传递
An adaptive physics-informed deep learning method for pore pressure prediction using seismic data 被引量:6
5
作者 Xin Zhang Yun-Hu Lu +2 位作者 Yan Jin Mian Chen Bo Zhou 《Petroleum Science》 SCIE EI CAS CSCD 2024年第2期885-902,共18页
Accurate prediction of formation pore pressure is essential to predict fluid flow and manage hydrocarbon production in petroleum engineering.Recent deep learning technique has been receiving more interest due to the g... Accurate prediction of formation pore pressure is essential to predict fluid flow and manage hydrocarbon production in petroleum engineering.Recent deep learning technique has been receiving more interest due to the great potential to deal with pore pressure prediction.However,most of the traditional deep learning models are less efficient to address generalization problems.To fill this technical gap,in this work,we developed a new adaptive physics-informed deep learning model with high generalization capability to predict pore pressure values directly from seismic data.Specifically,the new model,named CGP-NN,consists of a novel parametric features extraction approach(1DCPP),a stacked multilayer gated recurrent model(multilayer GRU),and an adaptive physics-informed loss function.Through machine training,the developed model can automatically select the optimal physical model to constrain the results for each pore pressure prediction.The CGP-NN model has the best generalization when the physicsrelated metricλ=0.5.A hybrid approach combining Eaton and Bowers methods is also proposed to build machine-learnable labels for solving the problem of few labels.To validate the developed model and methodology,a case study on a complex reservoir in Tarim Basin was further performed to demonstrate the high accuracy on the pore pressure prediction of new wells along with the strong generalization ability.The adaptive physics-informed deep learning approach presented here has potential application in the prediction of pore pressures coupled with multiple genesis mechanisms using seismic data. 展开更多
关键词 Pore pressure prediction Seismic data 1D convolution pyramid pooling Adaptive physics-informed loss function High generalization capability
原文传递
A Novel Method of Heart Failure Prediction Based on DPCNN-XGBOOST Model 被引量:4
6
作者 Yuwen Chen Xiaolin Qin +1 位作者 Lige Zhang Bin Yi 《Computers, Materials & Continua》 SCIE EI 2020年第10期495-510,共16页
The occurrence of perioperative heart failure will affect the quality of medical services and threaten the safety of patients.Existing methods depend on the judgment of doctors,the results are affected by many factors... The occurrence of perioperative heart failure will affect the quality of medical services and threaten the safety of patients.Existing methods depend on the judgment of doctors,the results are affected by many factors such as doctors’knowledge and experience.The accuracy is difficult to guarantee and has a serious lag.In this paper,a mixture prediction model is proposed for perioperative adverse events of heart failure,which combined with the advantages of the Deep Pyramid Convolutional Neural Networks(DPCNN)and Extreme Gradient Boosting(XGBOOST).The DPCNN was used to automatically extract features from patient’s diagnostic texts,and the text features were integrated with the preoperative examination and intraoperative monitoring values of patients,then the XGBOOST algorithm was used to construct the prediction model of heart failure.An experimental comparison was conducted on the model based on the data of patients with heart failure in southwest hospital from 2014 to 2018.The results showed that the DPCNN-XGBOOST model improved the predictive sensitivity of the model by 3%and 31%compared with the text-based DPCNN Model and the numeric-based XGBOOST Model. 展开更多
关键词 Deep pyramid convolutional neural networks extreme gradient boosting heart failure prediction
在线阅读 下载PDF
Facial Expression Recognition Based on the Fusion of Infrared and Visible Image
7
作者 Jiancheng Zou Jiaxin Li +2 位作者 Juncun Wei Zhengzheng Li Xin Yang 《Journal on Artificial Intelligence》 2021年第3期123-134,共12页
Facial expression recognition is a research hot spot in the fields of computer vision and pattern recognition.However,the existing facial expression recognition models are mainly concentrated in the visible light envi... Facial expression recognition is a research hot spot in the fields of computer vision and pattern recognition.However,the existing facial expression recognition models are mainly concentrated in the visible light environment.They have insufficient generalization ability and low recognition accuracy,and are vulnerable to environmental changes such as illumination and distance.In order to solve these problems,we combine the advantages of the infrared and visible images captured simultaneously by array equipment our developed with two infrared and two visible lens,so that the fused image not only has the texture information of visible image,but also has the contrast information of infrared image.On the other hand,we improved the WGAN by adding SSIM and LBP loss functions to ensure the structural similarity between the fused image and infrared image,and also the texture similarity between the fused image and visible image respectively.Finally,a facial expression recognition model Pyconv-SE18 with pyramid convolution and attention mechanism module is designed to extract the important feature information of facial expression in multiple scales.We add cosine distance loss function to reduce the feature difference within the class.Experiment results show that the robustness of expression recognition algorithm to illumination is improved based on the fused images.The accuracy of this model on FER2013 and CK+public data sets are 69.3%and 94.6%,respectively. 展开更多
关键词 Image fusion expression recognition pyramid convolution attention mechanism
在线阅读 下载PDF
Power Grid Fault Diagnosis Based on Deep Pyramid Convolutional Neural Network 被引量:2
8
作者 Xu Zhang Huiting Zhang +4 位作者 Dongying Zhang Yixian Wang Ruiting Ding Yuchuan Zheng Yongxu Zhang 《CSEE Journal of Power and Energy Systems》 SCIE EI CSCD 2023年第6期2188-2203,共16页
Existing power grid fault diagnosis methods relyon manual experience to design diagnosis models, lack theability to extract fault knowledge, and are difficult to adaptto complex and changeable engineering sites. Consi... Existing power grid fault diagnosis methods relyon manual experience to design diagnosis models, lack theability to extract fault knowledge, and are difficult to adaptto complex and changeable engineering sites. Considering thissituation, this paper proposes a power grid fault diagnosismethod based on a deep pyramid convolutional neural networkfor the alarm information set. This approach uses the deepfeature extraction ability of the network to extract fault featureknowledge from alarm information texts and achieve end-to-endfault classification and fault device identification. First, a deeppyramid convolutional neural network model for extracting theoverall characteristics of fault events is constructed to identifyfault types. Second, a deep pyramidal convolutional neuralnetwork model for alarm information text is constructed, thetext description characteristics associated with alarm informationtexts are extracted, the key information corresponding to faultsin the alarm information set is identified, and suspicious faultydevices are selected. Then, a fault device identification strategythat integrates fault-type and time sequence priorities is proposedto identify faulty devices. Finally, the actual fault cases and thefault cases generated by the simulation are studied, and theresults verify the effectiveness and practicability of the methodpresented in this paper. 展开更多
关键词 Alarm information deep pyramid convolutional neural network fault classification fault device identification feature extraction key information
原文传递
基于双支路特征融合的MRI颅脑肿瘤图像分割研究 被引量:2
9
作者 熊炜 周蕾 +2 位作者 乐玲 张开 李利荣 《光电子.激光》 CAS CSCD 北大核心 2022年第4期383-392,共10页
针对磁共振成像(magnetic resonance imaging, MRI)颅脑肿瘤区域误识别与分割网络空间信息丢失问题,提出一种基于双支路特征融合的MRI脑肿瘤图像分割方法。首先通过主支路的重构VGG与注意力模型(re-parameterization visual geometry gr... 针对磁共振成像(magnetic resonance imaging, MRI)颅脑肿瘤区域误识别与分割网络空间信息丢失问题,提出一种基于双支路特征融合的MRI脑肿瘤图像分割方法。首先通过主支路的重构VGG与注意力模型(re-parameterization visual geometry group and attention model, RVAM)提取网络的上下文信息,然后使用可变形卷积与金字塔池化模型(deformable convolution and pyramid pooling model, DCPM)在副支路获取丰富的空间信息,之后使用特征融合模块对两支路的特征信息进行融合。最后引入注意力模型,在上采样过程中加强分割目标在解码时的权重。提出的方法在Kaggle_3m数据集和BraTS2019数据集上进行了实验验证,实验结果表明该方法具有良好的脑肿瘤分割性能,其中在Kaggle_3m上,Dice相似系数、杰卡德系数分别达到了91.45%和85.19%。 展开更多
关键词 磁共振成像(magnetic resonance imaging MRI)颅脑肿瘤图像分割 双支路特征融合 重构VGG与注意力模型(re-parameterization visual geometry group and attention model RVAM) 可变形卷积与金字塔池化模型(deformable convolution and pyramid pooling model DCPM)
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部