5 articles found
Super-Resolution Generative Adversarial Network with Pyramid Attention Module for Face Generation
1
Authors: Parvathaneni Naga Srinivasu, G. JayaLakshmi, Sujatha Canavoy Narahari, Victor Hugo C. de Albuquerque, Muhammad Attique Khan, Hee-Chan Cho, Byoungchol Chang. Computers, Materials & Continua, 2025, No. 10, pp. 2117-2139 (23 pages)
The generation of high-quality, realistic faces has emerged as a key field of research in computer vision. This paper proposes a robust approach that combines a Super-Resolution Generative Adversarial Network (SRGAN) with a Pyramid Attention Module (PAM) to enhance the quality of deep face generation. The SRGAN framework is designed to improve the resolution of generated images, addressing common challenges such as blurriness and a lack of intricate detail. The Pyramid Attention Module complements this by focusing on multi-scale feature extraction, enabling the network to capture finer details and complex facial features more effectively. The proposed method was trained and evaluated over 100 epochs on the CelebA dataset, showing consistent improvements in image quality and a marked decrease in generator and discriminator losses, which reflects the model's capacity to learn and synthesize high-quality images given adequate computational resources. Experimental results show that the SRGAN model with the PAM module yields an aggregate discriminator loss of 0.055 for real images and 0.043 for fake images, and a generator loss of 10.58 after 100 epochs of training. The model achieves a structural similarity index measure (SSIM) of 0.923, higher than the other models considered in the study.
Keywords: Artificial intelligence; generative adversarial network; pyramid attention module; face generation; deep learning
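The SSIM figure quoted in the abstract above can be made concrete with a minimal sketch of the metric. This is a single-window (global) simplification written for illustration only: the standard SSIM averages the same expression over local sliding windows, and nothing here reflects how the paper's authors computed their score. The function name `global_ssim` and the default constants (`k1=0.01`, `k2=0.03`, 8-bit dynamic range) are assumptions taken from the common SSIM definition, not from the abstract.

```python
def global_ssim(x, y, dynamic_range=255.0, k1=0.01, k2=0.03):
    """Single-window SSIM between two equally sized grayscale images.

    Images are nested lists of pixel values. Hypothetical helper for
    illustration; real evaluations usually average SSIM over local windows.
    """
    a = [float(v) for row in x for v in row]
    b = [float(v) for row in y for v in row]
    if not a or len(a) != len(b):
        raise ValueError("images must be non-empty and the same size")
    n = len(a)
    mu_a = sum(a) / n
    mu_b = sum(b) / n
    var_a = sum((v - mu_a) ** 2 for v in a) / n
    var_b = sum((v - mu_b) ** 2 for v in b) / n
    cov = sum((p - mu_a) * (q - mu_b) for p, q in zip(a, b)) / n
    # Small constants that stabilise the ratio when means/variances are near 0.
    c1 = (k1 * dynamic_range) ** 2
    c2 = (k2 * dynamic_range) ** 2
    numerator = (2 * mu_a * mu_b + c1) * (2 * cov + c2)
    denominator = (mu_a ** 2 + mu_b ** 2 + c1) * (var_a + var_b + c2)
    return numerator / denominator
```

Identical images score 1, and the score falls as luminance, contrast, or structure diverge, which is why a value such as 0.923 is read as close structural agreement with the reference images.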
PARTICLE SWARM OPTIMIZATION BASED ON PYRAMID MODEL FOR SATELLITE MODULE LAYOUT (Cited by: 1)
2
Authors: Zhang Bao, Teng Hongfei. Chinese Journal of Mechanical Engineering (indexed in SCIE, EI, CAS, CSCD), 2005, No. 4, pp. 530-536 (7 pages)
To improve the global search ability of particle swarm optimization (PSO), a multi-population PSO based on a pyramid model (PPSO) is presented. It is then applied to layout optimization problems against the background of an international commercial communication satellite (INTELSAT-Ⅲ) module. Three improvements are developed: multi-population search based on the pyramid model, adaptive collision avoidance among particles, and mutation of degraded particles. In numerical examples of the layout design of this simplified satellite module, the performance of PPSO is compared with global-version PSO and local-version PSO (ring and Neumann PSO). The results show that PPSO has higher computational accuracy, efficiency, and success ratio.
Keywords: Particle swarm optimization; pyramid model; layout design; satellite module
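For readers unfamiliar with the baseline named in the abstract above, the following is a minimal sketch of plain global-version PSO. It is not the PPSO method: the pyramid-structured multi-population search, collision avoidance, and mutation steps are not reproduced, because the abstract gives no detail on them. The function names, the inertia and acceleration values (`w`, `c1`, `c2`), and the sphere test objective are all assumptions chosen for illustration.

```python
import random


def pso_minimize(f, bounds, n_particles=20, n_iters=200,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Global-best PSO sketch: every particle is attracted to one swarm-wide best."""
    rng = random.Random(seed)
    dim = len(bounds)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]              # best position seen by each particle
    pbest_val = [f(p) for p in pos]
    g = min(range(n_particles), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]
    for _ in range(n_iters):
        for i in range(n_particles):
            for d, (lo, hi) in enumerate(bounds):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Clamp the new position to the search bounds.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = f(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


def sphere(x):
    """Toy objective with its minimum of 0 at the origin (illustrative only)."""
    return sum(v * v for v in x)
```

Calling `pso_minimize(sphere, [(-5, 5), (-5, 5)])` returns the best position found and its objective value. Local-version variants such as the ring topology differ only in replacing the single `gbest` with the best position within each particle's neighbourhood.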
CSC-YOLO:An Image Recognition Model for Surface Defect Detection of Copper Strip and Plates
3
Authors: ZHANG Guo, CHEN Tao, WANG Jianping. Journal of Shanghai Jiaotong University (Science), 2025, No. 5, pp. 1037-1049 (13 pages)
To meet the requirement for accurate identification of surface defects on copper strip in industrial production, CSC-YOLO, a machine-vision-based model for surface defect detection, is proposed. The model uses YOLOv4-tiny as the benchmark network. First, K-means clustering is introduced into the benchmark network to obtain anchor boxes that match the self-built dataset. Second, a cross-region fusion module is introduced in the backbone network to address hard-to-recognize targets by fusing contextual semantic information. Third, the spatial pyramid pooling-efficient channel attention network (SPP-E) module is introduced in the path aggregation network (PANet) to enhance feature extraction. Fourth, to prevent the loss of channel information, a lightweight attention mechanism is introduced to improve network performance. Finally, adjustment factors are added to correct the loss function for the dimensional characteristics of the surface defects. CSC-YOLO was tested on the self-built dataset of copper strip surface defects. The experimental results show that the mAP of the model reaches 93.58%, a 3.37% improvement over the benchmark network, and that the FPS, although lower than that of the benchmark network, reaches 104. CSC-YOLO takes into account the real-time requirements of copper strip production. Comparison experiments with Faster RCNN, SSD300, YOLOv3, YOLOv4, Resnet50-YOLOv4, YOLOv5s, YOLOv7, and other algorithms show that the algorithm achieves a faster computation speed while maintaining a higher detection accuracy.
Keywords: copper strip surface defect detection; K-means clustering; cross-region fusion module; spatial pyramid pooling-efficient channel attention network (SPP-E) module; YOLOv4-tiny
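The anchor-clustering step mentioned in the abstract above can be sketched as follows. The abstract only says that K-means is used to fit anchors to the dataset, so the details here are assumptions: boxes are reduced to (width, height) pairs, the distance is 1 - IoU as is customary for YOLO-family anchor clustering, and initialisation is deterministic (evenly spaced picks from the area-sorted boxes) to keep the example reproducible. The names `iou_wh` and `kmeans_anchors` are hypothetical.

```python
def iou_wh(box, anchor):
    """IoU of two (width, height) boxes aligned at a shared corner."""
    inter = min(box[0], anchor[0]) * min(box[1], anchor[1])
    union = box[0] * box[1] + anchor[0] * anchor[1] - inter
    return inter / union


def kmeans_anchors(boxes, k, max_iters=100):
    """Cluster (width, height) pairs into k anchors using IoU as similarity."""
    if not 0 < k <= len(boxes):
        raise ValueError("k must be between 1 and the number of boxes")
    ordered = sorted(boxes, key=lambda b: b[0] * b[1])
    # Deterministic init (an assumption): evenly spaced boxes by area.
    anchors = [(float(w), float(h))
               for w, h in (ordered[i * len(ordered) // k] for i in range(k))]
    for _ in range(max_iters):
        clusters = [[] for _ in range(k)]
        for box in boxes:
            # Highest IoU is equivalent to the smallest 1 - IoU distance.
            best = max(range(k), key=lambda j: iou_wh(box, anchors[j]))
            clusters[best].append(box)
        updated = [
            (sum(b[0] for b in c) / len(c), sum(b[1] for b in c) / len(c))
            if c else anchors[j]              # keep the anchor of an empty cluster
            for j, c in enumerate(clusters)
        ]
        if updated == anchors:                # assignments have stabilised
            break
        anchors = updated
    return sorted(anchors, key=lambda a: a[0] * a[1])
```

Using IoU rather than Euclidean distance keeps large boxes from dominating the clustering, which matters when defect sizes vary widely within one dataset.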
Fine-Grained Action Recognition Based on Temporal Pyramid Excitation Network (Cited by: 1)
4
Authors: Xuan Zhou, Jianping Yi. Intelligent Automation & Soft Computing (indexed in SCIE), 2023, No. 8, pp. 2103-2116 (14 pages)
Mining more discriminative temporal features to enrich temporal context representation is considered the key to fine-grained action recognition. Previous action recognition methods use a fixed spatiotemporal window to learn local video representations. However, these methods fail to capture complex motion patterns because of their limited receptive field. To solve this problem, this paper proposes a lightweight Temporal Pyramid Excitation (TPE) module to capture short-, medium-, and long-term temporal context. In this method, the Temporal Pyramid (TP) module effectively expands the temporal receptive field of the network by using multi-temporal kernel decomposition without significantly increasing the computational cost. In addition, the Multi Excitation module emphasizes temporal importance to enhance temporal feature representation learning. TPE can be integrated into ResNet50 to build a compact video learning framework, TPENet. Extensive validation experiments on several challenging benchmark datasets (Something-Something V1, Something-Something V2, UCF-101, and HMDB51) demonstrate that the method achieves a preferable balance between computation and accuracy.
Keywords: Fine-grained action recognition; temporal pyramid excitation module; temporal receptive; multi-excitation module
An Improved UNet Lightweight Network for Semantic Segmentation of Weed Images in Corn Fields
5
Authors: Yu Zuo, Wenwen Li. Computers, Materials & Continua (indexed in SCIE, EI), 2024, No. 6, pp. 4413-4431 (19 pages)
In cornfields, factors such as the similarity between corn seedlings and weeds and the blurring of plant edge details pose challenges to corn and weed segmentation. In addition, remote areas such as farmland are usually constrained by limited computational resources and limited collected data. It is therefore necessary to lighten the model so that it adapts better to complex cornfield scenes and makes full use of the limited data. In this paper, we propose an improved image segmentation algorithm based on UNet. First, the inverted residual structure is introduced into the contraction path to reduce the number of parameters in the training process and improve feature extraction; second, the pyramid pooling module is introduced to enhance the network's ability to acquire contextual information and to deal with the small-target loss problem; finally, to further enhance the segmentation capability of the model, the squeeze-and-excitation mechanism is introduced in the expansion path. We used images of corn seedlings collected in the field and publicly available corn weed datasets to evaluate the improved model. The improved model has 3.79 M parameters in total and achieves an mIoU of 87.9%. The FPS on a single 3050 Ti graphics card is about 58.9. The experimental results show that the proposed network can quickly segment corn and weeds in a cornfield scenario with good segmentation accuracy.
Keywords: Semantic segmentation; deep learning; UNet; pyramid pooling module
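The pyramid pooling idea referred to in the abstract above can be illustrated with a small sketch of its pooling stage. This is a simplified assumption-laden example on a single-channel feature map stored as nested lists: a full pyramid pooling module would also apply convolutions to each pooled map, upsample them back to the input size, and concatenate them with the original features. The default bin sizes `(1, 2, 3, 6)` follow the commonly used PSPNet configuration and are not stated in the abstract; the function names are hypothetical.

```python
def adaptive_avg_pool(fmap, bins):
    """Average-pool a 2D feature map (nested lists) down to a bins x bins grid."""
    h, w = len(fmap), len(fmap[0])
    pooled = []
    for i in range(bins):
        # Window edges: floor for the start, ceiling for the end, so every
        # window is non-empty even when bins does not divide the size evenly.
        r0, r1 = (i * h) // bins, ((i + 1) * h + bins - 1) // bins
        row = []
        for j in range(bins):
            c0, c1 = (j * w) // bins, ((j + 1) * w + bins - 1) // bins
            cells = [fmap[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            row.append(sum(cells) / len(cells))
        pooled.append(row)
    return pooled


def pyramid_pool(fmap, bin_sizes=(1, 2, 3, 6)):
    """Return one pooled map per pyramid level, keyed by bin size."""
    return {b: adaptive_avg_pool(fmap, b) for b in bin_sizes}
```

The coarsest level (one bin) summarises the whole map as global context, while finer levels retain progressively more spatial layout, which is what lets the module combine scene-level and region-level information.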