期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Enhanced Image Captioning via Integrated Wavelet Convolution and MobileNet V3 Architecture
1
作者 Mo Hou Bin Xu Wen Shang 《Computers, Materials & Continua》 2026年第2期897-915,共19页
Image captioning,a pivotal research area at the intersection of image understanding,artificial intelligence,and linguistics,aims to generate natural language descriptions for images.This paper proposes an efficient im... Image captioning,a pivotal research area at the intersection of image understanding,artificial intelligence,and linguistics,aims to generate natural language descriptions for images.This paper proposes an efficient image captioning model named Mob-IMWTC,which integrates improved wavelet convolution(IMWTC)with an enhanced MobileNet V3 architecture.The enhanced MobileNet V3 integrates a transformer encoder as its encoding module and a transformer decoder as its decoding module.This innovative neural network significantly reduces the memory space required and model training time,while maintaining a high level of accuracy in generating image descriptions.IMWTC facilitates large receptive fields without significantly increasing the number of parameters or computational overhead.The improvedMobileNet V3 model has its classifier removed,and simultaneously,it employs IMWTC layers to replace the original convolutional layers.This makes Mob-IMWTC exceptionally well-suited for deployment on lowresource devices.Experimental results,based on objective evaluation metrics such as BLEU,ROUGE,CIDEr,METEOR,and SPICE,demonstrate that Mob-IMWTC outperforms state-of-the-art models,including three CNN architectures(CNN-LSTM,CNN-Att-LSTM,CNN-Tran),two mainstream methods(LCM-Captioner,ClipCap),and our previous work(Mob-Tran).Subjective evaluations further validate the model’s superiority in terms of grammaticality,adequacy,logic,readability,and humanness.Mob-IMWTC offers a lightweight yet effective solution for image captioning,making it suitable for deployment on resource-constrained devices. 展开更多
关键词 Image caption wavelet convolution MobileNet V3 deep learning
在线阅读 下载PDF
Wavelet Transform-Based Bayesian Inference Learning with Conditional Variational Autoencoder for Mitigating Injection Attack in 6G Edge Network
2
作者 Binu Sudhakaran Pillai Raghavendra Kulkarni +1 位作者 Venkata Satya Suresh kumar Kondeti Surendran Rajendran 《Computer Modeling in Engineering & Sciences》 2025年第10期1141-1166,共26页
Future 6G communications will open up opportunities for innovative applications,including Cyber-Physical Systems,edge computing,supporting Industry 5.0,and digital agriculture.While automation is creating efficiencies... Future 6G communications will open up opportunities for innovative applications,including Cyber-Physical Systems,edge computing,supporting Industry 5.0,and digital agriculture.While automation is creating efficiencies,it can also create new cyber threats,such as vulnerabilities in trust and malicious node injection.Denialof-Service(DoS)attacks can stop many forms of operations by overwhelming networks and systems with data noise.Current anomaly detection methods require extensive software changes and only detect static threats.Data collection is important for being accurate,but it is often a slow,tedious,and sometimes inefficient process.This paper proposes a new wavelet transformassisted Bayesian deep learning based probabilistic(WT-BDLP)approach tomitigate malicious data injection attacks in 6G edge networks.The proposed approach combines outlier detection based on a Bayesian learning conditional variational autoencoder(Bay-LCVariAE)and traffic pattern analysis based on continuous wavelet transform(CWT).The Bay-LCVariAE framework allows for probabilistic modelling of generative features to facilitate capturing how features of interest change over time,spatially,and for recognition of anomalies.Similarly,CWT allows emphasizing the multi-resolution spectral analysis and permits temporally relevant frequency pattern recognition.Experimental testing showed that the flexibility of the Bayesian probabilistic framework offers a vast improvement in anomaly detection accuracy over existing methods,with a maximum accuracy of 98.21%recognizing anomalies. 展开更多
关键词 Bayesian inference learning automaton convolutional wavelet transform conditional variational autoencoder malicious data injection attack edge environment 6G communication
在线阅读 下载PDF
A YOLOv11 Empowered Road Defect Detection Model
3
作者 Xubo Liu Yunxiang Liu Peng Luo 《Computers, Materials & Continua》 2025年第10期1073-1094,共22页
Roads inevitably have defects during use,which not only seriously affect their service life but also pose a hidden danger to traffic safety.Existing algorithms for detecting road defects are unsatisfactory in terms of... Roads inevitably have defects during use,which not only seriously affect their service life but also pose a hidden danger to traffic safety.Existing algorithms for detecting road defects are unsatisfactory in terms of accuracy and generalization,so this paper proposes an algorithm based on YOLOv11.The method embeds wavelet transform convolution(WTConv)into the backbone’s C3k2 module to enhance low-frequency feature extraction while avoiding parameter bloat.Secondly,a novel multi-scale fusion diffusion network(MFDN)architecture is designed for the neck to strengthen cross-scale feature interactions,boosting detection precision.In terms of model optimization,the traditional downsampling method is discarded,and the innovative Adown(adaptive downsampling)technique is adopted,which streamlines the parameter scales while effectively mitigating the information loss problem during downsampling.Finally,in this paper,we propose Wise-PIDIoU by combining WiseIoU and MPDIoU to minimize the negative impact of low-quality anchor frames and enhance the detection capability of the model.The experimental results indicate that the proposed algorithm achieves an average detection accuracy of 86.5%for mAP@50 on the RDD2022 dataset,which is 2%higher than the original algorithm while ensuring that the amount of computation is basically unchanged.The number of parameters is reduced by 17%,and the F1 score is improved by 3%,showing better detection performance than other algorithms when facing different types of defects.The excellent performance on embedded devices proves that the algorithm also has favorable application prospects in practical inspection. 展开更多
关键词 Deep learning road defect detection YOLOv11 wavelet transform convolution
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部