期刊文献+
共找到1,392篇文章
< 1 2 70 >
每页显示 20 50 100
Unstructured Road Extraction in UAV Images based on Lightweight Model 被引量:1
1
作者 Di Zhang Qichao An +3 位作者 Xiaoxue Feng Ronghua Liu Jun Han Feng Pan 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2024年第2期372-384,共13页
There is no unified planning standard for unstructured roads,and the morphological structures of these roads are complex and varied.It is important to maintain a balance between accuracy and speed for unstructured roa... There is no unified planning standard for unstructured roads,and the morphological structures of these roads are complex and varied.It is important to maintain a balance between accuracy and speed for unstructured road extraction models.Unstructured road extraction algorithms based on deep learning have problems such as high model complexity,high computational cost,and the inability to adapt to current edge computing devices.Therefore,it is best to use lightweight network models.Considering the need for lightweight models and the characteristics of unstructured roads with different pattern shapes,such as blocks and strips,a TMB(Triple Multi-Block)feature extraction module is proposed,and the overall structure of the TMBNet network is described.The TMB module was compared with SS-nbt,Non-bottleneck-1D,and other modules via experiments.The feasibility and effectiveness of the TMB module design were proven through experiments and visualizations.The comparison experiment,using multiple convolution kernel categories,proved that the TMB module can improve the segmentation accuracy of the network.The comparison with different semantic segmentation networks demonstrates that the TMBNet network has advantages in terms of unstructured road extraction. 展开更多
关键词 Unstructured road lightweight model Triple Multi-Block(TMB) Semantic segmentation net
在线阅读 下载PDF
TELL-Me:A time-series-decomposition-based ensembled lightweight learning model for diverse battery prognosis and diagnosis 被引量:1
2
作者 Kun-Yu Liu Ting-Ting Wang +2 位作者 Bo-Bo Zou Hong-Jie Peng Xinyan Liu 《Journal of Energy Chemistry》 2025年第7期1-8,共8页
As batteries become increasingly essential for energy storage technologies,battery prognosis,and diagnosis remain central to ensure reliable operation and effective management,as well as to aid the in-depth investigat... As batteries become increasingly essential for energy storage technologies,battery prognosis,and diagnosis remain central to ensure reliable operation and effective management,as well as to aid the in-depth investigation of degradation mechanisms.However,dynamic operating conditions,cell-to-cell inconsistencies,and limited availability of labeled data have posed significant challenges to accurate and robust prognosis and diagnosis.Herein,we introduce a time-series-decomposition-based ensembled lightweight learning model(TELL-Me),which employs a synergistic dual-module framework to facilitate accurate and reliable forecasting.The feature module formulates features with physical implications and sheds light on battery aging mechanisms,while the gradient module monitors capacity degradation rates and captures aging trend.TELL-Me achieves high accuracy in end-of-life prediction using minimal historical data from a single battery without requiring offline training dataset,and demonstrates impressive generality and robustness across various operating conditions and battery types.Additionally,by correlating feature contributions with degradation mechanisms across different datasets,TELL-Me is endowed with the diagnostic ability that not only enhances prediction reliability but also provides critical insights into the design and optimization of next-generation batteries. 展开更多
关键词 Battery prognosis Interpretable machine learning Degradation diagnosis Ensemble learning Online prediction lightweight model
在线阅读 下载PDF
A Knowledge-Distilled CharacterBERT-BiLSTM-ATT Framework for Lightweight DGA Detection in IoT Devices
3
作者 Chengqi Liu YongtaoLi +1 位作者 Weiping Zou Deyu Lin 《Computers, Materials & Continua》 2026年第4期2049-2068,共20页
With the large-scale deployment of the Internet ofThings(IoT)devices,their weak securitymechanisms make them prime targets for malware attacks.Attackers often use Domain Generation Algorithm(DGA)to generate random dom... With the large-scale deployment of the Internet ofThings(IoT)devices,their weak securitymechanisms make them prime targets for malware attacks.Attackers often use Domain Generation Algorithm(DGA)to generate random domain names,hiding the real IP of Command and Control(C&C)servers to build botnets.Due to the randomness and dynamics of DGA,traditional methods struggle to detect them accurately,increasing the difficulty of network defense.This paper proposes a lightweight DGA detection model based on knowledge distillation for resource-constrained IoT environments.Specifically,a teacher model combining CharacterBERT,a bidirectional long short-term memory(BiLSTM)network,and attention mechanism(ATT)is constructed:it extracts character-level semantic features viaCharacterBERT,captures sequence dependencieswith the BiLSTM,and integrates theATT for key feature weighting,formingmulti-granularity feature fusion.An improved knowledge distillation approach transfers the teacher model’s learned knowledge to the simplified DistilBERT student model.Experimental results show the teacher model achieves 98.68%detection accuracy.The student modelmaintains slightly improved accuracy while significantly compressing parameters to approximately 38.4%of the teacher model’s scale,greatly reducing computational overhead for IoT deployment. 展开更多
关键词 IoT security DGA detection knowledge distillation lightweight model edge computing
在线阅读 下载PDF
YOLO-SPDNet:Multi-Scale Sequence and Attention-Based Tomato Leaf Disease Detection Model
4
作者 Meng Wang Jinghan Cai +6 位作者 Wenzheng Liu Xue Yang Jingjing Zhang Qiangmin Zhou Fanzhen Wang Hang Zhang Tonghai Liu 《Phyton-International Journal of Experimental Botany》 2026年第1期290-308,共19页
Tomato is a major economic crop worldwide,and diseases on tomato leaves can significantly reduce both yield and quality.Traditional manual inspection is inefficient and highly subjective,making it difficult to meet th... Tomato is a major economic crop worldwide,and diseases on tomato leaves can significantly reduce both yield and quality.Traditional manual inspection is inefficient and highly subjective,making it difficult to meet the requirements of early disease identification in complex natural environments.To address this issue,this study proposes an improved YOLO11-based model,YOLO-SPDNet(Scale Sequence Fusion,Position-Channel Attention,and Dual Enhancement Network).The model integrates the SEAM(Self-Ensembling Attention Mechanism)semantic enhancement module,the MLCA(Mixed Local Channel Attention)lightweight attention mechanism,and the SPA(Scale-Position-Detail Awareness)module composed of SSFF(Scale Sequence Feature Fusion),TFE(Triple Feature Encoding),and CPAM(Channel and Position Attention Mechanism).These enhancements strengthen fine-grained lesion detection while maintaining model lightweightness.Experimental results show that YOLO-SPDNet achieves an accuracy of 91.8%,a recall of 86.5%,and an mAP@0.5 of 90.6%on the test set,with a computational complexity of 12.5 GFLOPs.Furthermore,the model reaches a real-time inference speed of 987 FPS,making it suitable for deployment on mobile agricultural terminals and online monitoring systems.Comparative analysis and ablation studies further validate the reliability and practical applicability of the proposed model in complex natural scenes. 展开更多
关键词 Tomato disease detection YOLO multi-scale feature fusion attention mechanism lightweight model
在线阅读 下载PDF
A lightweight pure visual BEV perception method based on dual distillation of spatial-temporal knowledge
5
作者 LIU Bingdong YU Ruihang +1 位作者 XIONG Zhiming WU Meiping 《Journal of Systems Engineering and Electronics》 2026年第1期36-44,共9页
Bird's-eye-view(BEV)perception is a core technology for autonomous driving systems.However,existing solutions face the dilemma of high costs associated with multimodal methods and limited performance of vision-onl... Bird's-eye-view(BEV)perception is a core technology for autonomous driving systems.However,existing solutions face the dilemma of high costs associated with multimodal methods and limited performance of vision-only approaches.To address this issue,this paper proposes a framework named“a lightweight pure visual BEV perception method based on dual distillation of spatial-temporal knowledge”.This framework innovatively designs a lightweight vision-only student model based on Res Net,which leverages a dual distillation mechanism to learn from a powerful teacher model that integrates temporal information from both image and light detection and ranging(LiDAR)modalities.Specifically,we distill efficient multi-modal feature extraction and spatial fusion capabilities from the BEVFusion model,and distill advanced temporal information fusion and spatiotemporal attention mechanisms from the BEVFormer model.This dual distillation strategy enables the student model to achieve perception performance close to that of multi-modal models without relying on Li DAR.Experimental results on the nu Scenes dataset demonstrate that the proposed model significantly outperforms classical vision-only algorithms,achieves comparable performance to current state-of-the-art vision-only methods on the nu Scenes detection leaderboard in terms of both mean average precision(mAP)and the nu Scenes detection score(NDS)metrics,and exhibits notable advantages in inference computational efficiency.Although the proposed dual-teacher paradigm incurs higher offline training costs compared to single-model approaches,it yields a streamlined and highly efficient student model suitable for resource-constrained real-time deployment.This provides an effective pathway toward low-cost,high-performance autonomous driving perception systems. 展开更多
关键词 3D object detection bird's-eye-view(BEV) knowledge distillation multimodal fusion lightweight model
在线阅读 下载PDF
Lightweight Small Defect Detection with YOLOv8 Using Cascaded Multi-Receptive Fields and Enhanced Detection Heads
6
作者 Shengran Zhao Zhensong Li +2 位作者 Xiaotan Wei Yutong Wang Kai Zhao 《Computers, Materials & Continua》 2026年第1期1278-1291,共14页
In printed circuit board(PCB)manufacturing,surface defects can significantly affect product quality.To address the performance degradation,high false detection rates,and missed detections caused by complex backgrounds... In printed circuit board(PCB)manufacturing,surface defects can significantly affect product quality.To address the performance degradation,high false detection rates,and missed detections caused by complex backgrounds in current intelligent inspection algorithms,this paper proposes CG-YOLOv8,a lightweight and improved model based on YOLOv8n for PCB surface defect detection.The proposed method optimizes the network architecture and compresses parameters to reduce model complexity while maintaining high detection accuracy,thereby enhancing the capability of identifying diverse defects under complex conditions.Specifically,a cascaded multi-receptive field(CMRF)module is adopted to replace the SPPF module in the backbone to improve feature perception,and an inverted residual mobile block(IRMB)is integrated into the C2f module to further enhance performance.Additionally,conventional convolution layers are replaced with GSConv to reduce computational cost,and a lightweight Convolutional Block Attention Module based Convolution(CBAMConv)module is introduced after Grouped Spatial Convolution(GSConv)to preserve accuracy through attention mechanisms.The detection head is also optimized by removing medium and large-scale detection layers,thereby enhancing the model’s ability to detect small-scale defects and further reducing complexity.Experimental results show that,compared to the original YOLOv8n,the proposed CG-YOLOv8 reduces parameter count by 53.9%,improves mAP@0.5 by 2.2%,and increases precision and recall by 2.0%and 1.8%,respectively.These improvements demonstrate that CG-YOLOv8 offers an efficient and lightweight solution for PCB surface defect detection. 展开更多
关键词 YOLOv8n PCB surface defect detection lightweight model small object detection
在线阅读 下载PDF
Ultra-lightweight CNN design based on neural architecture search and knowledge distillation: A novel method to build the automatic recognition model of space target ISAR images 被引量:7
7
作者 Hong Yang Ya-sheng Zhang +1 位作者 Can-bin Yin Wen-zhe Ding 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2022年第6期1073-1095,共23页
In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of th... In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of the space target inverse synthetic aperture radar(ISAR)image recognition model with ultra-lightweight and high accuracy.This method introduces the NAS method into the radar image recognition for the first time,which solves the time-consuming and labor-consuming problems in the artificial design of the space target ISAR image automatic recognition model(STIIARM).On this basis,the NAS model’s knowledge is transferred to the student model with lower computational complexity by the flow of the solution procedure(FSP)distillation method.Thus,the decline of recognition accuracy caused by the direct compression of model structural parameters can be effectively avoided,and the ultralightweight STIIARM can be obtained.In the method,the Inverted Linear Bottleneck(ILB)and Inverted Residual Block(IRB)are firstly taken as each block’s basic structure in CNN.And the expansion ratio,output filter size,number of IRBs,and convolution kernel size are set as the search parameters to construct a hierarchical decomposition search space.Then,the recognition accuracy and computational complexity are taken as the objective function and constraint conditions,respectively,and the global optimization model of the CNN architecture search is established.Next,the simulated annealing(SA)algorithm is used as the search strategy to search out the lightweight and high accuracy STIIARM directly.After that,based on the three principles of similar block structure,the same corresponding channel number,and the minimum computational complexity,the more lightweight student model is designed,and the FSP matrix pairing between the NAS model and student model is completed.Finally,by minimizing the loss between the FSP matrix pairs of the NAS model and student model,the student model’s weight adjustment is completed.Thus the ultra-lightweight and high accuracy STIIARM is obtained.The proposed method’s effectiveness is verified by the simulation experiments on the ISAR image dataset of five types of space targets. 展开更多
关键词 Space target ISAR image Neural architecture search Knowledge distillation lightweight model
在线阅读 下载PDF
Improved lightweight road damage detection based on YOLOv5
8
作者 LIU Chang SUN Yu +2 位作者 CHEN Jin YANG Jing WANG Fengchao 《Optoelectronics Letters》 2025年第5期314-320,共7页
There is a problem of real-time detection difficulty in road surface damage detection. This paper proposes an improved lightweight model based on you only look once version 5(YOLOv5). Firstly, this paper fully utilize... There is a problem of real-time detection difficulty in road surface damage detection. This paper proposes an improved lightweight model based on you only look once version 5(YOLOv5). Firstly, this paper fully utilized the convolutional neural network(CNN) + ghosting bottleneck(G_bneck) architecture to reduce redundant feature maps. Afterwards, we upgraded the original upsampling algorithm to content-aware reassembly of features(CARAFE) and increased the receptive field. Finally, we replaced the spatial pyramid pooling fast(SPPF) module with the basic receptive field block(Basic RFB) pooling module and added dilated convolution. After comparative experiments, we can see that the number of parameters and model size of the improved algorithm in this paper have been reduced by nearly half compared to the YOLOv5s. The frame rate per second(FPS) has been increased by 3.25 times. The mean average precision(m AP@0.5: 0.95) has increased by 8%—17% compared to other lightweight algorithms. 展开更多
关键词 road surface damage detection convolutional neural network feature maps convolutional neural network cnn lightweight model yolov improved lightweight model spatial pyram
原文传递
Cephalopods Classification Using Fine Tuned Lightweight Transfer Learning Models
9
作者 P.Anantha Prabha G.Suchitra R.Saravanan 《Intelligent Automation & Soft Computing》 SCIE 2023年第3期3065-3079,共15页
Cephalopods identification is a formidable task that involves hand inspection and close observation by a malacologist.Manual observation and iden-tification take time and are always contingent on the involvement of expe... Cephalopods identification is a formidable task that involves hand inspection and close observation by a malacologist.Manual observation and iden-tification take time and are always contingent on the involvement of experts.A system is proposed to alleviate this challenge that uses transfer learning techni-ques to classify the cephalopods automatically.In the proposed method,only the Lightweight pre-trained networks are chosen to enable IoT in the task of cephalopod recognition.First,the efficiency of the chosen models is determined by evaluating their performance and comparing thefindings.Second,the models arefine-tuned by adding dense layers and tweaking hyperparameters to improve the classification of accuracy.The models also employ a well-tuned Rectified Adam optimizer to increase the accuracy rates.Third,Adam with Gradient Cen-tralisation(RAdamGC)is proposed and used infine-tuned models to reduce the training time.The framework enables an Internet of Things(IoT)or embedded device to perform the classification tasks by embedding a suitable lightweight pre-trained network.Thefine-tuned models,MobileNetV2,InceptionV3,and NASNet Mobile have achieved a classification accuracy of 89.74%,87.12%,and 89.74%,respectively.Thefindings have indicated that thefine-tuned models can classify different kinds of cephalopods.The results have also demonstrated that there is a significant reduction in the training time with RAdamGC. 展开更多
关键词 CEPHALOPODS transfer learning lightweight models classification deep learning fish IOT
在线阅读 下载PDF
MSAMamba-UNet:A Lightweight Multi-Scale Adaptive Mamba Network for Skin Lesion Segmentation
10
作者 Shouming Hou Jianchao Hou +2 位作者 Yuteng Pang Aoyu Xia Beibei Hou 《Journal of Bionic Engineering》 2025年第6期3209-3225,共17页
Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion siz... Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion sizes.To overcome these limitations,we introduce MSAMamba-UNet,a lightweight model that integrates two novel architectures:Multi-Scale Mamba(MSMamba)and Adaptive Dynamic Gating Block(ADGB).MSMamba utilizes multi-scale decomposition and a parallel hierarchical structure to enhance the delineation of irregular lesion boundaries and sensitivity to small targets.ADGB dynamically selects convolutional kernels with varying receptive fields based on input features,improving the model’s capacity to accommodate diverse lesion textures and scales.Additionally,we introduce a Mix Attention Fusion Block(MAF)to enhance shallow feature representation by integrating parallel channel and pixel attention mechanisms.Extensive evaluation of MSAMamba-UNet on the ISIC 2016,ISIC 2017,and ISIC 2018 datasets demonstrates competitive segmentation accuracy with only 0.056 M parameters and 0.069 GFLOPs.Our experiments revealed that MSAMamba-UNet achieved IoU scores of 85.53%,85.47%,and 82.22%,as well as DSC scores of 92.20%,92.17%,and 90.24%,respectively.These results underscore the lightweight design and effectiveness of MSAMamba-UNet. 展开更多
关键词 TRANSFORMER Segmenting skin lesions Mamba lightweight model MULTI-SCALE
在线阅读 下载PDF
Lightweight Residual Multi-Head Convolution with Channel Attention(ResMHCNN)for End-to-End Classification of Medical Images
11
作者 Sudhakar Tummala Sajjad Hussain Chauhdary +3 位作者 Vikash Singh Roshan Kumar Seifedine Kadry Jungeun Kim 《Computer Modeling in Engineering & Sciences》 2025年第9期3585-3605,共21页
Lightweight deep learning models are increasingly required in resource-constrained environments such as mobile devices and the Internet of Medical Things(IoMT).Multi-head convolution with channel attention can facilit... Lightweight deep learning models are increasingly required in resource-constrained environments such as mobile devices and the Internet of Medical Things(IoMT).Multi-head convolution with channel attention can facilitate learning activations relevant to different kernel sizes within a multi-head convolutional layer.Therefore,this study investigates the capability of novel lightweight models incorporating residual multi-head convolution with channel attention(ResMHCNN)blocks to classify medical images.We introduced three novel lightweight deep learning models(BT-Net,LCC-Net,and BC-Net)utilizing the ResMHCNN block as their backbone.These models were crossvalidated and tested on three publicly available medical image datasets:a brain tumor dataset from Figshare consisting of T1-weighted magnetic resonance imaging slices of meningioma,glioma,and pituitary tumors;the LC25000 dataset,which includes microscopic images of lung and colon cancers;and the BreaKHis dataset,containing benign and malignant breast microscopic images.The lightweight models achieved accuracies of 96.9%for 3-class brain tumor classification using BT-Net,and 99.7%for 5-class lung and colon cancer classification using LCC-Net.For 2-class breast cancer classification,BC-Net achieved an accuracy of 96.7%.The parameter counts for the proposed lightweight models—LCC-Net,BC-Net,and BT-Net—are 0.528,0.226,and 1.154 million,respectively.The presented lightweight models,featuring ResMHCNN blocks,may be effectively employed for accurate medical image classification.In the future,these models might be tested for viability in resource-constrained systems such as mobile devices and IoMT platforms. 展开更多
关键词 lightweight models brain tumor breast cancer lung cancer colon cancer multi-head CNN
在线阅读 下载PDF
Mineral identification in thin sections using a lightweight and attention mechanism
12
作者 Xin Zhang Wei Dang +4 位作者 Jun Liu Zijuan Yin Guichao Du Yawen He Yankai Xue 《Natural Gas Industry B》 2025年第2期135-146,共12页
Mineral identification is foundational to geological survey research,mineral resource exploration,and mining engineering.Considering the diversity of mineral types and the challenge of achieving high recognition accur... Mineral identification is foundational to geological survey research,mineral resource exploration,and mining engineering.Considering the diversity of mineral types and the challenge of achieving high recognition accuracy for similar features,this study introduces a mineral detection method based on YOLOv8-SBI.This work enhances feature extraction by integrating spatial pyramid pooling-fast(SPPF)with the simplified self-attention module(SimAM),significantly improving the precision of mineral feature detection.In the feature fusion network,a weighted bidirectional feature pyramid network is employed for advanced cross-channel feature integration,effectively reducing feature redundancy.Additionally,Inner-Intersection Over Union(InnerIOU)is used as the loss function to improve the average quality localization performance of anchor boxes.Experimental results show that the YOLOv8-SBI model achieves an accuracy of 67.9%,a recall of 74.3%,a mAP@0.5 of 75.8%,and a mAP@0.5:0.95 of 56.7%,with a real-time detection speed of 244.2 frames per second.Compared to YOLOv8,YOLOv8-SBI demonstrates a significant improvement with 15.4%increase in accuracy,28.5%increase in recall,and increases of 28.1%and 20.9%in mAP@0.5 and mAP@0.5:0.95,respectively.Furthermore,relative to other models,such as YOLOv3,YOLOv5,YOLOv6,YOLOv8,YOLOv9,and YOLOv10,YOLOv8-SBI has a smaller parameter size of only 3.01×10^(6).This highlights the optimal balance between detection accuracy and speed,thereby offering robust technical support for intelligent mineral classification. 展开更多
关键词 Deep learning Neural networks lightweight models Attention mechanisms Mineral identification
在线阅读 下载PDF
Efficient and lightweight 3D building reconstruction from drone imagery using sparse line and point clouds
13
作者 Xiongjie YIN Jinquan HE Zhanglin CHENG 《虚拟现实与智能硬件(中英文)》 2025年第2期111-126,共16页
Efficient three-dimensional(3D)building reconstruction from drone imagery often faces data acquisition,storage,and computational challenges because of its reliance on dense point clouds.In this study,we introduced a n... Efficient three-dimensional(3D)building reconstruction from drone imagery often faces data acquisition,storage,and computational challenges because of its reliance on dense point clouds.In this study,we introduced a novel method for efficient and lightweight 3D building reconstruction from drone imagery using line clouds and sparse point clouds.Our approach eliminates the need to generate dense point clouds,and thus significantly reduces the computational burden by reconstructing 3D models directly from sparse data.We addressed the limitations of line clouds for plane detection and reconstruction by using a new algorithm.This algorithm projects 3D line clouds onto a 2D plane,clusters the projections to identify potential planes,and refines them using sparse point clouds to ensure an accurate and efficient model reconstruction.Extensive qualitative and quantitative experiments demonstrated the effectiveness of our method,demonstrating its superiority over existing techniques in terms of simplicity and efficiency. 展开更多
关键词 3D reconstruction Line clouds Sparse clouds lightweight models
在线阅读 下载PDF
AW-HRNet:A Lightweight High-Resolution Crack Segmentation Network Integrating Spatial Robustness and Frequency-Domain Enhancement
14
作者 Dewang Ma Tong Lu 《Journal of Electronic Research and Application》 2025年第6期7-17,共11页
The study presents AW-HRNet,a lightweight high-resolution crack segmentation network that couples Adaptive residual enhancement(AREM)in the spatial domain with Wavelet-based decomposition-reconstruction(WDRM)in the fr... The study presents AW-HRNet,a lightweight high-resolution crack segmentation network that couples Adaptive residual enhancement(AREM)in the spatial domain with Wavelet-based decomposition-reconstruction(WDRM)in the frequency domain.AREM introduces a learnable channel-wise scaling after standard 3×3 convolution and merges it through a residual path to stabilize crack-sensitive responses while suppressing noise.WDRM performs DWT to decouple LL/LH/HL/HH sub-bands,conducts lightweight cross-band fusion,and applies IDWT to restore detail-enhanced features,unifying global topology and boundary sharpness without deformable offsets.Integrated into a high-resolution backbone with auxiliary deep supervision,AW-HRNet attains 79.07%mIoU on CrackSeg9k with only 1.24M parameters and 0.73 GFLOPs,offering an excellent accuracy-efficiency trade-off and strong robustness for real-world deployment. 展开更多
关键词 Crack segmentation lightweight model Wavelet decomposition and reconstruction Feature enhancement
在线阅读 下载PDF
Lightweight Classroom Student Action Recognition Method Based on Spatiotemporal Multimodal Feature Fusion
15
作者 Shaodong Zou Di Wu +2 位作者 Jianhou Gan Juxiang Zhou Jiatian Mei 《Computers, Materials & Continua》 2025年第4期1101-1116,共16页
The task of student action recognition in the classroom is to precisely capture and analyze the actions of students in classroom videos,providing a foundation for realizing intelligent and accurate teaching.However,th... The task of student action recognition in the classroom is to precisely capture and analyze the actions of students in classroom videos,providing a foundation for realizing intelligent and accurate teaching.However,the complex nature of the classroom environment has added challenges and difficulties in the process of student action recognition.In this research article,with regard to the circumstances where students are prone to be occluded and classroom computing resources are restricted in real classroom scenarios,a lightweight multi-modal fusion action recognition approach is put forward.This proposed method is capable of enhancing the accuracy of student action recognition while concurrently diminishing the number of parameters of the model and the Computation Amount,thereby achieving a more efficient and accurate recognition performance.In the feature extraction stage,this method fuses the keypoint heatmap with the RGB(Red-Green-Blue color model)image.In order to fully utilize the unique information of different modalities for feature complementarity,a Feature Fusion Module(FFE)is introduced.The FFE encodes and fuses the unique features of the two modalities during the feature extraction process.This fusion strategy not only achieves fusion and complementarity between modalities,but also improves the overall model performance.Furthermore,to reduce the computational load and parameter scale of the model,we use keypoint information to crop RGB images.At the same time,the first three networks of the lightweight feature extraction network X3D are used to extract dual-branch features.These methods significantly reduce the computational load and parameter scale.The number of parameters of the model is 1.40 million,and the computation amount is 5.04 billion floating-point operations per second(GFLOPs),achieving an efficient lightweight design.In the Student Classroom Action Dataset(SCAD),the accuracy of the model is 88.36%.In NTU 60(Nanyang Technological University Red-Green-Blue-Depth RGB+Ddataset with 60 categories),the accuracies on X-Sub(The people in the training set are different from those in the test set)and X-View(The perspectives of the training set and the test set are different)are 95.76%and 98.82%,respectively.On the NTU 120 dataset(Nanyang Technological University Red-Green-Blue-Depth dataset with 120 categories),RGB+Dthe accuracies on X-Sub and X-Set(the perspectives of the training set and the test set are different)are 91.97%and 93.45%,respectively.The model has achieved a balance in terms of accuracy,computation amount,and the number of parameters. 展开更多
关键词 Action recognition student classroom action multimodal fusion lightweight model design
在线阅读 下载PDF
Brittleness Generation Mechanism and Failure Model of High Strength Lightweight Aggregate Concrete
16
作者 胡曙光 《Journal of Wuhan University of Technology(Materials Science)》 SCIE EI CAS 2006年第z1期15-18,共4页
The brittleness generation mechanism of high strength lightweight aggregate con-crete(HSLWAC) was presented, and it was indicated that lightweight aggregate was the vulnerable spot, initiating brittleness. Based on th... The brittleness generation mechanism of high strength lightweight aggregate con-crete(HSLWAC) was presented, and it was indicated that lightweight aggregate was the vulnerable spot, initiating brittleness. Based on the analysis of the brittleness failure by the load-deflection curve, the brittleness presented by HSLWAC was more prominent compared with ordinary lightweight aggregate concrete of the same strength grade. The model of brittleness failure was also established. 展开更多
关键词 high strength lightweight aggregate concrete(HSLWAC) BRITTLENESS failure model
在线阅读 下载PDF
基于改进RT-DETR的井盖病害轻量化检测算法
17
作者 孟志永 吴晨曦 +4 位作者 王鹏 张明 张秀清 杨云飞 张龙龙 《计算机工程与应用》 北大核心 2026年第4期238-249,共12页
针对井盖病害检测任务中检测精度和轻量化难以平衡的问题,提出一种基于改进RT-DETR-R18的井盖病害检测算法。设计一种改进的主干网络,结合内容感知混合器(content-aware mixer,CAMixer)模块和CSP架构,提升网络的特征提取能力,并有效减... 针对井盖病害检测任务中检测精度和轻量化难以平衡的问题,提出一种基于改进RT-DETR-R18的井盖病害检测算法。设计一种改进的主干网络,结合内容感知混合器(content-aware mixer,CAMixer)模块和CSP架构,提升网络的特征提取能力,并有效减少模型的计算量。提出DTAB(dilated transformer attention block)模块,通过分组通道自注意力(grouped channel self-attention,G-CSA)避免多尺度下的信息泄露,通过掩码窗口自注意力(masked window self-attention,M-WSA)增强细节特征的提取能力。采用改进的RetBlockC3模块,引入Manhattan自注意力机制,进一步提升模型对局部细节和小目标的捕捉能力。提出一种改进的下采样模块PSConv(pinwheel-shaped convolution),通过多方向卷积核设计和尺度自适应机制,扩大感受野并增强细节区域的检测能力。实验结果表明,与原始RT-DETR-R18模型相比,改进后的RT-DETR在井盖病害数据集上mAP@0.5从86.2%提高到92.0%,计算量从58.6 GFLOPs降到45.1 GFLOPs。在公开数据集RDD2022和NWPU VHR-10上,所提模型相比原始模型在mAP@0.5上分别提高4.7和1.0个百分点。所提算法在保持高精度的基础上,实现轻量化设计,满足井盖病害检测对效率和性能的实际需求。 展开更多
关键词 RT-DETR-R18 井盖病害 轻量化模型 注意力机制
在线阅读 下载PDF
LRM-YOLO:一种面向工业现场的轻量化安全帽佩戴检测方法
18
作者 张新君 王贺桐 张永库 《安全与环境学报》 北大核心 2026年第1期151-159,共9页
为解决复杂工地环境下安全帽佩戴检测模型存在准确率不足与部署困难的问题,提出了一种基于轻量化深度学习的目标检测模型LRM-YOLO。首先,设计了一种轻量级实时监控网络(Lightweight Real-time Monitoring Network,LRMN),结合部分卷积与... 为解决复杂工地环境下安全帽佩戴检测模型存在准确率不足与部署困难的问题,提出了一种基于轻量化深度学习的目标检测模型LRM-YOLO。首先,设计了一种轻量级实时监控网络(Lightweight Real-time Monitoring Network,LRMN),结合部分卷积与多层感知机,引入DropPath机制,减少冗余计算和内存访问,实现高效的特征提取。其次,设计了一种轻量化高效检测头(Lightweight Efficient Detection Head,LED-Head),采用共享卷积和解卷积增强模块,提升特征分辨率,同时结合动态缩放和分布式焦点损失(Distribution Focal Loss,DFL)函数解码技术,进一步优化边界框定位精度。试验结果表明,与YOLOv11n相比,LRM-YOLO在保持检测精度的同时,模型参数量减小28.0%,减少了31.7%的计算开销,模型存储容量缩减25.5%。所提出的检测方法兼具高效性与实用性,为工业现场的智能安全监测提供了可行方案。 展开更多
关键词 安全工程 安全帽检测 轻量化模型 YOLO 部分卷积 智能安全监测
原文传递
结合多模态检测头的小蠹类害虫细粒度识别模型
19
作者 李巨虎 路佳 +2 位作者 徐玉立 李世豪 蔡祥 《农业工程学报》 北大核心 2026年第1期273-283,共11页
为解决小蠹类害虫(Dendroctonus spp)物种多样性高、近缘种形态相似且常同域分布导致的种类鉴定困难问题。该研究提出了能够细粒度识别小蠹虫种类的FGRS-Net(fine-grained recognition for scolytidae network)模型。首先,为缓解样本不... 为解决小蠹类害虫(Dendroctonus spp)物种多样性高、近缘种形态相似且常同域分布导致的种类鉴定困难问题。该研究提出了能够细粒度识别小蠹虫种类的FGRS-Net(fine-grained recognition for scolytidae network)模型。首先,为缓解样本不足导致的识别偏差,该研究自主设计了基于多模态嵌入的检测头模块;其次,为提取跨尺度鉴别特征,利用注意力机制混合模块ACmix(attention convolution mixer)实现了融合特征捕捉;为进一步获取特征并降低参数量,引入了全维度动态卷积模块ODConv(omni-dimensional dynamic convolution)重点关注昆虫细粒度特征;并通过剪枝以及知识蒸馏轻量化模型;为全面评估模型在实际应用中的可靠性,该研究在低照度、模糊及复杂背景遮挡等多种干扰条件下进行了系统的鲁棒性测试,并在不同计算架构的边缘设备上完成了部署验证。试验结果显示,FGRS-Net的平均精度均值达到89.3%,召回率为98%,浮点运算量降低16%,NVIDIA RTX 5090 GPU部署帧率达到289帧/s;双平台开发板部署帧率分别为11、27帧/s。实践表明,FGRS-Net模型具有精确度高和轻量化的优点,相比于现有主流模型具有较好的竞争力,该研究方法可为后续细粒度小蠹虫识别提供参考。 展开更多
关键词 小蠹虫检测 细粒度分类 多模态学习 轻量化模型 动态卷积
在线阅读 下载PDF
基于形态指纹特征的耕地遥感监测轻量化大模型构建
20
作者 唐华俊 吴文斌 +12 位作者 余强毅 史云 段玉林 李文娟 钱建平 宋茜 夏浪 李会宾 苏宝峰 范蓓蕾 胡琼 叶剑秋 张帅 《中国农业科学》 北大核心 2026年第1期78-89,共12页
耕地资源及其利用时空动态事关国家粮食安全、资源安全和生态安全。现阶段耕地遥感监测总体沿用了“数据—(模型)—信息”的科学研究范式,注重影像解译与信息提取过程的模型改进与精度提升,面临“信息海量、知识难求、服务受限”的困境... 耕地资源及其利用时空动态事关国家粮食安全、资源安全和生态安全。现阶段耕地遥感监测总体沿用了“数据—(模型)—信息”的科学研究范式,注重影像解译与信息提取过程的模型改进与精度提升,面临“信息海量、知识难求、服务受限”的困境,难以满足耕地保护利用实际需求,亟待提升科学研究成果对国家重大需求的支撑服务效能。人工智能(artificial intelligence,AI)技术加速推动数据主动检索与分析向智能化的知识服务与赋能转变,大型多模态模型在文本、图像、音频、视频等多模态数据处理中的突出优势,能够有效挖掘各类遥感监测信息和提供智能知识服务。本文在系统分析国内外最新研究进展、全面梳理耕地遥感监测应用需求的基础上,总结了通过耕地形态认知其结构与功能的核心特点,进而提出基于形态指纹特征的耕地遥感监测轻量化大模型构建思路。首先,针对不同主体分析需求,将耕地遥感监测应用场景归纳为4个方面,包括耕地数量和利用、高标准农田建设、耕地质量退化、耕地农情动态,明晰不同场景对监测信息和知识服务的差异化要求;其次,从人类认知的视角出发,解析耕地形态蕴含的“精细信息”和“宏观知识”特征,为耕地遥感监测大模型构建提供新的切入点;最后,结合多模态遥感数据与通用大语言模型,构建具备感知、推理、学习与执行能力的耕地遥感监测人工智能体(AI Agent),强化注意力机制,集中并抓住耕地形态重要特征,构建基于形态指纹特征的遥感监测轻量化大模型,实现“精细信息—宏观知识—智慧决策”融合,解决数据信息产品多但可用性知识服务不足的现实困境。 展开更多
关键词 人工智能 耕地遥感监测 应用场景 形态特征 注意力机制 轻量化大模型
在线阅读 下载PDF
上一页 1 2 70 下一页 到第
使用帮助 返回顶部