期刊文献+
共找到433篇文章
< 1 2 22 >
每页显示 20 50 100
Full Perception Head:Bridging the Gap Between Local and Global Features
1
作者 Jie Hua Zhongyuan Wang +3 位作者 Xin Tian Qin Zou Jinsheng Xiao Jiayi Ma 《IEEE/CAA Journal of Automatica Sinica》 2025年第7期1391-1406,共16页
Object detection is a fundamental task in computer vision that involves identifying and localizing objects within an image.Local features extracted by convolutions,etc.,capture finegrained details such as edges and te... Object detection is a fundamental task in computer vision that involves identifying and localizing objects within an image.Local features extracted by convolutions,etc.,capture finegrained details such as edges and textures,while global features extracted by full connection layers,etc.,represent the overall structure and long-range relationships within the image.These features are crucial for accurate object detection,yet most existing methods focus on aggregating local and global features,often overlooking the importance of medium-range dependencies.To address this gap,we propose a novel full perception module(FPModule),a simple yet effective feature extraction module designed to simultaneously capture local details,medium-range dependencies,and long-range dependencies.Building on this,we construct a full perception head(FP-Head)by cascading multiple FP-Modules,enabling the prediction layer to leverage the most informative features.Experimental results in the MS COCO dataset demonstrate that our approach significantly enhances object recognition and localization,achieving 2.7−5.7 APval gains when integrated into standard object detectors.Notably,the FP-Module is a universal solution that can be seamlessly incorporated into existing detectors to boost performance.The code will be released at https://github.com/Idcogroup/FP-Head. 展开更多
关键词 feature aggregation full perception module medium-range dependencies object detection
在线阅读 下载PDF
Adaptive Multi-Feature Fusion for Vehicle Micro-Motor Noise Recognition Considering Auditory Perception 被引量:1
2
作者 Ting Zhao Weiping Ding +1 位作者 Haibo Huang Yudong Wu 《Sound & Vibration》 EI 2023年第1期133-153,共21页
The deployment of vehicle micro-motors has witnessed an expansion owing to the progression in electrification and intelligent technologies.However,some micro-motors may exhibit design deficiencies,component wear,assem... The deployment of vehicle micro-motors has witnessed an expansion owing to the progression in electrification and intelligent technologies.However,some micro-motors may exhibit design deficiencies,component wear,assembly errors,and other imperfections that may arise during the design or manufacturing phases.Conse-quently,these micro-motors might generate anomalous noises during their operation,consequently exerting a substantial adverse influence on the overall comfort of drivers and passengers.Automobile micro-motors exhibit a diverse array of structural variations,consequently leading to the manifestation of a multitude of distinctive auditory irregularities.To address the identification of diverse forms of abnormal noise,this research presents a novel approach rooted in the utilization of vibro-acoustic fusion-convolutional neural network(VAF-CNN).This method entails the deployment of distinct network branches,each serving to capture disparate features from the multi-sensor data,all the while considering the auditory perception traits inherent in the human auditory sys-tem.The intermediary layer integrates the concept of adaptive weighting of multi-sensor features,thus affording a calibration mechanism for the features hailing from multiple sensors,thereby enabling a further refinement of features within the branch network.For optimal model efficacy,a feature fusion mechanism is implemented in the concluding layer.To substantiate the efficacy of the proposed approach,this paper initially employs an augmented data methodology inspired by modified SpecAugment,applied to the dataset of abnormal noise sam-ples,encompassing scenarios both with and without in-vehicle interior noise.This serves to mitigate the issue of limited sample availability.Subsequent comparative evaluations are executed,contrasting the performance of the model founded upon single-sensor data against other feature fusion models reliant on multi-sensor data.The experimental results substantiate that the suggested methodology yields heightened recognition accuracy and greater resilience against interference.Moreover,it holds notable practical significance in the engineering domain,as it furnishes valuable support for the targeted management of noise emanating from vehicle micro-motors. 展开更多
关键词 Auditory perception MULTI-SENSOR feature adaptive fusion abnormal noise recognition vehicle interior noise
在线阅读 下载PDF
Skeleton-Silhouette Complementary Perception: Toward Robust Gait Recognition
3
作者 Xiaokai Liu Luyuan Hao 《Journal of Clinical and Nursing Research》 2025年第11期372-377,共6页
Gait,the unique pattern of how a person walks,has emerged as one of the most promising biometric features in modern intelligent sensing.Unlike fingerprints or facial characteristics,gait can be captured unobtrusively ... Gait,the unique pattern of how a person walks,has emerged as one of the most promising biometric features in modern intelligent sensing.Unlike fingerprints or facial characteristics,gait can be captured unobtrusively and at a distance,without requiring the subject’s awareness or cooperation.This makes it highly suitable for long-range surveillance,forensic investigation,and smart environments where contactless recognition is crucial.Traditional gait-recognition systems rely either on silhouettes,which capture the outer appearance of a person,or on skeletons,which describe the internal structure of human motion.Each modality provides only a partial understanding of gait.Silhouettes emphasize shape and contour but are easily distorted by clothing or carried objects;skeletons describe motion dynamics and limb coordination but lose discriminative details about body shape.This article presents the concept of Complementary Semantic Embedding(CSE),a unified framework that merges silhouette and skeleton information into a comprehensive semantic representation of human walking.By modeling the complementary nature of appearance and structure,the approach achieves more robust and accurate gait recognition even under challenging conditions. 展开更多
关键词 Complementary perception Gait recognition feature fusion
在线阅读 下载PDF
Visual feature inter-learning for sign language recognition in emergency medicine
4
作者 WEI Chao LI Yunpeng LIU Jingze 《Optoelectronics Letters》 2025年第10期619-625,共7页
Accessible communication based on sign language recognition(SLR)is the key to emergency medical assistance for the hearing-impaired community.Balancing the capture of both local and global information in SLR for emerg... Accessible communication based on sign language recognition(SLR)is the key to emergency medical assistance for the hearing-impaired community.Balancing the capture of both local and global information in SLR for emergency medicine poses a significant challenge.To address this,we propose a novel approach based on the inter-learning of visual features between global and local information.Specifically,our method enhances the perception capabilities of the visual feature extractor by strategically leveraging the strengths of convolutional neural network(CNN),which are adept at capturing local features,and visual transformers which perform well at perceiving global features.Furthermore,to mitigate the issue of overfitting caused by the limited availability of sign language data for emergency medical applications,we introduce an enhanced short temporal module for data augmentation through additional subsequences.Experimental results on three publicly available sign language datasets demonstrate the efficacy of the proposed approach. 展开更多
关键词 sign language recognition slr visual feature inter learning emergency medicine visual feature extractor capture both local global information enhances perception capabilities emergency medical assistance sign language recognition
原文传递
Partition feature extraction of hyperspectral images for in situ intelligent lithology identification
5
作者 Zhenhao Xu Shan Li +2 位作者 Peng Lin Heng Shi Yanfei Lou 《Journal of Rock Mechanics and Geotechnical Engineering》 2025年第12期7736-7752,共17页
Imaging hyperspectral technology has distinctive advantages of non-destructive and non-contact measurement,and the integration of spectral and spatial data.These characteristics present new methodologies for intellige... Imaging hyperspectral technology has distinctive advantages of non-destructive and non-contact measurement,and the integration of spectral and spatial data.These characteristics present new methodologies for intelligent geological sensing in tunnels and other underground engineering projects.However,the in situ acquisition and rapid classification of hyperspectral images in underground still faces great challenges,including the difficulty in obtaining uniform hyperspectral images and the complexity of deploying sophisticated models on mobile platforms.This study proposes an intelligent lithology identification method based on partition feature extraction of hyperspectral images.Firstly,pixel-level hyperspectral information from representative lithological regions is extracted and fused to obtain rock hyperspectral image partition features.Subsequently,an SG-SNV-PCA-DNN(SSPD)model specifically designed for optimizing rock hyperspectral data,performing spectral dimensionality reduction,and identifying lithology is integrated.In an experimental study involving 3420 hyperspectral images,the SSPD identification model achieved the highest accuracy in the testing set,reaching 98.77%.Moreover,the speed of the SSPD model was found to be 18.5%faster than that of the unprocessed model,with an accuracy improvement of 5.22%.In contrast,the ResNet-101 model,used for point-by-point identification based on non-partitioned features,achieved a maximum accuracy of 97.86%in the testing set.In addition,the partition feature extraction methods significantly reduce computational complexity.An objective evaluation of various models demonstrated that the SSPD model exhibited superior performance,achieving a precision(P)of 99.46%,a recall(R)of 99.44%,and F1 score(F1)of 99.45%.Additionally,a pioneering in situ detection work was carried out in a tunnel using underground hyperspectral imaging technology. 展开更多
关键词 In situ lithology identification Hyperspectral image Partition feature extraction Rock hyperspectral Underground intelligent geological perception Geological remote sensing technology
在线阅读 下载PDF
ST-SIGMA:Spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting 被引量:6
6
作者 Yang Fang Bei Luo +3 位作者 Ting Zhao Dong He Bingbing Jiang Qilie Liu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2022年第4期744-757,共14页
Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving(AD)system.However,most proposed methods aim at addressing one of the two challenges... Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving(AD)system.However,most proposed methods aim at addressing one of the two challenges mentioned above with a single model.To tackle this dilemma,this paper proposes spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting(STSIGMA),an efficient end-to-end method to jointly and accurately perceive the AD environment and forecast the trajectories of the surrounding traffic agents within a unified framework.ST-SIGMA adopts a trident encoder-decoder architecture to learn scene semantics and agent interaction information on bird’s-eye view(BEV)maps simultaneously.Specifically,an iterative aggregation network is first employed as the scene semantic encoder(SSE)to learn diverse scene information.To preserve dynamic interactions of traffic agents,ST-SIGMA further exploits a spatio-temporal graph network as the graph interaction encoder.Meanwhile,a simple yet efficient feature fusion method to fuse semantic and interaction features into a unified feature space as the input to a novel hierarchical aggregation decoder for downstream prediction tasks is designed.Extensive experiments on the nuScenes data set have demonstrated that the proposed ST-SIGMA achieves significant improvements compared to the state-of-theart(SOTA)methods in terms of scene perception and trajectory forecasting,respectively.Therefore,the proposed approach outperforms SOTA in terms of model generalisation and robustness and is therefore more feasible for deployment in realworld AD scenarios. 展开更多
关键词 feature fusion graph interaction hierarchical aggregation scene perception scene semantics trajectory forecasting
在线阅读 下载PDF
Regional Evolution Features and Coordinated Development Strategies for Northeast China 被引量:3
7
作者 MEI Lin XU Xiaopo CHEN Mingxiu 《Chinese Geographical Science》 SCIE CSCD 2006年第4期378-382,共5页
Northeast China, as the most important production base of agriculture, forestry, and livestock-breeding as well as the old industrial base in the whole country, has been playin a key role in the construction and deve... Northeast China, as the most important production base of agriculture, forestry, and livestock-breeding as well as the old industrial base in the whole country, has been playin a key role in the construction and development of China's economy. However, after the policy of reform and open-up was taken in China. the economic development speed and efficiency ofthis area have turned to be evidently lower than those of coastal area and the national average level as well, which is so-called 'Northeast Phenomenon' and 'Neo-Northeast Phenomenon'. In terms of those phenomena, this paper firstly reviews the spatial and temporal features of the regional evolution of this area so as to unveil the profound forming causes of 'Northeast Phenomena' and 'Neo-Northeast Phenomena'. And then the paper makes a further exploration into the status quo of this region and its forming causes by analyzing its economy gross, industrial structure, product structure, regional eco-categories, etc. At the end of the paper, the authors put forward the basic coordinated development strategies for Northeast China. namely we can revitalize this area by means of adjustment of economic structure, regional coordination, planning urban and rural areas as a whole, institutional innovation, etc. 展开更多
关键词 regional evolution spatial-temporal feature coordinated development strategy Northeast China
在线阅读 下载PDF
STGSA:A Novel Spatial-Temporal Graph Synchronous Aggregation Model for Traffic Prediction 被引量:3
8
作者 Zebing Wei Hongxia Zhao +5 位作者 Zhishuai Li Xiaojie Bu Yuanyuan Chen Xiqiao Zhang Yisheng Lv Fei-Yue Wang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第1期226-238,共13页
The success of intelligent transportation systems relies heavily on accurate traffic prediction,in which how to model the underlying spatial-temporal information from traffic data has come under the spotlight.Most exi... The success of intelligent transportation systems relies heavily on accurate traffic prediction,in which how to model the underlying spatial-temporal information from traffic data has come under the spotlight.Most existing frameworks typically utilize separate modules for spatial and temporal correlations modeling.However,this stepwise pattern may limit the effectiveness and efficiency in spatial-temporal feature extraction and cause the overlook of important information in some steps.Furthermore,it is lacking sufficient guidance from prior information while modeling based on a given spatial adjacency graph(e.g.,deriving from the geodesic distance or approximate connectivity),and may not reflect the actual interaction between nodes.To overcome those limitations,our paper proposes a spatial-temporal graph synchronous aggregation(STGSA)model to extract the localized and long-term spatial-temporal dependencies simultaneously.Specifically,a tailored graph aggregation method in the vertex domain is designed to extract spatial and temporal features in one graph convolution process.In each STGSA block,we devise a directed temporal correlation graph to represent the localized and long-term dependencies between nodes,and the potential temporal dependence is further fine-tuned by an adaptive weighting operation.Meanwhile,we construct an elaborated spatial adjacency matrix to represent the road sensor graph by considering both physical distance and node similarity in a datadriven manner.Then,inspired by the multi-head attention mechanism which can jointly emphasize information from different r epresentation subspaces,we construct a multi-stream module based on the STGSA blocks to capture global information.It projects the embedding input repeatedly with multiple different channels.Finally,the predicted values are generated by stacking several multi-stream modules.Extensive experiments are constructed on six real-world datasets,and numerical results show that the proposed STGSA model significantly outperforms the benchmarks. 展开更多
关键词 Deep learning graph neural network(GNN) multistream spatial-temporal feature extraction temporal graph traffic prediction
在线阅读 下载PDF
Spatial-Temporal Characteristics of Regional Extreme Low Temperature Events in China during 1960-2009 被引量:1
9
作者 WANG Xiao-Juan GONG Zhi-Qiang +1 位作者 REN Fu-Min FENG Guo-Lin 《Advances in Climate Change Research》 SCIE 2012年第4期186-194,共9页
An objective identification technique is used to detect regional extreme low temperature events (RELTE) in China during 1960-2009. Their spatial-temporal characteristics are analyzed. The results indicate that the l... An objective identification technique is used to detect regional extreme low temperature events (RELTE) in China during 1960-2009. Their spatial-temporal characteristics are analyzed. The results indicate that the lowest temperatures of RELTE, together with the frequency distribution of the geometric latitude center, exhibit a double-peak feature. The RELTE frequently happen near the geometric area of 30°N and 42°N before the mid-1980s, but shifted afterwards to 30°N. During 1960-2009, the frequency~ intensity, and the maximum impacted area of RELTE show overall decreasing trends. Due to the contribution of RELTE, with long duratioh and large spatial range, which account for 10% of the total RELTE, there is a significant turning point in the late 1980s. A change to a much more steady state after the late 1990s is identified. In addition, the integrated indices of RELTE are classified and analyzed. 展开更多
关键词 regional extreme low temperature events spatial-temporal features turning point frequency distribution
在线阅读 下载PDF
Spatial-Temporal Correlation 3D Vehicle Detection and Tracking System with Multiple Surveillance Cameras
10
作者 薛炜彭 吴明虎 王琳 《Journal of Shanghai Jiaotong university(Science)》 EI 2023年第1期52-60,共9页
Compared to 3D object detection using a single camera,multiple cameras can overcome some limitations on field-of-view,occlusion,and low detection confidence.This study employs multiple surveillance cameras and develop... Compared to 3D object detection using a single camera,multiple cameras can overcome some limitations on field-of-view,occlusion,and low detection confidence.This study employs multiple surveillance cameras and develops a cooperative 3D object detection and tracking framework by incorporating temporal and spatial information.The framework consists of a 3D vehicle detection model,cooperatively spatial-temporal relation scheme,and heuristic camera constellation method.Specifically,the proposed cross-camera association scheme combines the geometric relationship between multiple cameras and objects in corresponding detections.The spatial-temporal method is designed to associate vehicles between different points of view at a single timestamp and fulfill vehicle tracking in the time aspect.The proposed framework is evaluated based on a synthetic cooperative dataset and shows high reliability,where the cooperative perception can recall more than 66%of the trajectory instead of 11%for single-point sensing.This could contribute to full-range surveillance for intelligent transportation systems. 展开更多
关键词 multi-object tracking 3D detection multiple sensors cooperative perception spatial-temporal correlation intelligent transportation system
原文传递
Hyperspectral remote sensing identification of marine oil emulsions based on the fusion of spatial and spectral features
11
作者 Xinyue Huang Yi Ma +1 位作者 Zongchen Jiang Junfang Yang 《Acta Oceanologica Sinica》 SCIE CAS CSCD 2024年第3期139-154,共16页
Marine oil spill emulsions are difficult to recover,and the damage to the environment is not easy to eliminate.The use of remote sensing to accurately identify oil spill emulsions is highly important for the protectio... Marine oil spill emulsions are difficult to recover,and the damage to the environment is not easy to eliminate.The use of remote sensing to accurately identify oil spill emulsions is highly important for the protection of marine environments.However,the spectrum of oil emulsions changes due to different water content.Hyperspectral remote sensing and deep learning can use spectral and spatial information to identify different types of oil emulsions.Nonetheless,hyperspectral data can also cause information redundancy,reducing classification accuracy and efficiency,and even overfitting in machine learning models.To address these problems,an oil emulsion deep-learning identification model with spatial-spectral feature fusion is established,and feature bands that can distinguish between crude oil,seawater,water-in-oil emulsion(WO),and oil-in-water emulsion(OW)are filtered based on a standard deviation threshold–mutual information method.Using oil spill airborne hyperspectral data,we conducted identification experiments on oil emulsions in different background waters and under different spatial and temporal conditions,analyzed the transferability of the model,and explored the effects of feature band selection and spectral resolution on the identification of oil emulsions.The results show the following.(1)The standard deviation–mutual information feature selection method is able to effectively extract feature bands that can distinguish between WO,OW,oil slick,and seawater.The number of bands was reduced from 224 to 134 after feature selection on the Airborne Visible Infrared Imaging Spectrometer(AVIRIS)data and from 126 to 100 on the S185 data.(2)With feature selection,the overall accuracy and Kappa of the identification results for the training area are 91.80%and 0.86,respectively,improved by 2.62%and 0.04,and the overall accuracy and Kappa of the identification results for the migration area are 86.53%and 0.80,respectively,improved by 3.45%and 0.05.(3)The oil emulsion identification model has a certain degree of transferability and can effectively identify oil spill emulsions for AVIRIS data at different times and locations,with an overall accuracy of more than 80%,Kappa coefficient of more than 0.7,and F1 score of 0.75 or more for each category.(4)As the spectral resolution decreasing,the model yields different degrees of misclassification for areas with a mixed distribution of oil slick and seawater or mixed distribution of WO and OW.Based on the above experimental results,we demonstrate that the oil emulsion identification model with spatial–spectral feature fusion achieves a high accuracy rate in identifying oil emulsion using airborne hyperspectral data,and can be applied to images under different spatial and temporal conditions.Furthermore,we also elucidate the impact of factors such as spectral resolution and background water bodies on the identification process.These findings provide new reference for future endeavors in automated marine oil spill detection. 展开更多
关键词 oil emulsions IDENTIFICATION hyperspectral remote sensing feature selection convolutional neural network(CNN) spatial-temporal transferability
在线阅读 下载PDF
基于稀疏点匹配的协同式未知目标跟踪方法
12
作者 郎荣玲 魏才伦 +1 位作者 范亚 高飞 《航空学报》 北大核心 2026年第3期83-95,共13页
对未知目标的实时感知与持续跟踪是智能系统自主决策的重要前提,在实际应用中存在缺乏目标类别先验信息和训练样本匮乏的问题,使得未知目标的感知与跟踪更具挑战性。针对此问题,提出了一种基于任意分割模型(SAM)与稀疏特征点匹配的未知... 对未知目标的实时感知与持续跟踪是智能系统自主决策的重要前提,在实际应用中存在缺乏目标类别先验信息和训练样本匮乏的问题,使得未知目标的感知与跟踪更具挑战性。针对此问题,提出了一种基于任意分割模型(SAM)与稀疏特征点匹配的未知目标跟踪方法。该方法首先通过提示点引导SAM模型感知并分割图像中的未知目标,随后利用基于卷积神经网络的特征点提取模型,获取目标图像的稀疏特征点作为目标信息,并通过基于注意力机制的匹配网络在后续帧中匹配这些特征点,完成目标信息传播。在此基础上,设计了一个基于特征点一致性的迭代式SAM模块(ISPC),利用匹配的特征点持续引导SAM模型对后续图像帧的目标进行分割,从而实现未知目标的稳定跟踪。此外基于稀疏特征点的轻量化目标信息,可以在多智能体之间高效共享,构建了一个协同式目标跟踪系统。在DAVIS 2017数据集和自构建的近红外视频数据集上,评估了系统的目标跟踪性能与零训练样本目标的泛化能力。实验结果表明,该方法在处理未知类别目标的协同感知与跟踪任务中,表现出良好的鲁棒性和准确性。 展开更多
关键词 目标跟踪 目标分割 特征提取 特征匹配 协同感知
原文传递
基于红外传感远程监控的电力系统发热风险自动感知
13
作者 黎锐烽 黄国泳 +1 位作者 刘颖 黄杰辉 《传感技术学报》 北大核心 2026年第1期221-226,共6页
电力设备因长时间作业,内部容易出现发热问题,若不能及时发现,轻则零件部分损坏,重则大面积停电。当前电力系统发热风险检测主要采用人工巡检的方式,作业速度慢,且易出错,为此,提出基于红外传感远程监控的电力系统发热风险自动感知方法... 电力设备因长时间作业,内部容易出现发热问题,若不能及时发现,轻则零件部分损坏,重则大面积停电。当前电力系统发热风险检测主要采用人工巡检的方式,作业速度慢,且易出错,为此,提出基于红外传感远程监控的电力系统发热风险自动感知方法。考虑到红外传感远程监控图像分辨率较低,通过仿射变换将红外监控图像转换成可见光图像,利用速度增强的稳定特征(Speeded-Up Robust Features,SURF)算子、最佳箱优先搜索(Best Bin First,BBF)算法匹配图像特征点,并通过二次规划对偶问题找出特征点最佳分类超平面,确定图像发热风险区域,完成电力系统发热风险自动感知。实验结果表明,所提方法的发热风险点检测误差保持在0.1℃内,且整体耗时低于6 ms。 展开更多
关键词 红外传感 发热风险自动感知 远程监控 特征点匹配 支持向量机 超平面
在线阅读 下载PDF
边缘计算环境下多模态人工智能感知方法研究
14
作者 王雅隽 周萍 《智能物联技术》 2026年第3期85-88,共4页
边缘计算架构为多模态人工智能感知提供了低延迟与高效率的实现路径。通过在边缘节点部署轻量化神经网络,采用模型剪枝、量化及知识蒸馏等压缩技术,模型体积降至15 MB,推理延迟控制在95 ms。结合注意力融合机制与分布式协同推理策略,多... 边缘计算架构为多模态人工智能感知提供了低延迟与高效率的实现路径。通过在边缘节点部署轻量化神经网络,采用模型剪枝、量化及知识蒸馏等压缩技术,模型体积降至15 MB,推理延迟控制在95 ms。结合注意力融合机制与分布式协同推理策略,多模态感知精度提高至91.3%,在智能制造与自动驾驶等场景中有效降低了网络带宽占用和系统响应延迟,具有重要的实用价值。 展开更多
关键词 边缘计算 多模态感知 轻量化 特征融合
在线阅读 下载PDF
关键特征感知并行细粒度特征提取的密集行人检测
15
作者 齐向明 刘晓暄 王子键 《计算机工程与应用》 北大核心 2026年第1期297-306,共10页
针对密集行人检测中存在目标密集且相互遮挡问题,提出一种关键特征感知并行细粒度特征提取的密集行人检测算法。以YOLOv8n为基线网络,在vision Transformer中再次加入自注意力机制得到DS-ViT(dual search)深度感知全局关键特征,优化CBS... 针对密集行人检测中存在目标密集且相互遮挡问题,提出一种关键特征感知并行细粒度特征提取的密集行人检测算法。以YOLOv8n为基线网络,在vision Transformer中再次加入自注意力机制得到DS-ViT(dual search)深度感知全局关键特征,优化CBS使用3个3×3Conv,设计双支路加入空间注意力机制得到FE-Conv(feature enhance)增强空间和通道双重特征提取局部细粒度,DS-ViT与FE-Conv并行重构主干网络,增强多尺度特征提取能力;颈部网络输入端嵌入空间注意力机制,增强多层次特征融合;检测网络新增三个卷积层,删减20×20检测头,降低漏检和错检率。在自制数据集上做消融实验和对比实验,与基线网络对比,mAP、Precision、Recall、IoU和FPS分别提高5.4个百分点、4.9个百分点、6.4个百分点、6.2个百分点和6.2,Parameters值仅增加1×10^(5),表明该算法有良好表现。在公开数据集WiderPerson上做泛化实验,较基线网络平均检测精度提升1.6个百分点,表明该算法具备较好鲁棒性。 展开更多
关键词 密集行人检测 关键特征深度感知 细粒度特征双重提取 YOLOv8n 视觉变换器(ViT) CBS
在线阅读 下载PDF
基于改进YOLOv8的钢材表面缺陷检测
16
作者 李昊璇 孙宇翔 《测试技术学报》 2026年第1期1-9,共9页
为应对钢材表面缺陷检测中存在的缺陷种类多元、尺寸跨度大、检测精度较低及模型泛化能力不足等挑战,提出了一种钢材表面缺陷检测改进算法,命名为CGP-YOLO。首先,在Neck部分,使用内容感知特征重组(Content-Aware Reassembly of Features... 为应对钢材表面缺陷检测中存在的缺陷种类多元、尺寸跨度大、检测精度较低及模型泛化能力不足等挑战,提出了一种钢材表面缺陷检测改进算法,命名为CGP-YOLO。首先,在Neck部分,使用内容感知特征重组(Content-Aware Reassembly of Features,CARAFE)替换最近邻插值算子,解决了上采样特征图出现的块状效应、细节丢失严重等问题;其次,在Head前引入全局注意力机制(Global Attention Mechanism,GAM),通过调控通道和空间特征的交互增强了模型的表征能力;最后,引入可编程梯度信息(Programmable Gradient Information,PGI)模块,通过其多级辅助信息组件逐步整合不同尺度的特征,有效提高了模型对不同尺度缺陷敏感性。与初始算法相比,算法CGP-YOLO性能显著提升,平均精度均值达到了80.8%,高于原算法3.6百分点。 展开更多
关键词 YOLOv8 钢材表面缺陷检测 内容感知特征重组 全局注意力机制 可编程梯度信息
在线阅读 下载PDF
基于改进Otsu算法的轴承图像阈值分割方法
17
作者 赵宇航 吴超华 +2 位作者 王鑫 张晟琦 史晓亮 《机电工程》 北大核心 2026年第1期34-44,共11页
针对轴承装配后质检时,工业相机采集到的图像存在混合噪声和背光源过曝,导致图像分割精度不高、影响后续处理的问题,提出了一种基于多特征感知与贝叶斯优化的改进大津算法(Otsu)的轴承图像阈值分割方法。首先,在图像感知层面,读取、计... 针对轴承装配后质检时,工业相机采集到的图像存在混合噪声和背光源过曝,导致图像分割精度不高、影响后续处理的问题,提出了一种基于多特征感知与贝叶斯优化的改进大津算法(Otsu)的轴承图像阈值分割方法。首先,在图像感知层面,读取、计算了图像的灰度值、梯度幅值、局部二值模型并完成了储存,对三特征数据分别进行了归一化处理,加权融合了三特征数据,得到了轴承图像的加权融合图;然后,在计算效率层面,引入了贝叶斯优化与分块动态规划方法,替代了Otsu的穷举法;最后,结合感知方法与加速方法,得到了一种基于多特征感知与贝叶斯优化的改进Otsu算法(MFB-Otsu),并与其他阈值分割算法进行了性能对比实验。研究结果表明:与Otsu算法、自适应分割算法对比,MFB-Otsu算法在保留了轴承图像全局亮度分布的基础上,强化了外轮廓边缘,抑制了边缘噪声,输出图像边缘平滑,背景与前景彻底分离;在实测边缘提取中,边缘信息保留完整,噪点干扰率相较于Otsu降低了55%;在客观图像评价指标方面,该算法的分割准确率、交并比、F1值分别达到0.997、0.989、0.994,优于Otsu算法和自适应分割算法;在计算效率层面,相较于Otsu提高了12.9%。改进Otsu算法在轴承尺寸检测中表现出良好的通用性与稳定性,具有一定的工程应用价值。 展开更多
关键词 轴承装配 图像处理 图像分割 大津算法 多特征感知和贝叶斯优化 Otsu加速方法 三维度特征加权 分块动态优化
在线阅读 下载PDF
跨域交互与感知增强图长短时记忆网络的脑电情绪识别方法
18
作者 赵超莹 黄鑫 +3 位作者 龙恳 郭中原 龙虹毓 陈昌川 《西南大学学报(自然科学版)》 北大核心 2026年第4期167-181,共15页
针对脑电信号(Electroencephalogram,EEG)情绪识别中存在的非欧几里得空间拓扑结构复杂、单一特征域表征信息不全以及时序动态变化显著等问题,构建了一种跨域交互与感知增强图长短时记忆网络(Cross-Domain Perception-Enhanced Graph Bi... 针对脑电信号(Electroencephalogram,EEG)情绪识别中存在的非欧几里得空间拓扑结构复杂、单一特征域表征信息不全以及时序动态变化显著等问题,构建了一种跨域交互与感知增强图长短时记忆网络(Cross-Domain Perception-Enhanced Graph Bi-LSTM,CD-PEBL)模型。该模型首先提取时域微分熵(Differential Entropy,DE)与频域功率谱密度(Power Spectral Density,PSD)特征,并在跨域特征交互模块(Cross-Domain Feature Interaction Module,CDFI)中通过交叉注意力机制对齐融合时频互补信息,并进一步结合多维度校准机制生成跨域增强特征。其次,构建感知增强自适应图融合模块(Perception-Enhanced Adaptive Graph Fusion Module,PEAGF),利用皮尔逊相关系数(Pearson Correlation Coefficient,PCC)与相位锁定值(Phase Locking Value,PLV)刻画通道间的功能连接,并在多视图动态图建模框架下提取多尺度空间拓扑表征,通过感知增强门控机制自适应融合不同视图的特征。随后,将融合后的时空特征序列输入基于双向长短时记忆网络(Bi-directional Long Short-Term Memory,Bi-LSTM)的时序依赖建模模块(Bi-directional Temporal Dependency Modeling Module,BTDM),以捕捉与情绪相关的双向时序关联。最后,通过多层感知机(Multi-Layer Perceptron,MLP)输出情绪类别。基于DEAP数据集和SEED数据集的实验结果表明,所提方法能够有效提升情绪识别的准确率,并在多项指标上优于现有主流模型。 展开更多
关键词 脑电信号 情绪识别 跨域特征交互 图神经网络 感知增强 双向长短时记忆网络
原文传递
音乐情绪的环境调控:基于虚拟现实和脑电的频段特异性研究
19
作者 周燕彬 黄敏 +4 位作者 江乐旗 王铭勋 高婷婷 陈俊名 金严君 《生物医学工程学杂志》 北大核心 2026年第1期61-69,78,共10页
音乐情绪感知作为解码人类情感本质的关键路径,其调控机制的解析对发展精准神经调控策略至关重要。尽管脑电(EEG)信号可捕捉音乐情绪加工的动态神经活动,并可结合虚拟现实(VR)技术在音乐情绪调节中起到沉浸式增效作用,但VR环境—音乐情... 音乐情绪感知作为解码人类情感本质的关键路径,其调控机制的解析对发展精准神经调控策略至关重要。尽管脑电(EEG)信号可捕捉音乐情绪加工的动态神经活动,并可结合虚拟现实(VR)技术在音乐情绪调节中起到沉浸式增效作用,但VR环境—音乐情绪—神经活动的关联机制尚未明晰。本研究构建“VR环境—音乐刺激—EEG信号响应”多模态试验范式,通过多频段特征分析方法,系统解析不同虚拟场景切换中音乐情绪感知的神经动态规律。本文研究结果表明,大脑右颞叶在现实与虚拟场景对比中呈现明显的电位变化,后部脑区对不同虚拟场景敏感;而环境对EEG信号低频和高频活动产生了特异性调制,且δ频段能量占比呈现音乐情绪感知与环境的依赖性分化。本研究通过虚拟场景调控音乐情绪感知试验,系统揭示了环境因素对音乐情绪的频段特异性调制效应,确立了δ频段的能量占比作为环境—情绪交互的关键生物标志物,为沉浸式情绪调节策略的开发和临床心理干预提供了重要的理论依据与量化评估方法。 展开更多
关键词 音乐情绪感知 虚拟现实 脑电信号 多频段特征分析
原文传递
融合傅里叶卷积与差异感知的钢材表面微小缺陷检测算法
20
作者 张胜伟 曹洁 《广西师范大学学报(自然科学版)》 北大核心 2026年第2期90-102,共13页
针对当前钢材表面缺陷检测方法对微小缺陷检测效果不佳的问题,本文提出一种融合傅里叶卷积与差异感知的钢材表面微小缺陷检测算法。该算法使用CSP-FFCM替换主干网络中的BasicBlock,通过在空间域和频域中进行卷积操作,以减少计算开销并... 针对当前钢材表面缺陷检测方法对微小缺陷检测效果不佳的问题,本文提出一种融合傅里叶卷积与差异感知的钢材表面微小缺陷检测算法。该算法使用CSP-FFCM替换主干网络中的BasicBlock,通过在空间域和频域中进行卷积操作,以减少计算开销并提升网络的特征提取能力。然后,提出多尺度特征层优化策略,在保留细粒度特征信息的同时,优化计算资源分配,确保模型对微小缺陷细节信息的有效捕捉。最后,设计差异感知特征增强模块,通过强化微小缺陷的特征表示能力,进一步提升模型对微小缺陷的检测性能。实验结果表明,本文算法在NEU-DET和GC10-DET数据集上mAP分别达到83.7%和73.1%,在钢材表面微小缺陷的高精度检测任务中表现出显著的性能优势。 展开更多
关键词 微小缺陷检测 傅里叶卷积 多尺度特征层优化 差异感知
在线阅读 下载PDF
上一页 1 2 22 下一页 到第
使用帮助 返回顶部