期刊文献+
共找到417篇文章
< 1 2 21 >
每页显示 20 50 100
Full Perception Head:Bridging the Gap Between Local and Global Features
1
作者 Jie Hua Zhongyuan Wang +3 位作者 Xin Tian Qin Zou Jinsheng Xiao Jiayi Ma 《IEEE/CAA Journal of Automatica Sinica》 2025年第7期1391-1406,共16页
Object detection is a fundamental task in computer vision that involves identifying and localizing objects within an image.Local features extracted by convolutions,etc.,capture finegrained details such as edges and te... Object detection is a fundamental task in computer vision that involves identifying and localizing objects within an image.Local features extracted by convolutions,etc.,capture finegrained details such as edges and textures,while global features extracted by full connection layers,etc.,represent the overall structure and long-range relationships within the image.These features are crucial for accurate object detection,yet most existing methods focus on aggregating local and global features,often overlooking the importance of medium-range dependencies.To address this gap,we propose a novel full perception module(FPModule),a simple yet effective feature extraction module designed to simultaneously capture local details,medium-range dependencies,and long-range dependencies.Building on this,we construct a full perception head(FP-Head)by cascading multiple FP-Modules,enabling the prediction layer to leverage the most informative features.Experimental results in the MS COCO dataset demonstrate that our approach significantly enhances object recognition and localization,achieving 2.7−5.7 APval gains when integrated into standard object detectors.Notably,the FP-Module is a universal solution that can be seamlessly incorporated into existing detectors to boost performance.The code will be released at https://github.com/Idcogroup/FP-Head. 展开更多
关键词 feature aggregation full perception module medium-range dependencies object detection
在线阅读 下载PDF
Adaptive Multi-Feature Fusion for Vehicle Micro-Motor Noise Recognition Considering Auditory Perception 被引量:1
2
作者 Ting Zhao Weiping Ding +1 位作者 Haibo Huang Yudong Wu 《Sound & Vibration》 EI 2023年第1期133-153,共21页
The deployment of vehicle micro-motors has witnessed an expansion owing to the progression in electrification and intelligent technologies.However,some micro-motors may exhibit design deficiencies,component wear,assem... The deployment of vehicle micro-motors has witnessed an expansion owing to the progression in electrification and intelligent technologies.However,some micro-motors may exhibit design deficiencies,component wear,assembly errors,and other imperfections that may arise during the design or manufacturing phases.Conse-quently,these micro-motors might generate anomalous noises during their operation,consequently exerting a substantial adverse influence on the overall comfort of drivers and passengers.Automobile micro-motors exhibit a diverse array of structural variations,consequently leading to the manifestation of a multitude of distinctive auditory irregularities.To address the identification of diverse forms of abnormal noise,this research presents a novel approach rooted in the utilization of vibro-acoustic fusion-convolutional neural network(VAF-CNN).This method entails the deployment of distinct network branches,each serving to capture disparate features from the multi-sensor data,all the while considering the auditory perception traits inherent in the human auditory sys-tem.The intermediary layer integrates the concept of adaptive weighting of multi-sensor features,thus affording a calibration mechanism for the features hailing from multiple sensors,thereby enabling a further refinement of features within the branch network.For optimal model efficacy,a feature fusion mechanism is implemented in the concluding layer.To substantiate the efficacy of the proposed approach,this paper initially employs an augmented data methodology inspired by modified SpecAugment,applied to the dataset of abnormal noise sam-ples,encompassing scenarios both with and without in-vehicle interior noise.This serves to mitigate the issue of limited sample availability.Subsequent comparative evaluations are executed,contrasting the performance of the model founded upon single-sensor data against other feature fusion models reliant on multi-sensor data.The experimental results substantiate that the suggested methodology yields heightened recognition accuracy and greater resilience against interference.Moreover,it holds notable practical significance in the engineering domain,as it furnishes valuable support for the targeted management of noise emanating from vehicle micro-motors. 展开更多
关键词 Auditory perception MULTI-SENSOR feature adaptive fusion abnormal noise recognition vehicle interior noise
在线阅读 下载PDF
Skeleton-Silhouette Complementary Perception: Toward Robust Gait Recognition
3
作者 Xiaokai Liu Luyuan Hao 《Journal of Clinical and Nursing Research》 2025年第11期372-377,共6页
Gait,the unique pattern of how a person walks,has emerged as one of the most promising biometric features in modern intelligent sensing.Unlike fingerprints or facial characteristics,gait can be captured unobtrusively ... Gait,the unique pattern of how a person walks,has emerged as one of the most promising biometric features in modern intelligent sensing.Unlike fingerprints or facial characteristics,gait can be captured unobtrusively and at a distance,without requiring the subject’s awareness or cooperation.This makes it highly suitable for long-range surveillance,forensic investigation,and smart environments where contactless recognition is crucial.Traditional gait-recognition systems rely either on silhouettes,which capture the outer appearance of a person,or on skeletons,which describe the internal structure of human motion.Each modality provides only a partial understanding of gait.Silhouettes emphasize shape and contour but are easily distorted by clothing or carried objects;skeletons describe motion dynamics and limb coordination but lose discriminative details about body shape.This article presents the concept of Complementary Semantic Embedding(CSE),a unified framework that merges silhouette and skeleton information into a comprehensive semantic representation of human walking.By modeling the complementary nature of appearance and structure,the approach achieves more robust and accurate gait recognition even under challenging conditions. 展开更多
关键词 Complementary perception Gait recognition feature fusion
在线阅读 下载PDF
Visual feature inter-learning for sign language recognition in emergency medicine
4
作者 WEI Chao LI Yunpeng LIU Jingze 《Optoelectronics Letters》 2025年第10期619-625,共7页
Accessible communication based on sign language recognition(SLR)is the key to emergency medical assistance for the hearing-impaired community.Balancing the capture of both local and global information in SLR for emerg... Accessible communication based on sign language recognition(SLR)is the key to emergency medical assistance for the hearing-impaired community.Balancing the capture of both local and global information in SLR for emergency medicine poses a significant challenge.To address this,we propose a novel approach based on the inter-learning of visual features between global and local information.Specifically,our method enhances the perception capabilities of the visual feature extractor by strategically leveraging the strengths of convolutional neural network(CNN),which are adept at capturing local features,and visual transformers which perform well at perceiving global features.Furthermore,to mitigate the issue of overfitting caused by the limited availability of sign language data for emergency medical applications,we introduce an enhanced short temporal module for data augmentation through additional subsequences.Experimental results on three publicly available sign language datasets demonstrate the efficacy of the proposed approach. 展开更多
关键词 sign language recognition slr visual feature inter learning emergency medicine visual feature extractor capture both local global information enhances perception capabilities emergency medical assistance sign language recognition
原文传递
Partition feature extraction of hyperspectral images for in situ intelligent lithology identification
5
作者 Zhenhao Xu Shan Li +2 位作者 Peng Lin Heng Shi Yanfei Lou 《Journal of Rock Mechanics and Geotechnical Engineering》 2025年第12期7736-7752,共17页
Imaging hyperspectral technology has distinctive advantages of non-destructive and non-contact measurement,and the integration of spectral and spatial data.These characteristics present new methodologies for intellige... Imaging hyperspectral technology has distinctive advantages of non-destructive and non-contact measurement,and the integration of spectral and spatial data.These characteristics present new methodologies for intelligent geological sensing in tunnels and other underground engineering projects.However,the in situ acquisition and rapid classification of hyperspectral images in underground still faces great challenges,including the difficulty in obtaining uniform hyperspectral images and the complexity of deploying sophisticated models on mobile platforms.This study proposes an intelligent lithology identification method based on partition feature extraction of hyperspectral images.Firstly,pixel-level hyperspectral information from representative lithological regions is extracted and fused to obtain rock hyperspectral image partition features.Subsequently,an SG-SNV-PCA-DNN(SSPD)model specifically designed for optimizing rock hyperspectral data,performing spectral dimensionality reduction,and identifying lithology is integrated.In an experimental study involving 3420 hyperspectral images,the SSPD identification model achieved the highest accuracy in the testing set,reaching 98.77%.Moreover,the speed of the SSPD model was found to be 18.5%faster than that of the unprocessed model,with an accuracy improvement of 5.22%.In contrast,the ResNet-101 model,used for point-by-point identification based on non-partitioned features,achieved a maximum accuracy of 97.86%in the testing set.In addition,the partition feature extraction methods significantly reduce computational complexity.An objective evaluation of various models demonstrated that the SSPD model exhibited superior performance,achieving a precision(P)of 99.46%,a recall(R)of 99.44%,and F1 score(F1)of 99.45%.Additionally,a pioneering in situ detection work was carried out in a tunnel using underground hyperspectral imaging technology. 展开更多
关键词 In situ lithology identification Hyperspectral image Partition feature extraction Rock hyperspectral Underground intelligent geological perception Geological remote sensing technology
在线阅读 下载PDF
ST-SIGMA:Spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting 被引量:6
6
作者 Yang Fang Bei Luo +3 位作者 Ting Zhao Dong He Bingbing Jiang Qilie Liu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2022年第4期744-757,共14页
Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving(AD)system.However,most proposed methods aim at addressing one of the two challenges... Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving(AD)system.However,most proposed methods aim at addressing one of the two challenges mentioned above with a single model.To tackle this dilemma,this paper proposes spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting(STSIGMA),an efficient end-to-end method to jointly and accurately perceive the AD environment and forecast the trajectories of the surrounding traffic agents within a unified framework.ST-SIGMA adopts a trident encoder-decoder architecture to learn scene semantics and agent interaction information on bird’s-eye view(BEV)maps simultaneously.Specifically,an iterative aggregation network is first employed as the scene semantic encoder(SSE)to learn diverse scene information.To preserve dynamic interactions of traffic agents,ST-SIGMA further exploits a spatio-temporal graph network as the graph interaction encoder.Meanwhile,a simple yet efficient feature fusion method to fuse semantic and interaction features into a unified feature space as the input to a novel hierarchical aggregation decoder for downstream prediction tasks is designed.Extensive experiments on the nuScenes data set have demonstrated that the proposed ST-SIGMA achieves significant improvements compared to the state-of-theart(SOTA)methods in terms of scene perception and trajectory forecasting,respectively.Therefore,the proposed approach outperforms SOTA in terms of model generalisation and robustness and is therefore more feasible for deployment in realworld AD scenarios. 展开更多
关键词 feature fusion graph interaction hierarchical aggregation scene perception scene semantics trajectory forecasting
在线阅读 下载PDF
Regional Evolution Features and Coordinated Development Strategies for Northeast China 被引量:3
7
作者 MEI Lin XU Xiaopo CHEN Mingxiu 《Chinese Geographical Science》 SCIE CSCD 2006年第4期378-382,共5页
Northeast China, as the most important production base of agriculture, forestry, and livestock-breeding as well as the old industrial base in the whole country, has been playin a key role in the construction and deve... Northeast China, as the most important production base of agriculture, forestry, and livestock-breeding as well as the old industrial base in the whole country, has been playin a key role in the construction and development of China's economy. However, after the policy of reform and open-up was taken in China. the economic development speed and efficiency ofthis area have turned to be evidently lower than those of coastal area and the national average level as well, which is so-called 'Northeast Phenomenon' and 'Neo-Northeast Phenomenon'. In terms of those phenomena, this paper firstly reviews the spatial and temporal features of the regional evolution of this area so as to unveil the profound forming causes of 'Northeast Phenomena' and 'Neo-Northeast Phenomena'. And then the paper makes a further exploration into the status quo of this region and its forming causes by analyzing its economy gross, industrial structure, product structure, regional eco-categories, etc. At the end of the paper, the authors put forward the basic coordinated development strategies for Northeast China. namely we can revitalize this area by means of adjustment of economic structure, regional coordination, planning urban and rural areas as a whole, institutional innovation, etc. 展开更多
关键词 regional evolution spatial-temporal feature coordinated development strategy Northeast China
在线阅读 下载PDF
STGSA:A Novel Spatial-Temporal Graph Synchronous Aggregation Model for Traffic Prediction 被引量:3
8
作者 Zebing Wei Hongxia Zhao +5 位作者 Zhishuai Li Xiaojie Bu Yuanyuan Chen Xiqiao Zhang Yisheng Lv Fei-Yue Wang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第1期226-238,共13页
The success of intelligent transportation systems relies heavily on accurate traffic prediction,in which how to model the underlying spatial-temporal information from traffic data has come under the spotlight.Most exi... The success of intelligent transportation systems relies heavily on accurate traffic prediction,in which how to model the underlying spatial-temporal information from traffic data has come under the spotlight.Most existing frameworks typically utilize separate modules for spatial and temporal correlations modeling.However,this stepwise pattern may limit the effectiveness and efficiency in spatial-temporal feature extraction and cause the overlook of important information in some steps.Furthermore,it is lacking sufficient guidance from prior information while modeling based on a given spatial adjacency graph(e.g.,deriving from the geodesic distance or approximate connectivity),and may not reflect the actual interaction between nodes.To overcome those limitations,our paper proposes a spatial-temporal graph synchronous aggregation(STGSA)model to extract the localized and long-term spatial-temporal dependencies simultaneously.Specifically,a tailored graph aggregation method in the vertex domain is designed to extract spatial and temporal features in one graph convolution process.In each STGSA block,we devise a directed temporal correlation graph to represent the localized and long-term dependencies between nodes,and the potential temporal dependence is further fine-tuned by an adaptive weighting operation.Meanwhile,we construct an elaborated spatial adjacency matrix to represent the road sensor graph by considering both physical distance and node similarity in a datadriven manner.Then,inspired by the multi-head attention mechanism which can jointly emphasize information from different r epresentation subspaces,we construct a multi-stream module based on the STGSA blocks to capture global information.It projects the embedding input repeatedly with multiple different channels.Finally,the predicted values are generated by stacking several multi-stream modules.Extensive experiments are constructed on six real-world datasets,and numerical results show that the proposed STGSA model significantly outperforms the benchmarks. 展开更多
关键词 Deep learning graph neural network(GNN) multistream spatial-temporal feature extraction temporal graph traffic prediction
在线阅读 下载PDF
Spatial-Temporal Characteristics of Regional Extreme Low Temperature Events in China during 1960-2009 被引量:1
9
作者 WANG Xiao-Juan GONG Zhi-Qiang +1 位作者 REN Fu-Min FENG Guo-Lin 《Advances in Climate Change Research》 SCIE 2012年第4期186-194,共9页
An objective identification technique is used to detect regional extreme low temperature events (RELTE) in China during 1960-2009. Their spatial-temporal characteristics are analyzed. The results indicate that the l... An objective identification technique is used to detect regional extreme low temperature events (RELTE) in China during 1960-2009. Their spatial-temporal characteristics are analyzed. The results indicate that the lowest temperatures of RELTE, together with the frequency distribution of the geometric latitude center, exhibit a double-peak feature. The RELTE frequently happen near the geometric area of 30°N and 42°N before the mid-1980s, but shifted afterwards to 30°N. During 1960-2009, the frequency~ intensity, and the maximum impacted area of RELTE show overall decreasing trends. Due to the contribution of RELTE, with long duratioh and large spatial range, which account for 10% of the total RELTE, there is a significant turning point in the late 1980s. A change to a much more steady state after the late 1990s is identified. In addition, the integrated indices of RELTE are classified and analyzed. 展开更多
关键词 regional extreme low temperature events spatial-temporal features turning point frequency distribution
在线阅读 下载PDF
Spatial-Temporal Correlation 3D Vehicle Detection and Tracking System with Multiple Surveillance Cameras
10
作者 薛炜彭 吴明虎 王琳 《Journal of Shanghai Jiaotong university(Science)》 EI 2023年第1期52-60,共9页
Compared to 3D object detection using a single camera,multiple cameras can overcome some limitations on field-of-view,occlusion,and low detection confidence.This study employs multiple surveillance cameras and develop... Compared to 3D object detection using a single camera,multiple cameras can overcome some limitations on field-of-view,occlusion,and low detection confidence.This study employs multiple surveillance cameras and develops a cooperative 3D object detection and tracking framework by incorporating temporal and spatial information.The framework consists of a 3D vehicle detection model,cooperatively spatial-temporal relation scheme,and heuristic camera constellation method.Specifically,the proposed cross-camera association scheme combines the geometric relationship between multiple cameras and objects in corresponding detections.The spatial-temporal method is designed to associate vehicles between different points of view at a single timestamp and fulfill vehicle tracking in the time aspect.The proposed framework is evaluated based on a synthetic cooperative dataset and shows high reliability,where the cooperative perception can recall more than 66%of the trajectory instead of 11%for single-point sensing.This could contribute to full-range surveillance for intelligent transportation systems. 展开更多
关键词 multi-object tracking 3D detection multiple sensors cooperative perception spatial-temporal correlation intelligent transportation system
原文传递
Hyperspectral remote sensing identification of marine oil emulsions based on the fusion of spatial and spectral features
11
作者 Xinyue Huang Yi Ma +1 位作者 Zongchen Jiang Junfang Yang 《Acta Oceanologica Sinica》 SCIE CAS CSCD 2024年第3期139-154,共16页
Marine oil spill emulsions are difficult to recover,and the damage to the environment is not easy to eliminate.The use of remote sensing to accurately identify oil spill emulsions is highly important for the protectio... Marine oil spill emulsions are difficult to recover,and the damage to the environment is not easy to eliminate.The use of remote sensing to accurately identify oil spill emulsions is highly important for the protection of marine environments.However,the spectrum of oil emulsions changes due to different water content.Hyperspectral remote sensing and deep learning can use spectral and spatial information to identify different types of oil emulsions.Nonetheless,hyperspectral data can also cause information redundancy,reducing classification accuracy and efficiency,and even overfitting in machine learning models.To address these problems,an oil emulsion deep-learning identification model with spatial-spectral feature fusion is established,and feature bands that can distinguish between crude oil,seawater,water-in-oil emulsion(WO),and oil-in-water emulsion(OW)are filtered based on a standard deviation threshold–mutual information method.Using oil spill airborne hyperspectral data,we conducted identification experiments on oil emulsions in different background waters and under different spatial and temporal conditions,analyzed the transferability of the model,and explored the effects of feature band selection and spectral resolution on the identification of oil emulsions.The results show the following.(1)The standard deviation–mutual information feature selection method is able to effectively extract feature bands that can distinguish between WO,OW,oil slick,and seawater.The number of bands was reduced from 224 to 134 after feature selection on the Airborne Visible Infrared Imaging Spectrometer(AVIRIS)data and from 126 to 100 on the S185 data.(2)With feature selection,the overall accuracy and Kappa of the identification results for the training area are 91.80%and 0.86,respectively,improved by 2.62%and 0.04,and the overall accuracy and Kappa of the identification results for the migration area are 86.53%and 0.80,respectively,improved by 3.45%and 0.05.(3)The oil emulsion identification model has a certain degree of transferability and can effectively identify oil spill emulsions for AVIRIS data at different times and locations,with an overall accuracy of more than 80%,Kappa coefficient of more than 0.7,and F1 score of 0.75 or more for each category.(4)As the spectral resolution decreasing,the model yields different degrees of misclassification for areas with a mixed distribution of oil slick and seawater or mixed distribution of WO and OW.Based on the above experimental results,we demonstrate that the oil emulsion identification model with spatial–spectral feature fusion achieves a high accuracy rate in identifying oil emulsion using airborne hyperspectral data,and can be applied to images under different spatial and temporal conditions.Furthermore,we also elucidate the impact of factors such as spectral resolution and background water bodies on the identification process.These findings provide new reference for future endeavors in automated marine oil spill detection. 展开更多
关键词 oil emulsions IDENTIFICATION hyperspectral remote sensing feature selection convolutional neural network(CNN) spatial-temporal transferability
在线阅读 下载PDF
基于红外传感远程监控的电力系统发热风险自动感知
12
作者 黎锐烽 黄国泳 +1 位作者 刘颖 黄杰辉 《传感技术学报》 北大核心 2026年第1期221-226,共6页
电力设备因长时间作业,内部容易出现发热问题,若不能及时发现,轻则零件部分损坏,重则大面积停电。当前电力系统发热风险检测主要采用人工巡检的方式,作业速度慢,且易出错,为此,提出基于红外传感远程监控的电力系统发热风险自动感知方法... 电力设备因长时间作业,内部容易出现发热问题,若不能及时发现,轻则零件部分损坏,重则大面积停电。当前电力系统发热风险检测主要采用人工巡检的方式,作业速度慢,且易出错,为此,提出基于红外传感远程监控的电力系统发热风险自动感知方法。考虑到红外传感远程监控图像分辨率较低,通过仿射变换将红外监控图像转换成可见光图像,利用速度增强的稳定特征(Speeded-Up Robust Features,SURF)算子、最佳箱优先搜索(Best Bin First,BBF)算法匹配图像特征点,并通过二次规划对偶问题找出特征点最佳分类超平面,确定图像发热风险区域,完成电力系统发热风险自动感知。实验结果表明,所提方法的发热风险点检测误差保持在0.1℃内,且整体耗时低于6 ms。 展开更多
关键词 红外传感 发热风险自动感知 远程监控 特征点匹配 支持向量机 超平面
在线阅读 下载PDF
关键特征感知并行细粒度特征提取的密集行人检测
13
作者 齐向明 刘晓暄 王子键 《计算机工程与应用》 北大核心 2026年第1期297-306,共10页
针对密集行人检测中存在目标密集且相互遮挡问题,提出一种关键特征感知并行细粒度特征提取的密集行人检测算法。以YOLOv8n为基线网络,在vision Transformer中再次加入自注意力机制得到DS-ViT(dual search)深度感知全局关键特征,优化CBS... 针对密集行人检测中存在目标密集且相互遮挡问题,提出一种关键特征感知并行细粒度特征提取的密集行人检测算法。以YOLOv8n为基线网络,在vision Transformer中再次加入自注意力机制得到DS-ViT(dual search)深度感知全局关键特征,优化CBS使用3个3×3Conv,设计双支路加入空间注意力机制得到FE-Conv(feature enhance)增强空间和通道双重特征提取局部细粒度,DS-ViT与FE-Conv并行重构主干网络,增强多尺度特征提取能力;颈部网络输入端嵌入空间注意力机制,增强多层次特征融合;检测网络新增三个卷积层,删减20×20检测头,降低漏检和错检率。在自制数据集上做消融实验和对比实验,与基线网络对比,mAP、Precision、Recall、IoU和FPS分别提高5.4个百分点、4.9个百分点、6.4个百分点、6.2个百分点和6.2,Parameters值仅增加1×10^(5),表明该算法有良好表现。在公开数据集WiderPerson上做泛化实验,较基线网络平均检测精度提升1.6个百分点,表明该算法具备较好鲁棒性。 展开更多
关键词 密集行人检测 关键特征深度感知 细粒度特征双重提取 YOLOv8n 视觉变换器(ViT) CBS
在线阅读 下载PDF
基于改进YOLOv8的钢材表面缺陷检测
14
作者 李昊璇 孙宇翔 《测试技术学报》 2026年第1期1-9,共9页
为应对钢材表面缺陷检测中存在的缺陷种类多元、尺寸跨度大、检测精度较低及模型泛化能力不足等挑战,提出了一种钢材表面缺陷检测改进算法,命名为CGP-YOLO。首先,在Neck部分,使用内容感知特征重组(Content-Aware Reassembly of Features... 为应对钢材表面缺陷检测中存在的缺陷种类多元、尺寸跨度大、检测精度较低及模型泛化能力不足等挑战,提出了一种钢材表面缺陷检测改进算法,命名为CGP-YOLO。首先,在Neck部分,使用内容感知特征重组(Content-Aware Reassembly of Features,CARAFE)替换最近邻插值算子,解决了上采样特征图出现的块状效应、细节丢失严重等问题;其次,在Head前引入全局注意力机制(Global Attention Mechanism,GAM),通过调控通道和空间特征的交互增强了模型的表征能力;最后,引入可编程梯度信息(Programmable Gradient Information,PGI)模块,通过其多级辅助信息组件逐步整合不同尺度的特征,有效提高了模型对不同尺度缺陷敏感性。与初始算法相比,算法CGP-YOLO性能显著提升,平均精度均值达到了80.8%,高于原算法3.6百分点。 展开更多
关键词 YOLOv8 钢材表面缺陷检测 内容感知特征重组 全局注意力机制 可编程梯度信息
在线阅读 下载PDF
基于改进Otsu算法的轴承图像阈值分割方法
15
作者 赵宇航 吴超华 +2 位作者 王鑫 张晟琦 史晓亮 《机电工程》 北大核心 2026年第1期34-44,共11页
针对轴承装配后质检时,工业相机采集到的图像存在混合噪声和背光源过曝,导致图像分割精度不高、影响后续处理的问题,提出了一种基于多特征感知与贝叶斯优化的改进大津算法(Otsu)的轴承图像阈值分割方法。首先,在图像感知层面,读取、计... 针对轴承装配后质检时,工业相机采集到的图像存在混合噪声和背光源过曝,导致图像分割精度不高、影响后续处理的问题,提出了一种基于多特征感知与贝叶斯优化的改进大津算法(Otsu)的轴承图像阈值分割方法。首先,在图像感知层面,读取、计算了图像的灰度值、梯度幅值、局部二值模型并完成了储存,对三特征数据分别进行了归一化处理,加权融合了三特征数据,得到了轴承图像的加权融合图;然后,在计算效率层面,引入了贝叶斯优化与分块动态规划方法,替代了Otsu的穷举法;最后,结合感知方法与加速方法,得到了一种基于多特征感知与贝叶斯优化的改进Otsu算法(MFB-Otsu),并与其他阈值分割算法进行了性能对比实验。研究结果表明:与Otsu算法、自适应分割算法对比,MFB-Otsu算法在保留了轴承图像全局亮度分布的基础上,强化了外轮廓边缘,抑制了边缘噪声,输出图像边缘平滑,背景与前景彻底分离;在实测边缘提取中,边缘信息保留完整,噪点干扰率相较于Otsu降低了55%;在客观图像评价指标方面,该算法的分割准确率、交并比、F1值分别达到0.997、0.989、0.994,优于Otsu算法和自适应分割算法;在计算效率层面,相较于Otsu提高了12.9%。改进Otsu算法在轴承尺寸检测中表现出良好的通用性与稳定性,具有一定的工程应用价值。 展开更多
关键词 轴承装配 图像处理 图像分割 大津算法 多特征感知和贝叶斯优化 Otsu加速方法 三维度特征加权 分块动态优化
在线阅读 下载PDF
融合傅里叶卷积与差异感知的钢材表面微小缺陷检测算法
16
作者 张胜伟 曹洁 《广西师范大学学报(自然科学版)》 北大核心 2026年第2期90-102,共13页
针对当前钢材表面缺陷检测方法对微小缺陷检测效果不佳的问题,本文提出一种融合傅里叶卷积与差异感知的钢材表面微小缺陷检测算法。该算法使用CSP-FFCM替换主干网络中的BasicBlock,通过在空间域和频域中进行卷积操作,以减少计算开销并... 针对当前钢材表面缺陷检测方法对微小缺陷检测效果不佳的问题,本文提出一种融合傅里叶卷积与差异感知的钢材表面微小缺陷检测算法。该算法使用CSP-FFCM替换主干网络中的BasicBlock,通过在空间域和频域中进行卷积操作,以减少计算开销并提升网络的特征提取能力。然后,提出多尺度特征层优化策略,在保留细粒度特征信息的同时,优化计算资源分配,确保模型对微小缺陷细节信息的有效捕捉。最后,设计差异感知特征增强模块,通过强化微小缺陷的特征表示能力,进一步提升模型对微小缺陷的检测性能。实验结果表明,本文算法在NEU-DET和GC10-DET数据集上mAP分别达到83.7%和73.1%,在钢材表面微小缺陷的高精度检测任务中表现出显著的性能优势。 展开更多
关键词 微小缺陷检测 傅里叶卷积 多尺度特征层优化 差异感知
在线阅读 下载PDF
面向自动驾驶的多尺度目标三维检测算法
17
作者 刘嫚 陈晓楠 《现代电子技术》 北大核心 2026年第1期141-147,共7页
在自动驾驶场景中,使用单目相机进行三维目标检测是一项具有挑战性的任务,尤其是在复杂道路环境下,目标的尺度差异和遮挡现象容易导致误检或漏检。针对这一问题,文中提出一种基于特征融合与增强的单目三维目标检测算法。首先,构建Faster... 在自动驾驶场景中,使用单目相机进行三维目标检测是一项具有挑战性的任务,尤其是在复杂道路环境下,目标的尺度差异和遮挡现象容易导致误检或漏检。针对这一问题,文中提出一种基于特征融合与增强的单目三维目标检测算法。首先,构建FasterNet+作为骨干网络,通过优化嵌入层和块结构,增强细节信息的提取,提升网络的整体性能;其次,设计多维特征自适应融合模块,自适应地选择并融合高维与低维特征,解决高维特征丢失小目标信息和低维特征缺乏上下文信息的问题;最后,引入特征增强注意力模块,突出特定目标区域,进一步提升网络在目标定位和分类方面的精度。在nuScenes数据集上的实验结果表明,其mAP和NDS比基准方法分别提高0.038和0.035,可以有效检测出不同类型和尺度的目标,并展现出更强的鲁棒性,为自动驾驶场景中的多维目标检测提供了一种新思路。 展开更多
关键词 自动驾驶 单目相机 三维目标检测 多尺度感知 特征融合 注意力机制 机器视觉
在线阅读 下载PDF
基于时频多通道特征的无人机信号智能识别方法
18
作者 白建胜 李琼怡 +1 位作者 姚金杰 王黎明 《测试技术学报》 2026年第1期34-44,共11页
针对复杂空域环境中快速机动、型号各异的无人机目标的精准感知需求,提出一种基于时频多通道特征的无人机信号识别方法。信号处理部分,首先对无人机信号去趋势化去噪,消除直流分量和低频漂移从而降低频域干扰,然后,利用离散傅里叶变换... 针对复杂空域环境中快速机动、型号各异的无人机目标的精准感知需求,提出一种基于时频多通道特征的无人机信号识别方法。信号处理部分,首先对无人机信号去趋势化去噪,消除直流分量和低频漂移从而降低频域干扰,然后,利用离散傅里叶变换获取时频信息,通过频谱拼接与振幅归一化构建能量频谱数据,针对其空间异质性沿频域进行多特征通道处理。模型部分,基于多通道二维卷积模型(MC-2DCNN)提出多特征通道信号张量化处理策略,精准适配模型卷积特性。通过频域特征的空间解耦与结构化重组,在保障特征拓扑邻接关系的同时借助通道分离机制实现了跨频段特征的层次化提取和差异化建模,增强了模型鲁棒性和泛化性。实验表明,所提方法在-10~10 dB噪声范围内对机型与飞行模式的识别准确率分别达到98.15%和96.32%,在电磁噪声、梳状谱和阻塞式干扰的混叠噪声场景下识别准确率可达91.97%和90.56%,满足军民领域无人探测系统的智能识别需求。 展开更多
关键词 射频信号感知 多通道特征 无人机识别 时频变换
在线阅读 下载PDF
变压器充油设备火灾中的消防炮视觉感知对准策略与实现方法
19
作者 王海雷 李宗泽 +1 位作者 赵俊 朱红 《软件导刊》 2026年第2期193-201,共9页
变压器充油设备火灾在短时间内迅速蔓延,不仅会损坏设备,还会造成电力系统停运。其中,燃油会助长火势、增加扑救难度,对周边工农业生产、居民生活造成巨大影响、经济损失。如何在火灾早期及时响应、报警、控制火灾发展和蔓延,根据火势... 变压器充油设备火灾在短时间内迅速蔓延,不仅会损坏设备,还会造成电力系统停运。其中,燃油会助长火势、增加扑救难度,对周边工农业生产、居民生活造成巨大影响、经济损失。如何在火灾早期及时响应、报警、控制火灾发展和蔓延,根据火势制定合理的灭火扑救方案,成为亟待解决的问题。为此,在深入研究变压器充油设备火灾燃烧特点的基础上,提出针对变压器充油设备火灾的消防炮视觉感知对准策略与实现方法。首先,利用视觉感知技术获取火焰对象;其次,基于火焰对象形态与变压器火灾燃烧特性提出水炮对准策略,以动态确定灭火点;最后,基于灭火点空间坐标实现消防炮联动,控制消防炮指向灭火点位。实验证明,所提方法对火焰对象的感知F1值高达0.914,所选灭火点与燃烧特性密切相关且更合理,消防炮的近距灭火联动控制误差小于0.1 m。 展开更多
关键词 充油设备火灾 火焰感知 特征提取 灭火点 消防炮
在线阅读 下载PDF
以温度场均匀稳定性为核心的钛合金智能铸造设计研究
20
作者 李宝文 王雪锋 《铸造》 2026年第3期288-292,共5页
钛合金铸造因为高活性熔体与复杂型腔的强交互作用,导致其调控精度难以控制。为突破瓶颈,提出构建基于多物理场耦合模型的熔体流动与传热协同优化机制,以及开发融合高维特征提取与降维技术的铸造过程温度场实时闭环控制系统。期望本研... 钛合金铸造因为高活性熔体与复杂型腔的强交互作用,导致其调控精度难以控制。为突破瓶颈,提出构建基于多物理场耦合模型的熔体流动与传热协同优化机制,以及开发融合高维特征提取与降维技术的铸造过程温度场实时闭环控制系统。期望本研究构建的智能控制系统框架为钛合金铸件铸造提供理论支撑与技术保障。 展开更多
关键词 钛合金铸造 温度场均匀性 智能控制系统 高维特征降维 多模态感知
在线阅读 下载PDF
上一页 1 2 21 下一页 到第
使用帮助 返回顶部