期刊文献+
共找到102,826篇文章
< 1 2 250 >
每页显示 20 50 100
An Unsupervised Online Detection Method for Foreign Objects in Complex Environments
1
作者 YANG Xiaoyang YANG Yanzhu DENG Haiping 《Journal of Donghua University(English Edition)》 2026年第1期140-151,共12页
In modern industrial production,foreign object detection in complex environments is crucial to ensure product quality and production safety.Detection systems based on deep-learning image processing algorithms often fa... In modern industrial production,foreign object detection in complex environments is crucial to ensure product quality and production safety.Detection systems based on deep-learning image processing algorithms often face challenges with handling high-resolution images and achieving accurate detection against complex backgrounds.To address these issues,this study employs the PatchCore unsupervised anomaly detection algorithm combined with data augmentation techniques to enhance the system’s generalization capability across varying lighting conditions,viewing angles,and object scales.The proposed method is evaluated in a complex industrial detection scenario involving the bogie of an electric multiple unit(EMU).A dataset consisting of complex backgrounds,diverse lighting conditions,and multiple viewing angles is constructed to validate the performance of the detection system in real industrial environments.Experimental results show that the proposed model achieves an average area under the receiver operating characteristic curve(AUROC)of 0.92 and an average F1 score of 0.85.Combined with data augmentation,the proposed model exhibits improvements in AUROC by 0.06 and F1 score by 0.03,demonstrating enhanced accuracy and robustness for foreign object detection in complex industrial settings.In addition,the effects of key factors on detection performance are systematically analyzed,providing practical guidance for parameter selection in real industrial applications. 展开更多
关键词 foreign object detection unsupervised learning data augmentation complex environment BOGIE DATASET
在线阅读 下载PDF
TQU-GraspingObject:3D Common Objects Detection,Recognition,and Localization on Point Cloud for Hand Grasping in Sharing Environments
2
作者 Thi-Loan Nguyen Huy-Nam Chu +2 位作者 The-Thanh Hua Trung-Nghia Phung Van-Hung Le 《Computers, Materials & Continua》 2026年第5期1701-1722,共22页
To support the process of grasping objects on a tabletop for the blind or robotic arm,it is necessary to address fundamental computer vision tasks,such as detecting,recognizing,and locating objects in space,and determ... To support the process of grasping objects on a tabletop for the blind or robotic arm,it is necessary to address fundamental computer vision tasks,such as detecting,recognizing,and locating objects in space,and determining the position of the grasping information.These results can then be used to guide the visually impaired or to execute grasping tasks with a robotic arm.In this paper,we collected,annotated,and published the benchmark TQUGraspingObject dataset for testing,validation,and evaluation of deep learning(DL)models for detecting,recognizing,and localizing grasping objects in 2D and 3D space,especially 3D point cloud data.Our dataset is collected in a shared room,with common everyday objects placed on the tabletop in jumbled positions by Intel RealSense D435(IR-D435).This dataset includes more than 63k RGB-D pairs and related data such as normalized 3D object point cloud,3D object point cloud segmented,coordinate system normalizationmatrix,3D object point cloud normalized,and hand pose for grasping each object.At the same time,we also conducted experiments on fourDL networks with the best performance:SSD-MobileNetV3,ResNet50-Transformer,ResNet101-Transformer,and YOLOv12.The results present that YOLOv12 has the most suitable results in detecting and recognizing objects in images.All data,annotations,toolkit,source code,point cloud data,and results are publicly available on our project website:https://github.com/HuaTThanhIT2327Tqu/datasetv2. 展开更多
关键词 Grasping object of blind/Robot arm TQU-graspingobject benchmark dataset 3D point cloud data deep learning(DL) object detection/recognition intel realsense D435(IR-D435)
在线阅读 下载PDF
基于YOLO-Crab与改进的DeepSORT的水下河蟹检测与计数方法
3
作者 吴佳文 姬伟 +1 位作者 翟科龙 许波 《电子测量技术》 北大核心 2025年第17期132-141,共10页
为实现淡水池塘中无人水产养殖船的精准投饵,提出了YOLO-Crab+改进的DeepSORT的河蟹计数方法。首先,针对水下河蟹图像模糊、对比度低等问题,提出了一种基于CLAHE预处理下的YOLOv8的河蟹检测模型YOLO-Crab。YOLO-Crab在主干中增加坐标注... 为实现淡水池塘中无人水产养殖船的精准投饵,提出了YOLO-Crab+改进的DeepSORT的河蟹计数方法。首先,针对水下河蟹图像模糊、对比度低等问题,提出了一种基于CLAHE预处理下的YOLOv8的河蟹检测模型YOLO-Crab。YOLO-Crab在主干中增加坐标注意力机制,提高检测精度,同时,通过SimSPPF池化和GSConv+Slim Neck设计减轻模型量级。改进的DeepSORT算法用DIOU匹配替代IOU匹配来解决水草遮挡导致的河蟹ID跳变问题。实验表明,YOLO-Crab模型检测精度和F1分别达到了97.3%和94%,计数方法平均精度为81%。同时,将模型移植到Jeston AGX Orin上,检测精度达到95%,检测速度为60fps,提升了50%,计数精度为78%,能够为无人水产养殖船精准投饵提供可靠依据。 展开更多
关键词 无人水产养殖 YOLOv8 目标检测 河蟹计数 DeepSORT
原文传递
Transforming Education with Photogrammetry:Creating Realistic 3D Objects for Augmented Reality Applications
4
作者 Kaviyaraj Ravichandran Uma Mohan 《Computer Modeling in Engineering & Sciences》 SCIE EI 2025年第1期185-208,共24页
Augmented reality(AR)is an emerging dynamic technology that effectively supports education across different levels.The increased use of mobile devices has an even greater impact.As the demand for AR applications in ed... Augmented reality(AR)is an emerging dynamic technology that effectively supports education across different levels.The increased use of mobile devices has an even greater impact.As the demand for AR applications in education continues to increase,educators actively seek innovative and immersive methods to engage students in learning.However,exploring these possibilities also entails identifying and overcoming existing barriers to optimal educational integration.Concurrently,this surge in demand has prompted the identification of specific barriers,one of which is three-dimensional(3D)modeling.Creating 3D objects for augmented reality education applications can be challenging and time-consuming for the educators.To address this,we have developed a pipeline that creates realistic 3D objects from the two-dimensional(2D)photograph.Applications for augmented and virtual reality can then utilize these created 3D objects.We evaluated the proposed pipeline based on the usability of the 3D object and performance metrics.Quantitatively,with 117 respondents,the co-creation team was surveyed with openended questions to evaluate the precision of the 3D object created by the proposed photogrammetry pipeline.We analyzed the survey data using descriptive-analytical methods and found that the proposed pipeline produces 3D models that are positively accurate when compared to real-world objects,with an average mean score above 8.This study adds new knowledge in creating 3D objects for augmented reality applications by using the photogrammetry technique;finally,it discusses potential problems and future research directions for 3D objects in the education sector. 展开更多
关键词 Augmented reality education immersive learning 3D object creation PHOTOGRAMMETRY and StructureFromMotion
在线阅读 下载PDF
Study on Color Difference of Color Reproduction of 3D Objects
5
作者 GU Chong DENG Yi-qiang 《印刷与数字媒体技术研究》 北大核心 2025年第4期33-38,69,共7页
To investigate the applicability of four commonly used color difference formulas(CIELAB,CIE94,CMC(1:1),and CIEDE2000)in the printing field on 3D objects,as well as the impact of four standard light sources(D65,D50,A,a... To investigate the applicability of four commonly used color difference formulas(CIELAB,CIE94,CMC(1:1),and CIEDE2000)in the printing field on 3D objects,as well as the impact of four standard light sources(D65,D50,A,and TL84)on 3D color difference evaluations,50 glossy spheres with a diameter of 2cm based on the Sailner J4003D color printing device were created.These spheres were centered around the five recommended colors(gray,red,yellow,green,and blue)by CIE.Color difference was calculated according to the four formulas,and 111 pairs of experimental samples meeting the CIELAB gray scale color difference requirements(1.0-14.0)were selected.Ten observers,aged between 22 and 27 with normal color vision,were participated in this study,using the gray scale method from psychophysical experiments to conduct color difference evaluations under the four light sources,with repeated experiments for each observer.The results indicated that the overall effect of the D65 light source on 3D objects color difference was minimal.In contrast,D50 and A light sources had a significant impact within the small color difference range,while the TL84 light source influenced both large and small color difference considerably.Among the four color difference formulas,CIEDE2000 demonstrated the best predictive performance for color difference in 3D objects,followed by CMC(1:1),CIE94,and CIELAB. 展开更多
关键词 Color difference formula 3D objects Light source Gray scale Normalized residual sum of squares
在线阅读 下载PDF
Transorbital craniocerebral injury caused by metallic foreign objects
6
作者 Chongqing Yang Hongguang Cui +2 位作者 Xiawei Wang Chenying Yu Yan Long 《World Journal of Emergency Medicine》 2025年第3期277-279,共3页
Transorbital craniocerebral injury is a relatively rare type of penetrating head injury that poses a significant threat to the ocular and cerebral structures.^([1])The clinical prognosis of transorbital craniocerebral... Transorbital craniocerebral injury is a relatively rare type of penetrating head injury that poses a significant threat to the ocular and cerebral structures.^([1])The clinical prognosis of transorbital craniocerebral injury is closely related to the size,shape,speed,nature,and trajectory of the foreign object,as well as the incidence of central nervous system damage and secondary complications.The foreign objects reported to have caused these injuries are categorized into wooden items,metallic items,^([2-8])and other materials,which penetrate the intracranial region via fi ve major pathways,including the orbital roof (OR),superior orbital fissure (SOF),inferior orbital fissure(IOF),optic canal (OC),and sphenoid wing.Herein,we present eight cases of transorbital craniocerebral injury caused by an unusual metallic foreign body. 展开更多
关键词 transorbital craniocerebral injury ocular cerebral structures foreign objectas central nervous system damage penetrating head injury foreign objects metallic foreign objects clinical prognosis
暂未订购
Exploration of the Application of Artificial Intelligence Technology in the Transformation of Old Objects
7
作者 Tonghuan Zhang Xinyu Yang +1 位作者 Ying Chen Qiufan Xie 《Journal of Electronic Research and Application》 2025年第2期51-57,共7页
With the rapid development of technology,artificial intelligence(AI)is increasingly being applied in various fields.In today’s context of resource scarcity,pursuit of sustainable development and resource reuse,the tr... With the rapid development of technology,artificial intelligence(AI)is increasingly being applied in various fields.In today’s context of resource scarcity,pursuit of sustainable development and resource reuse,the transformation of old objects is particularly important.This article analyzes the current status of old object transformation and the opportunities brought by the internet to old objects and delves into the application of artificial intelligence in old object transformation.The focus is on five aspects:intelligent identification and classification,intelligent evaluation and prediction,automation integration,intelligent design and optimization,and integration of 3D printing technology.Finally,the process of“redesigning an old furniture,such as a wooden desk,through AI technology”is described,including the recycling,identification,detection,design,transformation,and final user feedback of the old wooden desk.This illustrates the unlimited potential of the“AI+old object transformation”approach,advocates for people to strengthen green environmental protection,and drives sustainable development. 展开更多
关键词 Artificial Intelligence(AI) Old object transformation Environmental protection
在线阅读 下载PDF
Physics-Informed Graph Learning for Shape Prediction in Robot Manipulate of Deformable Linear Objects
8
作者 Meixuan Wang Junliang Wang +2 位作者 Jie Zhang Xinting Liao Guojin Li 《Chinese Journal of Mechanical Engineering》 2025年第6期154-165,共12页
Shape prediction of deformable linear objects(DLO)plays critical roles in robotics,medical devices,aerospace,and manufacturing,especially in manipulating objects such as cables,wires,and fibers.Due to the inherent fle... Shape prediction of deformable linear objects(DLO)plays critical roles in robotics,medical devices,aerospace,and manufacturing,especially in manipulating objects such as cables,wires,and fibers.Due to the inherent flexibility of DLO and their complex deformation behaviors,such as bending and torsion,it is challenging to predict their dynamic characteristics accurately.Although the traditional physical modeling method can simulate the complex deformation behavior of DLO,the calculation cost is high and it is difficult to meet the demand of real-time prediction.In addition,the scarcity of data resources also limits the prediction accuracy of existing models.To solve these problems,a method of fiber shape prediction based on a physical information graph neural network(PIGNN)is proposed in this paper.This method cleverly combines the powerful expressive power of graph neural networks with the strict constraints of physical laws.Specifically,we learn the initial deformation model of the fiber through graph neural networks(GNN)to provide a good initial estimate for the model,which helps alleviate the problem of data resource scarcity.During the training process,we incorporate the physical prior knowledge of the dynamic deformation of the fiber optics into the loss function as a constraint,which is then fed back to the network model.This ensures that the shape of the fiber optics gradually approaches the true target shape,effectively solving the complex nonlinear behavior prediction problem of deformable linear objects.Experimental results demonstrate that,compared to traditional methods,the proposed method significantly reduces execution time and prediction error when handling the complex deformations of deformable fibers.This showcases its potential application value and superiority in fiber manipulation. 展开更多
关键词 Deformable linear objects Fiber Physics-informed graph neural network(PIGNN) Shape prediction
在线阅读 下载PDF
Implementing Convolutional Neural Networks to Detect Dangerous Objects in Video Surveillance Systems
9
作者 Carlos Rojas Cristian Bravo +1 位作者 Carlos Enrique Montenegro-Marín Rubén González-Crespo 《Computers, Materials & Continua》 2025年第12期5489-5507,共19页
The increasing prevalence of violent incidents in public spaces has created an urgent need for intelligent surveillance systems capable of detecting dangerous objects in real time.While traditional video surveillance ... The increasing prevalence of violent incidents in public spaces has created an urgent need for intelligent surveillance systems capable of detecting dangerous objects in real time.While traditional video surveillance relies on human monitoring,this approach suffers from limitations such as fatigue and delayed response times.This study addresses these challenges by developing an automated detection system using advanced deep learning techniques to enhance public safety.Our approach leverages state-of-the-art convolutional neural networks(CNNs),specifically You Only Look Once version 4(YOLOv4)and EfficientDet,for real-time object detection.The system was trained on a comprehensive dataset of over 50,000 images,enhanced through data augmentation techniques to improve robustness across varying lighting conditions and viewing angles.Cloud-based deployment on Amazon Web Services(AWS)ensured scalability and efficient processing.Experimental evaluations demonstrated high performance,with YOLOv4 achieving 92%accuracy and processing images in 0.45 s,while EfficientDet reached 93%accuracy with a slightly longer processing time of 0.55 s per image.Field tests in high-traffic environments such as train stations and shopping malls confirmed the system’s reliability,with a false alarm rate of only 4.5%.The integration of automatic alerts enabled rapid security responses to potential threats.The proposed CNN-based system provides an effective solution for real-time detection of dangerous objects in video surveillance,significantly improving response times and public safety.While YOLOv4 proved more suitable for speed-critical applications,EfficientDet offered marginally better accuracy.Future work will focus on optimizing the system for low-light conditions and further reducing false positives.This research contributes to the advancement of AI-driven surveillance technologies,offering a scalable framework adaptable to various security scenarios. 展开更多
关键词 Automatic detection of objects convolutional neural networks deep learning real-time image processing video surveillance systems automatic alerts
在线阅读 下载PDF
Semantic segmentation of camouflage objects via fusing reconstructed multispectral and RGB images
10
作者 Feng Huang Gonghan Yang +5 位作者 Jing Chen Yixuan Xu Jingze Su Guimin Huang Shu Wang Wenxi Liu 《Defence Technology(防务技术)》 2025年第8期324-337,共14页
Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions.However,camouflage object segmentation is increasingly challenging du... Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions.However,camouflage object segmentation is increasingly challenging due to advances in both camouflage materials and biological mimicry.Although multispectral-RGB based technology shows promise,conventional dual-aperture multispectral-RGB imaging systems are constrained by imprecise and time-consuming registration and fusion across different modalities,limiting their performance.Here,we propose the Reconstructed Multispectral-RGB Fusion Network(RMRF-Net),which reconstructs RGB images into multispectral ones,enabling efficient multimodal segmentation using only an RGB camera.Specifically,RMRF-Net employs a divergentsimilarity feature correction strategy to minimize reconstruction errors and includes an efficient boundary-aware decoder to enhance object contours.Notably,we establish the first real-world aerial multispectral-RGB semantic segmentation of camouflage objects dataset,including 11 object categories.Experimental results demonstrate that RMRF-Net outperforms existing methods,achieving 17.38 FPS on the NVIDIA Jetson AGX Orin,with only a 0.96%drop in mIoU compared to the RTX 3090,showing its practical applicability in multimodal remote sensing. 展开更多
关键词 Camouflage object detection Reconstructed multispectral image(MSI) Unmanned aerial vehicle(UAV) Semantic segmentation Remote sensing
在线阅读 下载PDF
An intelligent detection method for directional bolt hole objects of shield tunnel lining structures
11
作者 Yiding Ma Dechun Lu +3 位作者 Fanchao Kong Tao Tian Dongmei Zhang Xiuli Du 《Journal of Rock Mechanics and Geotechnical Engineering》 2025年第12期7555-7569,共15页
Most image-based object detection methods employ horizontal bounding boxes(HBBs)to capture objects in tunnel images.However,these bounding boxes often fail to effectively enclose objects oriented in arbitrary directio... Most image-based object detection methods employ horizontal bounding boxes(HBBs)to capture objects in tunnel images.However,these bounding boxes often fail to effectively enclose objects oriented in arbitrary directions,resulting in reduced accuracy and suboptimal detection performance.Moreover,HBBs cannot provide directional information for rotated objects.This study proposes a rotated detection method for identifying apparent defects in shield tunnels.Specifically,the oriented region-convolutional neural network(oriented R-CNN)is utilized to detect rotated objects in tunnel images.To enhance feature extraction,a novel hybrid backbone combining CNN-based networks with Swin Transformers is proposed.A feature fusion strategy is employed to integrate features extracted from both networks.Additionally,a neck network based on the bidirectional-feature pyramid network(Bi-FPN)is designed to combine multi-scale object features.The bolt hole dataset is curated to evaluate the efficacyof the proposed method.In addition,a dedicated pre-processing approach is developed for large-sized images to accommodate the rotated,dense,and small-scale characteristics of objects in tunnel images.Experimental results demonstrate that the proposed method achieves a more than 4%improvement in mAP_(50-95)compared to other rotated detectors and a 6.6%-12.7%improvement over mainstream horizontal detectors.Furthermore,the proposed method outperforms mainstream methods by 6.5%-14.7%in detecting leakage bolt holes,underscoring its significant engineering applicability. 展开更多
关键词 Apparent defects of shield tunnels Rotated object detection Swin transformer Oriented region-convolutional neural network(oriented R-CNN)
在线阅读 下载PDF
育种新时代水稻杂交育种技术与策略探讨 被引量:1
12
作者 吕文彦 程海涛 +1 位作者 马兆惠 田淑华 《中国农业科学》 北大核心 2026年第2期233-238,共6页
随着时间与技术的发展,作物育种经历了1.0到4.0世代,正向育种5.0世代发展。目前,虽然育种3.0世代和育种4.0世代得到广泛重视,但只有育种2.0世代的杂交育种才能够使亲本实现全基因组重组,出现基因内和基因间大量的、复杂的和不可预见的互... 随着时间与技术的发展,作物育种经历了1.0到4.0世代,正向育种5.0世代发展。目前,虽然育种3.0世代和育种4.0世代得到广泛重视,但只有育种2.0世代的杂交育种才能够使亲本实现全基因组重组,出现基因内和基因间大量的、复杂的和不可预见的互作,可能这才是导致突破性性状产生的基础,因此,在育种新时代背景下,杂交育种依然占有重要地位。但目前,以水稻为例,在科学性和有效性方面,广大育种工作者在杂交育种操作上仍然存在提高的空间。为选育高产、优质、多抗品种,克服品种的同质化,水稻杂交育种应注意以下几点:(1)育种目标要结合当地的自然条件,协调有利性状组配,使高产、优质、多抗的目标性状与具体品种相结合,避免品种同质化。(2)由于F_(1)综合双亲优良性状且具有一定的杂种优势,可能是同一组合表现最好的世代,F_(1)综合表现不良,其后代很难出现符合育种目标的期望类型。因此,此世代应作为一个重点选择世代,有利于提高育种效率。(3)在育种早代,因为主要是进行世代的促进,为提高育种效能,应采取直播形式,从而节省土地和资源。而育种中代应与早代测验相结合,以增强预见性,进一步筛选组合,提高育种效率。(4)高世代选择时,应在田间筛选后,进一步在室内比较组合间的穗部性状,选出最优组合,以实现优中选优。(5)育种5.0世代的智能型品种就是能够适应广域环境的生态与生物因子,并能满足生产需要的广适性品种,由于作物生长环境条件的复杂性,为实现广适性育种目标,应对品种进行多年、多点的广泛鉴定。总之,通过优化杂交育种的田间操作和选择技术,会大大提高育种效率,为选育出突破性品种奠定基础。 展开更多
关键词 水稻 杂交育种 育种目标 选择技术 世代促进 广适性
在线阅读 下载PDF
基于改进YOLO11n-seg的蟹塘水草清理路径规划
13
作者 胡庆松 杨尚青 +4 位作者 陈雷雷 李俊 马天利 张晓苓 李东波 《上海海洋大学学报》 北大核心 2026年第2期417-430,共14页
为了提升河蟹养殖池塘水草管理水平,本研究结合并优化图像处理与水草清理船工作路径规划方法,以形成高质量水草管控方案。实验采用无人机航拍采集蟹塘不同时期的图像,提出一种改进YOLO11nseg网络模型,融入动态上采样算子的轻量化高级筛... 为了提升河蟹养殖池塘水草管理水平,本研究结合并优化图像处理与水草清理船工作路径规划方法,以形成高质量水草管控方案。实验采用无人机航拍采集蟹塘不同时期的图像,提出一种改进YOLO11nseg网络模型,融入动态上采样算子的轻量化高级筛选路径聚合网络(High-level screening path aggregation network-dysample, HSPAN-D)模块对Neck层面进行改进,使用轻量化的特征提取模块C3k2_Faster_EMA替换原有的C3k2,并引入EfficientHead轻量化分割头。在模型识别与处理结果基础上构建蟹塘栅格地图,并设计水草目标清理区域筛选机制,通过路径优化策略改进A^(*)算法实现水草清理路径规划。结果显示,改进模型在参数量下降39.4%、计算量减少25.5%及模型体积缩减34.5%的条件下将水草识别精确率提高了1.6%,mAP提升了0.5%;改进A^(*)算法规划路径相对人工清理路径对水草面积占比控制更精准,相对原A^(*)算法总路径长度减少14.25 m,清理船转向减少10次,规划平均用时减少1.97 s。研究表明所提出的轻量化改进策略在显著降低模型计算负担的同时提升了识别精度,结合改进路径规划算法可有效实现蟹塘水草的高效精准清理。本研究可为水草清理船实际作业提供有效的路径规划参考。 展开更多
关键词 水草清理船 蟹塘 航拍图像 路径规划 YOLO11n-seg A^(*)算法
原文传递
基于多尺度特征增强的航拍小目标检测算法 被引量:1
14
作者 肖剑 何昕泽 +2 位作者 程鸿亮 杨小苑 胡欣 《浙江大学学报(工学版)》 北大核心 2026年第1期19-31,共13页
针对航拍图像小目标检测中存在的检测精度低和模型参数量大的问题,提出兼顾性能与资源消耗的航拍小目标检测算法.以YOLOv8s为基准网络,通过降低通道维数和加强对高频特征的关注,提出自适应细节增强模块(ADEM),在减少冗余信息的同时加强... 针对航拍图像小目标检测中存在的检测精度低和模型参数量大的问题,提出兼顾性能与资源消耗的航拍小目标检测算法.以YOLOv8s为基准网络,通过降低通道维数和加强对高频特征的关注,提出自适应细节增强模块(ADEM),在减少冗余信息的同时加强对小目标细粒度特征的捕获;基于PAN-FPN架构调整特征融合网络,增加对浅层特征的关注,同时引入多尺度卷积核增强对目标上下文信息的关注,以适应小目标检测场景;针对传统IoU灵活性、泛化性不强的问题,构建参数可调的Nin-IoU,通过引入可调参数,实现对IoU的针对性调整,以适应不同检测任务的需求;提出轻量化检测头,在增强多尺度特征信息交融的同时减少冗余信息的传递.结果表明,在VisDrone2019数据集上,所提算法以8.08×106的参数量实现了mAP0.5=50.3%的检测精度;相较于基准算法YOLOv8s,参数量降低了27.4%,精度提升了11.5个百分点.在DOTA与DIOR数据集上的实验结果表明,所提算法具有较强的泛化能力. 展开更多
关键词 目标检测 YOLOv8 无人机图像 特征融合 损失函数
在线阅读 下载PDF
YOLO-DyMiF:一种面向低算力平台的动态多尺度交通标志检测网络
15
作者 宋绍剑 李昊 +1 位作者 李刚 李国进 《液晶与显示》 北大核心 2026年第3期388-401,共14页
为了解决自动驾驶场景中交通标志目标体积小、易被环境干扰而导致检测精度低,以及车载平台算力和功耗有限、难以支撑复杂模型的问题,本文提出了一种改进的轻量化检测算法YOLO-DyMiF(Dynamic Mixer and Feature Fusion)。该模型在YOLOv10... 为了解决自动驾驶场景中交通标志目标体积小、易被环境干扰而导致检测精度低,以及车载平台算力和功耗有限、难以支撑复杂模型的问题,本文提出了一种改进的轻量化检测算法YOLO-DyMiF(Dynamic Mixer and Feature Fusion)。该模型在YOLOv10n的基础上进行了两方面改进:首先,设计一种基于动态高效卷积(Adaptive Efficient Conv,AEConv)的高效动态混合器(Efficient Dynamic Mixer Structure,EDMS),并将其嵌入C3k2模块以构建C3k2_EDMS模块,用于替换YOLOv10n模型中的C2f模块,在保持主干网络特征表达能力的前提下有效压缩参数规模;其次,设计了以分层多尺度空间增强模块(Hierarchical Multi-scale Spatial Enhancement,HMSE)为核心的动态特征融合颈部网络,它通过跨层交互和自适应加权融合增强多尺度特征表征能力,在兼顾中、大目标检测性能的同时提升小目标交通标志检测精度。在TT100K数据集上的实验结果表明,与当前领先的Mamba-YOLOt相比,YOLO-DyMiF算法的mAP50提高1%,模型参数量下降了58.3%,计算量下降了42.3%。所提出的模型能够在确保高检测精度的同时显著降低计算成本,可以为自动驾驶场景中的交通标志检测提供可靠的技术支持。 展开更多
关键词 目标检测 交通标志 自动驾驶 多尺度目标 边缘计算
在线阅读 下载PDF
基于稀疏点匹配的协同式未知目标跟踪方法
16
作者 郎荣玲 魏才伦 +1 位作者 范亚 高飞 《航空学报》 北大核心 2026年第3期83-95,共13页
对未知目标的实时感知与持续跟踪是智能系统自主决策的重要前提,在实际应用中存在缺乏目标类别先验信息和训练样本匮乏的问题,使得未知目标的感知与跟踪更具挑战性。针对此问题,提出了一种基于任意分割模型(SAM)与稀疏特征点匹配的未知... 对未知目标的实时感知与持续跟踪是智能系统自主决策的重要前提,在实际应用中存在缺乏目标类别先验信息和训练样本匮乏的问题,使得未知目标的感知与跟踪更具挑战性。针对此问题,提出了一种基于任意分割模型(SAM)与稀疏特征点匹配的未知目标跟踪方法。该方法首先通过提示点引导SAM模型感知并分割图像中的未知目标,随后利用基于卷积神经网络的特征点提取模型,获取目标图像的稀疏特征点作为目标信息,并通过基于注意力机制的匹配网络在后续帧中匹配这些特征点,完成目标信息传播。在此基础上,设计了一个基于特征点一致性的迭代式SAM模块(ISPC),利用匹配的特征点持续引导SAM模型对后续图像帧的目标进行分割,从而实现未知目标的稳定跟踪。此外基于稀疏特征点的轻量化目标信息,可以在多智能体之间高效共享,构建了一个协同式目标跟踪系统。在DAVIS 2017数据集和自构建的近红外视频数据集上,评估了系统的目标跟踪性能与零训练样本目标的泛化能力。实验结果表明,该方法在处理未知类别目标的协同感知与跟踪任务中,表现出良好的鲁棒性和准确性。 展开更多
关键词 目标跟踪 目标分割 特征提取 特征匹配 协同感知
原文传递
DUIE-YOLO:一种基于图像增强的水下鱿鱼目标检测算法 被引量:1
17
作者 曹莉凌 胡浩宇 曹守启 《上海海洋大学学报》 北大核心 2026年第1期254-269,共16页
为了解决水下图像因模糊和色偏导致的目标检测精度下降问题,提升复杂水下环境中鱿鱼检测的准确性和鲁棒性,本研究提出一种基于图像增强的水下鱿鱼检测算法DUIE-YOLO,采用“先增强后检测”的级联框架,由DUIE-Net增强模块和YOLOv8-HD检测... 为了解决水下图像因模糊和色偏导致的目标检测精度下降问题,提升复杂水下环境中鱿鱼检测的准确性和鲁棒性,本研究提出一种基于图像增强的水下鱿鱼检测算法DUIE-YOLO,采用“先增强后检测”的级联框架,由DUIE-Net增强模块和YOLOv8-HD检测模块组成。DUIE-Net模块通过颜色校正、多尺度特征融合、特征恢复与增强及去雾优化,显著提升图像质量;YOLOv8-HD检测模块结合FasterNet网络、小目标检测头、CoordAttention注意力机制及ShapeIoU损失函数,优化特征提取能力与小目标检测精度。实验结果表明,DUIE-YOLO相比原始YOLOv8n在Precision、Recall、F1-score和mAP等4个关键指标上分别提升4.2%、6.8%、5.7%和5.5%。联合实验结果显示,DUIE-Net与YOLOv8-HD的组合相比基线(Raw+YOLOv8n),mAP提升40.3%,Precision提升10.5%,Recall提升53%,F1-score提升31%,证明该算法具有显著的级联优化效果。研究表明,DUIE-YOLO通过图像增强与检测模块的协同优化,有效解决了水下图像质量差导致的检测性能下降问题。本研究为复杂水下环境中的目标识别提供了高精度的解决方案,对海洋生物监测与资源开发具有重要应用价值。 展开更多
关键词 水下鱿鱼检测 目标检测 图像增强 多尺度特征融合 YOLOv8
原文传递
基于YOLOv8s多阶段算法的幼猪吮乳行为识别研究
18
作者 陈创业 刘兹豪 +4 位作者 胡天让 谢晓丽 李洋 陈立涛 刘根新 《农机化研究》 北大核心 2026年第3期185-193,共9页
针对幼猪吮乳行为识别精度不足和个体目标跟踪困难的问题,采用以计算机视觉为基础的自动检测体系,整合YOLOv8s、DeepSORT、LSTM 3个算法模块,提出了一种多阶段的行为识别方法。首先,通过YOLOv8s对视频里的幼猪目标进行实时检测,再借助De... 针对幼猪吮乳行为识别精度不足和个体目标跟踪困难的问题,采用以计算机视觉为基础的自动检测体系,整合YOLOv8s、DeepSORT、LSTM 3个算法模块,提出了一种多阶段的行为识别方法。首先,通过YOLOv8s对视频里的幼猪目标进行实时检测,再借助DeepSORT算法来实行跨帧目标追踪并分配唯一标识;然后,把多张连续检测图片输入到LSTM模型里进行时序建模,从而判定出该段时间范围内的幼猪是否正在吮乳。于养殖场的母猪产房拍摄了26 320张照片、采集了4 930组行为序列数据集进行试验,结果表明,在mAP@0.5评价标准下,以YOLOv8s模型为基准的目标检测准确率为91.7%,召回率为92.3%,系统整体追踪准确值(MOTA)达到85.6%,且系统可在复杂的养殖环境下做到稳定运行。将该系统布置到云端平台上,可进行云端处理、数据可视化和远程监控等功能,即时展示每头幼猪的吮乳次数和时长,快速找出进食异常的幼猪个体,优化管理效率。 展开更多
关键词 幼猪行为识别 目标检测 多目标跟踪 时序模型 吮乳监测 智能养殖
在线阅读 下载PDF
RIC-YOLOv8n:矿下料车超挂轻量化实时检测算法
19
作者 丁玲 李露 +1 位作者 李永康 赵作鹏 《计算机工程与应用》 北大核心 2026年第2期371-383,共13页
针对矿井下作业环境复杂、光照不足、煤尘干扰等因素导致的传统目标检测算法在检测矿下料车超挂时表现不佳问题,提出了一种料车超挂轻量化实时检测算法RIC-YOLOv8n。使用轻量化模块C2f_RegNetY替换YOLOv8n中主干和颈部网络中的C2f模块,... 针对矿井下作业环境复杂、光照不足、煤尘干扰等因素导致的传统目标检测算法在检测矿下料车超挂时表现不佳问题,提出了一种料车超挂轻量化实时检测算法RIC-YOLOv8n。使用轻量化模块C2f_RegNetY替换YOLOv8n中主干和颈部网络中的C2f模块,减少了模型参数量并加快了模型推理速度;为了提高检测头的特征提取性能,采用联合信息对齐学习方法增强分类和回归任务的对齐能力;通过DeepSort进行矿下料车的目标追踪,设计了Residual_IBN模块替换DeepSort特征提取网络中的残差网络,提高了目标追踪的性能。通过自制的矿下料车检测与跟踪数据集进行算法验证,实验结果显示:RIC-YOLOv8n在矿下料车识别平均精度达到91.4%,基于RICYOLOv8n和改进的DeepSort目标追踪算法在多目标追踪准确率达到89.13%,检测速度达到61 FPS。提出的RICYOLOv8n和改进的DeepSort算法能较好的平衡检测速度与精度,适用于矿井下料车检测实时性作业的需要。 展开更多
关键词 目标检测 目标追踪 YOLOv8n 联合对齐解耦头 DeepSort 料车计数
在线阅读 下载PDF
青贮饲料收获机自动跟随抛送系统研究现状与发展趋势 被引量:1
20
作者 张姬 孙振洋 +3 位作者 宋占华 于镇伟 闫云鹏 田富洋 《农机化研究》 北大核心 2026年第2期284-292,共9页
青贮饲料因具有生产成本低、收获效益高、原料易得和营养均衡等优点,逐渐成为畜牧产业的主要饲料。传统青贮收获作业中人工依赖度高、抛料均匀性不足且抛送筒控制人员存在一定的安全隐患。青贮饲料收获机自动跟随抛送系统通过信息采集... 青贮饲料因具有生产成本低、收获效益高、原料易得和营养均衡等优点,逐渐成为畜牧产业的主要饲料。传统青贮收获作业中人工依赖度高、抛料均匀性不足且抛送筒控制人员存在一定的安全隐患。青贮饲料收获机自动跟随抛送系统通过信息采集设备实时获取料箱位置与环境动态信息,根据设定的青贮饲料填充模式进行抛送作业,解析填充状态,同时液压伺服控制系统根据识别定位情况动态调节抛送筒旋转角度与出料高度,实现青贮饲料落料点的控制。本文系统综述了当前国内外青贮饲料收获机自动跟随抛送系统的研究现状;分析了机器视觉、激光雷达与传感器在自动抛送系统中的工作原理与具体应用方法;针对我国青贮饲料收获机自动跟随抛送系统发展存在的问题,提出了研发多模态感知架构、开发高动态液压伺服系统与低惯量抛送筒材料、构建“青贮机-伴随车”群体协同作业模式的建议;同时,对青贮饲料收获机自动跟随抛送系统的发展方向进行预测,以期为我国青贮饲料收获机自动跟随抛送系统的研究提供参考。 展开更多
关键词 青贮饲料收获机 自动跟随抛送系统 机器视觉 激光雷达 目标检测
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部