期刊文献+
共找到3,269篇文章
< 1 2 164 >
每页显示 20 50 100
Multi-resolution image segmentation based on Gaussian mixture model 被引量:5
1
作者 Tang Yinggan Liu Dong Guan Xinping 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2006年第4期870-874,共5页
Mixture model based image segmentation method, which assumes that image pixels are independent and do not consider the position relationship between pixels, is not robust to noise and usually leads to misclassificatio... Mixture model based image segmentation method, which assumes that image pixels are independent and do not consider the position relationship between pixels, is not robust to noise and usually leads to misclassification. A new segmentation method, called multi-resolution Ganssian mixture model method, is proposed. First, an image pyramid is constructed and son-father link relationship is built between each level of pyramid. Then the mixture model segmentation method is applied to the top level. The segmentation result on the top level is passed top-down to the bottom level according to the son-father link relationship between levels. The proposed method considers not only local but also global information of image, it overcomes the effect of noise and can obtain better segmentation result. Experimental result demonstrates its effectiveness. 展开更多
关键词 image segmentation multi-resolution Ganssian mixture model.
在线阅读 下载PDF
Multi-resolution texture segmentation using fractal dimension
2
作者 Hsu Taoi HU Kuo-Jui WANG Je-chuang 《通讯和计算机(中英文版)》 2009年第11期30-33,42,共5页
关键词 分形维数 纹理分割 多分辨率 应用 维数计算 框架基础 纹理边界 边缘检测
在线阅读 下载PDF
Multi-Atlas Based Methods in Brain MR Image Segmentation 被引量:1
3
作者 孙亮 张丽 张道强 《Chinese Medical Sciences Journal》 CAS CSCD 2019年第2期110-119,共10页
Brain region-of-interesting (ROI) segmentation is an important prerequisite step for many computeraid brain disease analyses.However,the human brain has the complicated anatomical structure.Meanwhile,the brain MR imag... Brain region-of-interesting (ROI) segmentation is an important prerequisite step for many computeraid brain disease analyses.However,the human brain has the complicated anatomical structure.Meanwhile,the brain MR images often suffer from the low intensity contrast around the boundary of ROIs,large inter-subject variance and large inner-subject variance.To address these issues,many multi-atlas based segmentation methods are proposed for brain ROI segmentation in the last decade.In this paper,multi-atlas based methods for brain MR image segmentation were reviewed regarding several registration toolboxes which are widely used in the multi-atlas methods,conventional methods for label fusion,datasets that have been used for evaluating the multiatlas methods,as well as the applications of multi-atlas based segmentation in clinical researches.We propose that incorporating the anatomical prior into the end-to-end deep learning architectures for brain ROI segmentation is an important direction in the future. 展开更多
关键词 multi-atlas BRAIN segmentation MAGNETIC RESONANCE
暂未订购
Classification and Extraction of Urban Land-Use Information from High-Resolution Image Based on Object Multi-features 被引量:7
4
作者 孔春芳 徐凯 吴冲龙 《Journal of China University of Geosciences》 SCIE CSCD 2006年第2期151-157,共7页
Urban land provides a suitable location for various economic activities which affect the development of surrounding areas. With rapid industrialization and urbanization, the contradictions in land-use become more noti... Urban land provides a suitable location for various economic activities which affect the development of surrounding areas. With rapid industrialization and urbanization, the contradictions in land-use become more noticeable. Urban administrators and decision-makers seek modern methods and technology to provide information support for urban growth. Recently, with the fast development of high-resolution sensor technology, more relevant data can be obtained, which is an advantage in studying the sustainable development of urban land-use. However, these data are only information sources and are a mixture of "information" and "noise". Processing, analysis and information extraction from remote sensing data is necessary to provide useful information. This paper extracts urban land-use information from a high-resolution image by using the multi-feature information of the image objects, and adopts an object-oriented image analysis approach and multi-scale image segmentation technology. A classification and extraction model is set up based on the multi-features of the image objects, in order to contribute to information for reasonable planning and effective management. This new image analysis approach offers a satisfactory solution for extracting information quickly and efficiently. 展开更多
关键词 urban land-use multi-features OBJECT-ORIENTED segmentation CLASSIFICATION extraction.
在线阅读 下载PDF
A Local Contrast Fusion Based 3D Otsu Algorithm for Multilevel Image Segmentation 被引量:13
5
作者 Ashish Kumar Bhandari Arunangshu Ghosh Immadisetty Vinod Kumar 《IEEE/CAA Journal of Automatica Sinica》 EI CSCD 2020年第1期200-213,共14页
To overcome the shortcomings of 1 D and 2 D Otsu’s thresholding techniques, the 3 D Otsu method has been developed.Among all Otsu’s methods, 3 D Otsu technique provides the best threshold values for the multi-level ... To overcome the shortcomings of 1 D and 2 D Otsu’s thresholding techniques, the 3 D Otsu method has been developed.Among all Otsu’s methods, 3 D Otsu technique provides the best threshold values for the multi-level thresholding processes. In this paper, to improve the quality of segmented images, a simple and effective multilevel thresholding method is introduced. The proposed approach focuses on preserving edge detail by computing the 3 D Otsu along the fusion phenomena. The advantages of the presented scheme include higher quality outcomes, better preservation of tiny details and boundaries and reduced execution time with rising threshold levels. The fusion approach depends upon the differences between pixel intensity values within a small local space of an image;it aims to improve localized information after the thresholding process. The fusion of images based on local contrast can improve image segmentation performance by minimizing the loss of local contrast, loss of details and gray-level distributions. Results show that the proposed method yields more promising segmentation results when compared to conventional1 D Otsu, 2 D Otsu and 3 D Otsu methods, as evident from the objective and subjective evaluations. 展开更多
关键词 1D Otsu 2D Otsu 3D Otsu image fusion local contrast multi-level image segmentation
在线阅读 下载PDF
A Semi-Vectorial Hybrid Morphological Segmentation of Multicomponent Images Based on Multithreshold Analysis of Multidimensional Compact Histogram 被引量:1
6
作者 Adles Kouassi Sié Ouattara +2 位作者 Jean-Claude Okaingni Wognin J. Vangah Alain Clement 《Open Journal of Applied Sciences》 2017年第11期597-610,共14页
In this work, we propose an original approach of semi-vectorial hybrid morphological segmentation for multicomponent images or multidimensional data by analyzing compact multidimensional histograms based on different ... In this work, we propose an original approach of semi-vectorial hybrid morphological segmentation for multicomponent images or multidimensional data by analyzing compact multidimensional histograms based on different orders. Its principle consists first of segment marginally each component of the multicomponent image into different numbers of classes fixed at K. The segmentation of each component of the image uses a scalar segmentation strategy by histogram analysis;we mainly count the methods by searching for peaks or modes of the histogram and those based on a multi-thresholding of the histogram. It is the latter that we have used in this paper, it relies particularly on the multi-thresholding method of OTSU. Then, in the case where i) each component of the image admits exactly K classes, K vector thresholds are constructed by an optimal pairing of which each component of the vector thresholds are those resulting from the marginal segmentations. In addition, the multidimensional compact histogram of the multicomponent image is computed and the attribute tuples or ‘colors’ of the histogram are ordered relative to the threshold vectors to produce (K + 1) intervals in the partial order giving rise to a segmentation of the multidimensional histogram into K classes. The remaining colors of the histogram are assigned to the closest class relative to their center of gravity. ii) In the contrary case, a vectorial spatial matching between the classes of the scalar components of the image is produced to obtain an over-segmentation, then an interclass fusion is performed to obtain a maximum of K classes. Indeed, the relevance of our segmentation method has been highlighted in relation to other methods, such as K-means, using unsupervised and supervised quantitative segmentation evaluation criteria. So the robustness of our method relatively to noise has been tested. 展开更多
关键词 MORPHOLOGICAL segmentation Vectorial Orders Semi-Vectorial segmentation multiDIMENSIONAL COMPACT HISTOGRAM multi-Thresholds Fusion Inter-Class Classification
暂未订购
Multi-agent based evolutional algorithm in medicine image segmentation
7
作者 LIANG Jun XU Zheng-chuan MAO Dong-mei CHENG Xian-yi 《通讯和计算机(中英文版)》 2009年第6期31-35,共5页
关键词 医学图像 计算机技术 图像处理技术 自然纹理图像
在线阅读 下载PDF
A Multi-Agent Approach to Arabic Handwritten Text Segmentation
8
作者 Ashraf Elnagar Rahima Bentrcia 《Journal of Intelligent Learning Systems and Applications》 2012年第3期207-215,共9页
The segmentation of individual words into characters is a vital process in handwritten character recognition systems. In this paper, a novel approach is proposed to segment handwritten Arabic text (words). We consider... The segmentation of individual words into characters is a vital process in handwritten character recognition systems. In this paper, a novel approach is proposed to segment handwritten Arabic text (words). We consider the “Naskh” font style. The segmentation algorithm employs seven agents in order to detect regions where segmentation is illegal. Feature points (end points) are extracted from the remaining regions of the word-image. Initially, the middle of every two successive end points is considered as a candidate segmentation point based on a set of rules. The experimental results are very promising as we achieved a success rate of 86%. 展开更多
关键词 CHARACTER segmentation Handwritten Recognition Systems multi-AGENTS ARABIC HANDWRITING
在线阅读 下载PDF
High-resolution flood modeling of urban areas using MSN_Flood 被引量:3
9
作者 Michael Hartnett Stephen Nash 《Water Science and Engineering》 EI CAS CSCD 2017年第3期175-183,共9页
Although existing hydraulic models have been used to simulate and predict urban flooding, most of these models are inadequate due to the high spatial resolution required to simulate flows in urban floodplains. Nesting... Although existing hydraulic models have been used to simulate and predict urban flooding, most of these models are inadequate due to the high spatial resolution required to simulate flows in urban floodplains. Nesting high-resolution subdomains within coarser-resolution models is an efficient solution for enabling simultaneous calculation of flooding due to tides, surges, and high river flows. MSN_Flood has been developed to incorporate moving boundaries around nested domains, permitting alternate flooding and drying along the boundary and in the interior of the domain. Ghost cells adjacent to open boundary cells convert open boundaries, in effect, into internal boundaries. The moving boundary may be multi-segmented and non-continuous, with recirculating flow across the boundary. When combined with a bespoke adaptive interpolation scheme, this approach facilitates a dynamic internal boundary. Based on an alternating-direction semi-implicit finite difference scheme,MSN_Flood was used to hindcast a major flood event in Cork City resulting from the combined pressures of fluvial, tidal, and storm surge processes. The results show that the model is computationally efficient, as the 2-m high-resolution nest is used only in the urban flooded region.Elsewhere, lower-resolution nests are used. The results also show that the model is highly accurate when compared with measured data. The model is capable of incorporating nested sub-domains when the nested boundary is multi-segmented and highly complex with lateral gradients of elevation and velocities. This is a major benefit when modelling urban floodplains at very high resolution. 展开更多
关键词 multi-scale NESTING Surge-fluvial FLOODING Urban FLOODING multi-segmented BOUNDARY Moving BOUNDARY
在线阅读 下载PDF
基于Multi-WHFPN与SimAM注意力机制的版面分割 被引量:1
10
作者 杨陈慧 周小亮 +2 位作者 张恒 孙政 业宁 《电子测量技术》 北大核心 2024年第1期159-168,共10页
作为OCR的预处理工作,版面分割技术越来越受到学术界和工业界重视。针对版面分割中遇到的检测速度慢、目标区域边界不准确以及细小目标易遗漏等问题,提出了YOLOv7-MSY模型。此模型首先借鉴残差连接思想,提出了Multi-WHFPN网络结构。它... 作为OCR的预处理工作,版面分割技术越来越受到学术界和工业界重视。针对版面分割中遇到的检测速度慢、目标区域边界不准确以及细小目标易遗漏等问题,提出了YOLOv7-MSY模型。此模型首先借鉴残差连接思想,提出了Multi-WHFPN网络结构。它采用可训练的权重参数,突出特征融合过程中特征重要性,并添加了小目标检测头,从而提升对小目标的检测性能;其次,引入SimAM注意力机制,可以在不增加额外参数的基础上在3D维度评估特征权重,以增强重要特征,抑制无效特征;最后,使用YEIOU来代替原模型中的定位损失函数,提升了模型的收敛速度与回归精度。在江苏省档案馆提供的数据集上进行实验对比,YOLOv7-MSY对目标区域边界检测更加敏感,对细小目标的检测效果更好。YOLOv7-MSY的mAP@.5达到了0.871,相较于原YOLOv7模型提高了7.84%。该模型的版面分割的效果优于其他类型的版面分割算法,具有良好的泛化性能,并且版面分割速度处于较高水平。 展开更多
关键词 版面分割 YOLOv7-MSY multi-WHFPN SimAM注意力机制 YEIOU
原文传递
Multi-segment and Multi-ply Overlapping Process of Multi Coupled Activities Based on Valid Information Evolution 被引量:1
11
作者 WANG Zhiliang WANG Yunxia QIU Shenghai 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2013年第1期176-188,共13页
Complex product development will inevitably face the design planning of the multi-coupled activities, and overlapping these activities could potentially reduce product development time, but there is a risk of the addi... Complex product development will inevitably face the design planning of the multi-coupled activities, and overlapping these activities could potentially reduce product development time, but there is a risk of the additional cost. Although the downstream task information dependence to the upstream task is already considered in the current researches, but the design process overall iteration caused by the information interdependence between activities is hardly discussed; especially the impact on the design process' overall iteration from the valid information accumulation process. Secondly, most studies only focus on the single overlapping process of two activities, rarely take multi-segment and multi-ply overlapping process of multi coupled activities into account; especially the inherent link between product development time and cost which originates from the overlapping process of multi coupled activities. For the purpose of solving the above problems, as to the insufficiency of the accumulated valid information in overlapping process, the function of the valid information evolution (VIE) degree is constructed. Stochastic process theory is used to describe the design information exchange and the valid information accumulation in the overlapping segment, and then the planning models of the single overlapping segment are built. On these bases, by analyzing overlapping processes and overlapping features of multi-coupling activities, multi-segment and multi-ply overlapping planning models are built; by sorting overlapping processes and analyzing the construction of these planning models, two conclusions are obtained: (1) As to multi-segment and multi-ply overlapping of multi coupled activities, the total decrement of the task set development time is the sum of the time decrement caused by basic overlapping segments, and minus the sum of the time increment caused by multiple overlapping segments; (2) the total increment of development cost is the sum of the cost increment caused by all overlapping process. And then, based on overlapping degree analysis of these planning models, by the V1E degree function, the four lemmas theory proofs are represented, and two propositions are finally proved: (1) The multi-ply overlapping of the multi coupled activities will weaken the basic overlapping effect on the development cycle time reduction (2) Overlapping the multi coupled activities will decrease product development cycle, but increase product development cost. And there is trade-off between development time and cost. And so, two methods are given to slacken and eliminate multi-ply overlapping effects. At last, an example about a vehicle upper subsystem design illustrates the application of the proposed models; compared with a sequential execution pattern, the decreasing of development cycle (22%) and the increasing of development cost (3%) show the validity of the method in the example The proposed research not only lays a theoretical foundation for correctly planning complex product development process, but also provides specific and effective operation methods for overlapping multi coupled activities. 展开更多
关键词 multi coupled activities valid information evolution multi-segment multi-ply overlapping development time and cost trade-ofl iteration
在线阅读 下载PDF
Creation of Multiple Subwavelength Focal Spot Segments Using Phase Modulated Radially Polarized Multi Gaussian Beam
12
作者 K.Prabakaran K.B.Rajesh +4 位作者 S.Sumathira M.D.Bharathi R.Hemamalini A.M.Musthafa V.Aroulmoji 《Chinese Physics Letters》 SCIE CAS CSCD 2016年第9期48-51,共4页
Based on the vector diffraction theory, the effect of complex phase filters on intensity distribution of a radially polarized multi Gaussian beam in the focal region of high NA lens is theoretically investigated. It i... Based on the vector diffraction theory, the effect of complex phase filters on intensity distribution of a radially polarized multi Gaussian beam in the focal region of high NA lens is theoretically investigated. It is observed that a properly designed multi belt complex phase filter can generate subwavelength novel focal patterns including splitting of focal spots and generation of multiple focal spot segments such as eight, six and four focal spots along the optical axis are obtained. We expect that such an investigation is useful for optical manipulation and material processing, multiple high refractive index particle trapping technologies. 展开更多
关键词 of for Creation of multiple Subwavelength Focal Spot segments Using Phase Modulated Radially Polarized multi Gaussian Beam on is in
原文传递
DDoS Defense Algorithm Based on Multi-Segment Timeout Technology 被引量:1
13
作者 DU Ruizhong YANG Xiaohui MA Xiaoxue HE Xinfeng 《Wuhan University Journal of Natural Sciences》 CAS 2006年第6期1823-1826,共4页
Through the analysis to the DDoS(distributed denial of service) attack, it will conclude that at different time segments, the arrive rate of normal SYN (Synchronization) package are similar, while the abnormal pac... Through the analysis to the DDoS(distributed denial of service) attack, it will conclude that at different time segments, the arrive rate of normal SYN (Synchronization) package are similar, while the abnormal packages are different with the normal ones. Toward this situation a DDoS defense algorithm based on multi-segment timeout technology is presented, more than one timeout segment are set to control the net flow. Experiment results show that in the case of little flow, multi-segment timeout has the ability dynamic defense, so the system performance is improved and the system has high response rate. 展开更多
关键词 DDoS(distributed denial of service) multi-segments timeout dynamic defense net flow analysis
在线阅读 下载PDF
An intelligent target-segmentation algorithm for aerial images
14
作者 陈东 王炎 崔有志 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 1999年第3期16-17,共2页
Knowledge-based multi-feaure-fusion and multi-resolution analysis is for adapive image segment -ation and experimental results show it can be used to extraot targets from complicated backgcround with the help of Prior... Knowledge-based multi-feaure-fusion and multi-resolution analysis is for adapive image segment -ation and experimental results show it can be used to extraot targets from complicated backgcround with the help of Priorknowledge and artificial intelligence. 展开更多
关键词 TARGET segmentation prior KNOWLEDGE multi-feature-fusion and multi-resolution-analysis
在线阅读 下载PDF
High-power and high optical conversion efficiency diode-end-pumped laser with multi-segmented Nd:YAG/Nd:YVO-4
15
作者 Meng-Yao Wu Peng-Fei Qu +3 位作者 Shi-Yu Wang SEl Zhen Guo De-Fang Cai Bing-Bin Li 《Chinese Physics B》 SCIE EI CAS CSCD 2018年第9期306-310,共5页
A novel flat-flat resonator consisting of two crystals(Nd:YAG + Nd:YVO4) is established for power scaling in a diode-end-pumped solid-state laser. We systematically compare laser characteristics between multi-seg... A novel flat-flat resonator consisting of two crystals(Nd:YAG + Nd:YVO4) is established for power scaling in a diode-end-pumped solid-state laser. We systematically compare laser characteristics between multi-segmented(Nd:YAG + Nd:YVO4) and conventional composite(Nd:YAG + Nd:YAG) crystals to demonstrate the feasibility of spectral line matching for output power scale-up in end-pumped lasers. A maximum continuous-wave output power of 79.2 W is reported at 1064 nm, with Mx2= 4.82, My2= 5.48, and a pumping power of 136 W in the multi-segmented crystals(Nd:YAG + Nd:YVO4). Compared to conventional composite crystals(Nd:YAG + Nd:YAG), the optical-optical conversion efficiency of multi-segmented crystals(Nd:YAG + Nd:YVO4) from 808 nm to 1064 nm is enhanced from 30% to 58.8%,while the laser output sensitivity as affected by the diode-laser temperature is reduced from 55% to 9%. 展开更多
关键词 diode-pumped solid-state laser multi-segmented crystals(Nd:YAG Nd:YVO4) spectral line matching diode-laser temperature
原文传递
Medical Image Segmentation Based on Wavelet Analysis and Gradient Vector Flow
16
作者 Ji Zhao Lina Zhang Minmin Yin 《Journal of Software Engineering and Applications》 2014年第12期1019-1030,共12页
Medical image segmentation is one of the key technologies in computer aided diagnosis. Due to the complexity and diversity of medical images, the wavelet multi-scale analysis is introduced into GVF (gradient vector fl... Medical image segmentation is one of the key technologies in computer aided diagnosis. Due to the complexity and diversity of medical images, the wavelet multi-scale analysis is introduced into GVF (gradient vector flow) snake model. The modulus values of each scale and phase angle values are calculated using wavelet transform, and the local maximum points of modulus values, which are the contours of the object edges, are obtained along phase angle direction at each scale. Then, location of the edges of the object and segmentation is implemented by GVF snake model. The experiments on some medical images show that the improved algorithm has small amount of computation, fast convergence and good robustness to noise. 展开更多
关键词 Pattern Recognition IMAGE segmentation GVF SNAKE Model WAVELET multi-SCALE Analysis MEDICAL IMAGE
在线阅读 下载PDF
Distributed C-Means Algorithm for Big Data Image Segmentation on a Massively Parallel and Distributed Virtual Machine Based on Cooperative Mobile Agents
17
作者 Fatéma Zahra Benchara Mohamed Youssfi +2 位作者 Omar Bouattane Hassan Ouajji Mohammed Ouadi Bensalah 《Journal of Software Engineering and Applications》 2015年第3期103-113,共11页
The aim of this paper is to present a distributed algorithm for big data classification, and its application for Magnetic Resonance Images (MRI) segmentation. We choose the well-known classification method which is th... The aim of this paper is to present a distributed algorithm for big data classification, and its application for Magnetic Resonance Images (MRI) segmentation. We choose the well-known classification method which is the c-means method. The proposed method is introduced in order to perform a cognitive program which is assigned to be implemented on a parallel and distributed machine based on mobile agents. The main idea of the proposed algorithm is to execute the c-means classification procedure by the Mobile Classification Agents (Team Workers) on different nodes on their data at the same time and provide the results to their Mobile Host Agent (Team Leader) which computes the global results and orchestrates the classification until the convergence condition is achieved and the output segmented images will be provided from the Mobile Classification Agents. The data in our case are the big data MRI image of size (m × n) which is splitted into (m × n) elementary images one per mobile classification agent to perform the classification procedure. The experimental results show that the use of the distributed architecture improves significantly the big data segmentation efficiency. 展开更多
关键词 multi-Agent System DISTRIBUTED ALGORITHM BIG Data IMAGE segmentation MRI IMAGE C-MEANS ALGORITHM Mobile Agent
在线阅读 下载PDF
基于深度学习的轻量级实时图像分割方法研究 被引量:2
18
作者 李建锋 熊明强 +3 位作者 陈园琼 王宗达 向涛 孙培玮 《通信学报》 北大核心 2025年第2期176-190,共15页
针对深度学习在各领域应用中因模型复杂度提升而引发的计算与存储负担,尤其在图像分割任务中面临的算法复杂性、实时响应不足及高内存占用问题,提出了一种轻量级且高效的分割网络架构——多尺度叠加融合网络(MSFNet)。MSFNet设计了一个... 针对深度学习在各领域应用中因模型复杂度提升而引发的计算与存储负担,尤其在图像分割任务中面临的算法复杂性、实时响应不足及高内存占用问题,提出了一种轻量级且高效的分割网络架构——多尺度叠加融合网络(MSFNet)。MSFNet设计了一个双分支多尺度边界融合模块,该模块通过融合不同尺度的特征信息与边界细节,有效提升了图像分割精度,同时显著减少了模型参数量。实验结果表明,MSFNet在3个公开数据集上表现优异,其模型参数量仅为0.6×10^(6),在RTX 3070 GPU上处理大小为800像素×800像素的图像仅需12 ms,显著提升了分割任务的执行效率和资源利用率。因此,该模型特别适合应用于资源有限的边缘设备或移动设备中,为实时图像分割应用提供了有力的技术支撑。 展开更多
关键词 图像分割 轻量级实时网络 双分支多尺度边界融合模块
在线阅读 下载PDF
基于改进YOLOv8的交通场景实例分割算法 被引量:4
19
作者 赵南南 高翡晨 《计算机工程》 北大核心 2025年第1期198-207,共10页
提出一种基于改进型YOLOv8的实例分割算法(DE-YOLO)。为减少图像中复杂背景的干扰,引入高效多尺度注意力机制,跨维交互使各特征组内空间语义特征平均分布。在主干网络部分,使用可变形卷积DCNv2结合C2f卷积层,突破原始卷积限制,提升可变... 提出一种基于改进型YOLOv8的实例分割算法(DE-YOLO)。为减少图像中复杂背景的干扰,引入高效多尺度注意力机制,跨维交互使各特征组内空间语义特征平均分布。在主干网络部分,使用可变形卷积DCNv2结合C2f卷积层,突破原始卷积限制,提升可变性。为减小有害梯度并提升检测器精度,采用动态非单调聚焦机制Wise-交并比(WIoU)替代联合完全交并(CIoU)损失函数进行质量评估,优化检测框定位,提升分割精度。同时,通过开启Mixup数据增强处理,充实数据集,丰富训练特征,提升模型学习能力。实验结果表明,DE-YOLO在城市景观数据集Cityscapes中的掩模平均精度均值(mAPmask)较基准模型YOLOv8n-seg提高了2.0百分点,IoU阈值为0.5时的平均精度提升了3.2百分点,所提算法在提升精度的同时,保持了优良的检测速度和较少的参数量,模型参数量较同类模型低2.2~31.3百分点。 展开更多
关键词 YOLOv8网络 实例分割 高效多尺度注意力 可变形卷积 损失函数
在线阅读 下载PDF
BEV感知学习在自动驾驶中的应用综述 被引量:3
20
作者 黄德启 黄海峰 +1 位作者 黄德意 刘振航 《计算机工程与应用》 北大核心 2025年第6期1-21,共21页
自动驾驶感知模块中作为采集输入的传感器种类不断发展,要使多模态数据统一地表征出来变得愈加困难。BEV感知学习在自动驾驶感知任务模块中可以使多模态数据统一融合到一个特征空间,相比于其他感知学习模型拥有更好的发展潜力。从研究... 自动驾驶感知模块中作为采集输入的传感器种类不断发展,要使多模态数据统一地表征出来变得愈加困难。BEV感知学习在自动驾驶感知任务模块中可以使多模态数据统一融合到一个特征空间,相比于其他感知学习模型拥有更好的发展潜力。从研究意义、空间部署、准备工作、算法发展及评价指标五个方面总结了BEV感知模型具有良好发展潜力的原因。BEV感知模型从框架角度概括为四个系列:Lift-Splat-Lss系列、IPM逆透视转换、MLP视图转换及Transformer视图转换;从输入数据概括为两类:第一类是纯图像特征的输入包括单目摄像头输入和多摄像头输入,第二类在融合数据输入中不仅是简单的点云数据和图像特征的数据融合,还包括了以点云数据为引导或监督的知识蒸馏融合和以引导切片方式去划分高度段的融合。概述了多目标追踪、地图分割、车道线检测及3D目标检测四种自动驾驶任务在BEV感知模型当中的应用,并总结了目前BEV感知学习四个系列框架的缺点。 展开更多
关键词 BEV感知学习 视图转换 多模态数据融合 多目标追踪 地图分割 车道线检测及3D目标检测
在线阅读 下载PDF
上一页 1 2 164 下一页 到第
使用帮助 返回顶部