期刊文献+
共找到261篇文章
< 1 2 14 >
每页显示 20 50 100
Scribble-Supervised Video Object Segmentation 被引量:3
1
作者 Peiliang Huang Junwei Han +2 位作者 Nian Liu Jun Ren Dingwen Zhang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2022年第2期339-353,共15页
Recently,video object segmentation has received great attention in the computer vision community.Most of the existing methods heavily rely on the pixel-wise human annotations,which are expensive and time-consuming to ... Recently,video object segmentation has received great attention in the computer vision community.Most of the existing methods heavily rely on the pixel-wise human annotations,which are expensive and time-consuming to obtain.To tackle this problem,we make an early attempt to achieve video object segmentation with scribble-level supervision,which can alleviate large amounts of human labor for collecting the manual annotation.However,using conventional network architectures and learning objective functions under this scenario cannot work well as the supervision information is highly sparse and incomplete.To address this issue,this paper introduces two novel elements to learn the video object segmentation model.The first one is the scribble attention module,which captures more accurate context information and learns an effective attention map to enhance the contrast between foreground and background.The other one is the scribble-supervised loss,which can optimize the unlabeled pixels and dynamically correct inaccurate segmented areas during the training stage.To evaluate the proposed method,we implement experiments on two video object segmentation benchmark datasets,You Tube-video object segmentation(VOS),and densely annotated video segmentation(DAVIS)-2017.We first generate the scribble annotations from the original per-pixel annotations.Then,we train our model and compare its test performance with the baseline models and other existing works.Extensive experiments demonstrate that the proposed method can work effectively and approach to the methods requiring the dense per-pixel annotations. 展开更多
关键词 Convolutional neural networks(CNNs) SCRIBBLE self-attention video object segmentation weakly supervised
在线阅读 下载PDF
Objective Performance Evaluation of Video Segmentation Algorithms with Ground-Truth 被引量:1
2
作者 杨高波 张兆扬 《Journal of Shanghai University(English Edition)》 CAS 2004年第1期70-74,共5页
While the development of particular video segmentation algorithms has attracted considerable research interest, relatively little effort has been devoted to provide a methodology for evaluating their performance. In t... While the development of particular video segmentation algorithms has attracted considerable research interest, relatively little effort has been devoted to provide a methodology for evaluating their performance. In this paper, we propose a methodology to objectively evaluate video segmentation algorithm with ground-truth, which is based on computing the deviation of segmentation results from the reference segmentation. Four different metrics based on classification pixels, edges, relative foreground area and relative position respectively are combined to address the spatial accuracy. Temporal coherency is evaluated by utilizing the difference of spatial accuracy between successive frames. The experimental results show the feasibility of our approach. Moreover, it is computationally more efficient than previous methods. It can be applied to provide an offline ranking among different segmentation algorithms and to optimally set the parameters for a given algorithm. 展开更多
关键词 video object segmentation performance evaluation MPEG-4.
在线阅读 下载PDF
Research on video motion object segmentation for content-based application
3
作者 包红强 ZHANG Zhao- yang +4 位作者 YU Song-yu WANG Suo-zhong WANG Nu-li FANG Yong WANG Zhi-gang 《Journal of Shanghai University(English Edition)》 CAS 2006年第2期142-143,共2页
With the development of the modern information society, more and more multimedia information is available. So the technology of multimedia processing is becoming the important task for the irrelevant area of scientist... With the development of the modern information society, more and more multimedia information is available. So the technology of multimedia processing is becoming the important task for the irrelevant area of scientist. Among of the multimedia, the visual informarion is more attractive due to its direct, vivid characteristic, but at the same rime the huge amount of video data causes many challenges if the video storage, processing and transmission. 展开更多
关键词 image processing video object segmentation spatiotemporal framework MPEG-4.
在线阅读 下载PDF
AUTOMATIC SEGMENTATION OF VIDEO OBJECT PLANES IN MPEG-4 BASED ON SPATIO-TEMPORAL INFORMATION
4
作者 XiaJinxiang HuangShunji 《Journal of Electronics(China)》 2004年第3期206-212,共7页
Segmentation of semantic Video Object Planes (VOP's) from video sequence is a key to the standard MPEG-4 with content-based video coding. In this paper, the approach of automatic Segmentation of VOP's Based on... Segmentation of semantic Video Object Planes (VOP's) from video sequence is a key to the standard MPEG-4 with content-based video coding. In this paper, the approach of automatic Segmentation of VOP's Based on Spatio-Temporal Information (SBSTI) is proposed.The proceeding results demonstrate the good performance of the algorithm. 展开更多
关键词 video sequence segmentation video object Plane (VOP) Based on spatiotemporal information MPEG-4
在线阅读 下载PDF
Evaluating quality of motion for unsupervised video object segmentation
5
作者 CHENG Guanjun SONG Huihui 《Optoelectronics Letters》 EI 2024年第6期379-384,共6页
Current mainstream unsupervised video object segmentation(UVOS) approaches typically incorporate optical flow as motion information to locate the primary objects in coherent video frames. However, they fuse appearance... Current mainstream unsupervised video object segmentation(UVOS) approaches typically incorporate optical flow as motion information to locate the primary objects in coherent video frames. However, they fuse appearance and motion information without evaluating the quality of the optical flow. When poor-quality optical flow is used for the interaction with the appearance information, it introduces significant noise and leads to a decline in overall performance. To alleviate this issue, we first employ a quality evaluation module(QEM) to evaluate the optical flow. Then, we select high-quality optical flow as motion cues to fuse with the appearance information, which can prevent poor-quality optical flow from diverting the network's attention. Moreover, we design an appearance-guided fusion module(AGFM) to better integrate appearance and motion information. Extensive experiments on several widely utilized datasets, including DAVIS-16, FBMS-59, and You Tube-Objects, demonstrate that the proposed method outperforms existing methods. 展开更多
关键词 Evaluating quality of motion for unsupervised video object segmentation
原文传递
Automatic Video Segmentation Algorithm by Background Model and Color Clustering
6
作者 沙芸 王军 刘玉树 《Journal of Beijing Institute of Technology》 EI CAS 2003年第S1期134-138,共5页
In order to detect the object in video efficiently, an automatic and real time video segmentation algorithm based on background model and color clustering is proposed. This algorithm consists of four phases: backgroun... In order to detect the object in video efficiently, an automatic and real time video segmentation algorithm based on background model and color clustering is proposed. This algorithm consists of four phases: background restoration, moving objects extract, moving objects region clustering and post processing. The threshold of the background restoration is not given in advanced. It can be gotten automatically. And a new object region cluster algorithm based on background model and color clustering to remove significance noise is proposed. An efficient method of eliminating shadow is also used. This approach was compared with other methods on pixel error ratio. The experiment result indicates the algorithm is correct and efficient. 展开更多
关键词 video segmentation background restoration object region cluster
在线阅读 下载PDF
Motion feature descriptor based moving objects segmentation
7
作者 Yuan Hui Chang Yilin +2 位作者 Ma Yanzhuo Bai Donglin Lu Zhaoyang 《High Technology Letters》 EI CAS 2012年第1期84-89,共6页
A novel moving objects segmentation method is proposed in this paper. A modified three dimensional recursive search (3DRS) algorithm is used in order to obtain motion information accurately. A motion feature descrip... A novel moving objects segmentation method is proposed in this paper. A modified three dimensional recursive search (3DRS) algorithm is used in order to obtain motion information accurately. A motion feature descriptor (MFD) is designed to describe motion feature of each block in a picture based on motion intensity, motion in occlusion areas, and motion correlation among neighbouring blocks. Then, a fuzzy C-means clustering algorithm (FCM) is implemented based on those MFDs so as to segment moving objects. Moreover, a new parameter named as gathering degree is used to distinguish foreground moving objects and background motion. Experimental results demonstrate the effectiveness of the proposed method. 展开更多
关键词 motion estimation (ME) motion feature descriptor (MFD) fuzzy C-means clustering .moving objects segmentation video analysis
在线阅读 下载PDF
MOTION-BASED REGION GROWING SEGMENTATION OF IMAGE SEQUENCES 被引量:1
8
作者 Lu Guanming Bi Houjie Jiang Ping(Department of Information Engineering, Nanjing University ofPosts & Telecommunications, Nanjing 210003) 《Journal of Electronics(China)》 2000年第1期53-58,共6页
This paper proposes a motion-based region growing segmentation scheme for the object-based video coding, which segments an image into homogeneous regions characterized by a coherent motion. It adopts a block matching ... This paper proposes a motion-based region growing segmentation scheme for the object-based video coding, which segments an image into homogeneous regions characterized by a coherent motion. It adopts a block matching algorithm to estimate motion vectors and uses morphological tools such as open-close by reconstruction and the region-growing version of the watershed algorithm for spatial segmentation to improve the temporal segmentation. In order to determine the reliable motion vectors, this paper also proposes a change detection algorithm and a multi-candidate pro- screening motion estimation method. Preliminary simulation results demonstrate that the proposed scheme is feasible. The main advantage of the scheme is its low computational load. 展开更多
关键词 CHANGE detection MOTION estimation Image segmentation object-based video CODING
在线阅读 下载PDF
Full-duplex strategy for video object segmentation 被引量:3
9
作者 Ge-Peng Ji Deng-Ping Fan +3 位作者 Keren Fu Zhe Wu Jianbing Shen Ling Shao 《Computational Visual Media》 SCIE EI CSCD 2023年第1期155-175,共21页
Previous video object segmentation approachesmainly focus on simplex solutions linking appearance and motion,limiting effective feature collaboration between these two cues.In this work,we study a novel and efficient ... Previous video object segmentation approachesmainly focus on simplex solutions linking appearance and motion,limiting effective feature collaboration between these two cues.In this work,we study a novel and efficient full-duplex strategy network(FSNet)to address this issue,by considering a better mutual restraint scheme linking motion and appearance allowing exploitation of cross-modal features from the fusion and decoding stage.Specifically,we introduce a relational cross-attention module(RCAM)to achieve bidirectional message propagation across embedding sub-spaces.To improve the model’s robustness and update inconsistent features from the spatiotemporal embeddings,we adopt a bidirectional purification module after the RCAM.Extensive experiments on five popular benchmarks show that our FSNet is robust to various challenging scenarios(e.g.,motion blur and occlusion),and compares well to leading methods both for video object segmentation and video salient object detection.The project is publicly available at https://github.com/GewelsJI/FSNet. 展开更多
关键词 video object segmentation(VOS) video salient object detection(V-SOD) visual attention
原文传递
Global video object segmentation with spatial constraint module
10
作者 Yadang Chen Duolin Wang +2 位作者 Zhiguo Chen Zhi-Xin Yang Enhua Wu 《Computational Visual Media》 SCIE EI CSCD 2023年第2期385-400,共16页
We present a lightweight and efficient semisupervised video object segmentation network based on the space-time memory framework.To some extent,our method solves the two difficulties encountered in traditional video o... We present a lightweight and efficient semisupervised video object segmentation network based on the space-time memory framework.To some extent,our method solves the two difficulties encountered in traditional video object segmentation:one is that the single frame calculation time is too long,and the other is that the current frame’s segmentation should use more information from past frames.The algorithm uses a global context(GC)module to achieve highperformance,real-time segmentation.The GC module can effectively integrate multi-frame image information without increased memory and can process each frame in real time.Moreover,the prediction mask of the previous frame is helpful for the segmentation of the current frame,so we input it into a spatial constraint module(SCM),which constrains the areas of segments in the current frame.The SCM effectively alleviates mismatching of similar targets yet consumes few additional resources.We added a refinement module to the decoder to improve boundary segmentation.Our model achieves state-of-the-art results on various datasets,scoring 80.1%on YouTube-VOS 2018 and a J&F score of 78.0%on DAVIS 2017,while taking 0.05 s per frame on the DAVIS 2016 validation dataset. 展开更多
关键词 video object segmentation semantic segmentation global context(GC)module spatial constraint
原文传递
基于动态嵌入特征的鲁棒半监督视频目标分割 被引量:2
11
作者 陈亚当 赵翊冰 吴恩华 《北京航空航天大学学报》 北大核心 2025年第7期2253-2261,共9页
针对半监督视频目标分割(VOS)方法存在推理时内存占用不断增加及仅依赖低级像素特征训练困难的问题,提出一种基于动态嵌入特征和辅助损失函数的半监督视频目标分割方法。使用动态嵌入特征建立恒定大小的记忆库;通过时空聚合方法,利用历... 针对半监督视频目标分割(VOS)方法存在推理时内存占用不断增加及仅依赖低级像素特征训练困难的问题,提出一种基于动态嵌入特征和辅助损失函数的半监督视频目标分割方法。使用动态嵌入特征建立恒定大小的记忆库;通过时空聚合方法,利用历史信息生成和更新动态嵌入特征;使用内存更新感应器来自适应控制记忆库的更新间隔,适应不同视频的运动模式;使用辅助损失函数,在高级语义特征层面上给网络提供辅助指导,并通过在多重特征层面多方面指导,提高模型精度和训练效率;针对视频前背景中相似目标误匹配的问题,设计一种时空约束模块,以利用视频的时间连续性特性更好地捕获前一帧掩码信息与当前帧之间的关联。实验结果表明:所提方法在DAVIS 2017验证集上达到84.5%J&F的精度,在YouTube-VOS 2019验证集达到82.4%J&F的精度。 展开更多
关键词 视频目标分割 时空记忆网络 时空约束 内存更新感应 动态嵌入特征
原文传递
全局特征增强及掩模矫正的半监督视频目标分割方法
12
作者 潘祖望 桂彦 +1 位作者 易宇航 张建明 《计算机辅助设计与图形学学报》 北大核心 2025年第10期1837-1848,共12页
针对视频目标分割中存在的相似目标辨别精确度低、分割误差累积等问题,提出一种全局特征增强及掩模矫正的半监督视频目标分割方法.首先采用全局上下文感知模块增强特征,利用2个全局存储单元建模特征的全局依赖关系,捕获视频帧内和帧间... 针对视频目标分割中存在的相似目标辨别精确度低、分割误差累积等问题,提出一种全局特征增强及掩模矫正的半监督视频目标分割方法.首先采用全局上下文感知模块增强特征,利用2个全局存储单元建模特征的全局依赖关系,捕获视频帧内和帧间的全局上下文信息,提高分割模型对相似干扰物的辨别能力;然后提出细节感知解码器,在解码初期阶段通过跳跃连接融合编码器特征,学习局部细节增强的解码特征;最后在解码后期阶段部署掩模矫正模块,估计粗糙分割掩模中不确定区域,并对模糊目标边界及分割错误区域进行矫正,获得精确的视频目标分割结果.在具有挑战的DAVIS和YouTube-VOS基准数据集上进行大量实验的结果表明,所提方法明显优于文中对比方法,在YouTube-VOS 2019验证集中的性能分数G相较于STCN和GSFM方法分别提高了1.6和0.3. 展开更多
关键词 半监督视频目标分割 时空记忆网络 全局特征增强 掩模矫正
在线阅读 下载PDF
运动提示引导自适应学习无监督视频目标分割
13
作者 韩志冬 胡升龙 +1 位作者 宋慧慧 张开华 《电子学报》 北大核心 2025年第7期2305-2323,共19页
现有无监督视频目标分割(Unsupervised Video Object Segmentation,UVOS)方法多采用像素级密集匹配策略,通过对齐融合多帧之间或单帧与光流之间的信息来提升模型性能.然而,在遮挡、相机抖动、运动模糊等挑战性场景中,光流估计误差易产... 现有无监督视频目标分割(Unsupervised Video Object Segmentation,UVOS)方法多采用像素级密集匹配策略,通过对齐融合多帧之间或单帧与光流之间的信息来提升模型性能.然而,在遮挡、相机抖动、运动模糊等挑战性场景中,光流估计误差易产生大量错误匹配,导致融合后的时空表征易过拟合运动噪声.为此,本文提出一种运动提示引导的自适应学习UVOS框架.通过设计一种无监督光流提示生成算法,将光流编码的密集运动信息转换为稀疏点和框提示,借助提示学习引导分割一切模型(Segment Anything Model,SAM)通过本文设计的两个轻量级适配器来自适应学习,从而获得更为鲁棒的时空表征,增强模型的抗噪能力.为获得有效的提示,设计了一种无监督运动提示生成算法.该算法基于光流特征计算一系列统计量,筛选出显著区域,再利用运动边缘信息去除伪显著区域的干扰,并设定自适应阈值进行过滤,生成提示显著运动目标所在区域的点和框坐标.为提升SAM在下游UVOS任务中的泛化性,提出一种自适应表征学习SAM模型.通过设计两个轻量级特征适配器,从SAM的通用知识库中自适应学习与下游UVOS任务相关的知识,以准确地粗定位目标.针对SAM基于纯Transformer架构在细节处理上的不足,基于卷积神经网络(Convolutional Neural Networks,CNN)架构设计了表观聚焦细化模块.由SAM得到的定位注意力图渐进式地引导细化过程,使模型的注意力从全局粗定位聚焦到局部细化,最终得到更加精确的分割掩码.本文方法在DAVIS16(DAVIS 2016)、FBMS(Financial and Business Management System)和YTOBJ(YouTube-OBJects)三个主流数据集上进行了充分验证.结果表明:本文方法在区域相似度指标上较当前先进方法分别提升了1.8%、1.6%和2.6%,充分表明了本文方法的有效性. 展开更多
关键词 无监督视频目标分割(UVOS) 光流噪声 分割一切模型(SAM) 提示学习 自适应表征学习 运动外观解耦 多模态
在线阅读 下载PDF
Deep Learning-based Moving Object Segmentation:Recent Progress and Research Prospects 被引量:2
14
作者 Rui Jiang Ruixiang Zhu +3 位作者 Hu Su Yinlin Li Yuan Xie Wei Zou 《Machine Intelligence Research》 EI CSCD 2023年第3期335-369,共35页
Moving object segmentation(MOS),aiming at segmenting moving objects from video frames,is an important and challenging task in computer vision and with various applications.With the development of deep learning(DL),MOS... Moving object segmentation(MOS),aiming at segmenting moving objects from video frames,is an important and challenging task in computer vision and with various applications.With the development of deep learning(DL),MOS has also entered the era of deep models toward spatiotemporal feature learning.This paper aims to provide the latest review of recent DL-based MOS methods proposed during the past three years.Specifically,we present a more up-to-date categorization based on model characteristics,then compare and discuss each category from feature learning(FL),and model training and evaluation perspectives.For FL,the methods reviewed are divided into three types:spatial FL,temporal FL,and spatiotemporal FL,then analyzed from input and model architectures aspects,three input types,and four typical preprocessing subnetworks are summarized.In terms of training,we discuss ideas for enhancing model transferability.In terms of evaluation,based on a previous categorization of scene dependent evaluation and scene independent evaluation,and combined with whether used videos are recorded with static or moving cameras,we further provide four subdivided evaluation setups and analyze that of reviewed methods.We also show performance comparisons of some reviewed MOS methods and analyze the advantages and disadvantages of reviewed MOS methods in terms of technology.Finally,based on the above comparisons and discussions,we present research prospects and future directions. 展开更多
关键词 Moving object segmentation(MOS) change detection background subtraction deep learning(DL) video understanding
原文传递
自约束多尺度记忆网络的超声心动视频分割算法研究
15
作者 岳宝坤 李智 +1 位作者 孙浩元 万岳炘 《贵州大学学报(自然科学版)》 2025年第3期104-114,共11页
在超声心动视频中,复杂的解剖结构和心跳周期内的形变伪影常导致分割区域混淆和错误,因此提出一种基于自约束多尺度记忆网络(self constrained multi-scale memory network,CSTM)的半监督超声心动视频分割算法。该算法采用目标检测网络S... 在超声心动视频中,复杂的解剖结构和心跳周期内的形变伪影常导致分割区域混淆和错误,因此提出一种基于自约束多尺度记忆网络(self constrained multi-scale memory network,CSTM)的半监督超声心动视频分割算法。该算法采用目标检测网络SAM-DETR定位超声心动视频中每帧的左心室区域,并利用该网络有效地提取左心室及其周围组织的特征,这些特征作为约束信息被输入到多尺度记忆网络中,指导有对象掩码的帧进行左心室分割并更新记忆信息,对于没有对象掩码的帧,通过查询记忆信息进行分割。将多尺度编码器与多层次细化解码器相结合构成多尺度记忆网络用于解决约束信息带来的边缘信息丢失问题,使CSTM能得到精确的分割效果。在公开数据集EchoNet-Dynamic上的实验结果显示,所提算法在Dice系数上达到90.5,Hausdorff距离为4.11,分割结果优于现有方法,验证了算法在超声心动图分割任务中的有效性和正确性。 展开更多
关键词 超声心动视频分割 半监督学习 目标检测
在线阅读 下载PDF
聚焦式学习分割一切提示的无监督视频目标分割
16
作者 沈勇辉 卜东旭 +1 位作者 张胜裕 宋慧慧 《计算机工程与科学》 北大核心 2025年第2期298-307,共10页
无监督视频目标分割旨在测试阶段自动定位和分割视频帧中的主要目标。目前,大多数模型、方法依赖于从RGB图提取的外观线索和从光流图提取的运动线索来进行目标分割。然而,目标遮挡、快速运动或静止等问题会导致光流获取的信息缺失,仅依... 无监督视频目标分割旨在测试阶段自动定位和分割视频帧中的主要目标。目前,大多数模型、方法依赖于从RGB图提取的外观线索和从光流图提取的运动线索来进行目标分割。然而,目标遮挡、快速运动或静止等问题会导致光流获取的信息缺失,仅依靠外观分支获取的有限信息难以实现良好的分割效果。为了解决这一问题,提出了一种聚焦式学习网络模型FPLNet,该模型引入额外的双分支结构以捕捉主要目标的位置信息和轮廓信息,从而弥补光流信息的缺失。首先,所提出的模型利用分割一切模型SAM的骨干网络提取外观和运动信息,从而提高模型的泛化性。然后,将额外引入的粗粒度和细粒度的2个分割分支共同作为聚焦式学习网络的提示部分。在解码部分,RGB外观信息、光流运动信息、粗粒度特征和细粒度特征逐步融合,以此模仿人类视觉系统,实现聚焦式学习目标特征的过程。在3个标准数据集上进行了大量的测试,实验结果表明,与现有的模型相比,所提出的模型拥有更优异的性能。 展开更多
关键词 无监督视频目标分割 聚焦式学习 分割一切模型
在线阅读 下载PDF
指代视频分割方法研究综述
17
作者 魏彩颖 贾磊 《计算机工程与应用》 北大核心 2025年第2期73-83,共11页
指代视频分割是计算机视觉和自然语言处理交叉领域的热点研究任务。目标是通过理解文本语义分割出给定视频的相关实体。与传统需预定义待分割物体类别的视觉分割任务不同,该任务不依赖于预定义的物体类别,而是通过理解给定的描述语句定... 指代视频分割是计算机视觉和自然语言处理交叉领域的热点研究任务。目标是通过理解文本语义分割出给定视频的相关实体。与传统需预定义待分割物体类别的视觉分割任务不同,该任务不依赖于预定义的物体类别,而是通过理解给定的描述语句定位目标并分割。由于文本描述的内容随机且无分割好的视频帧当作参考,使得该任务极具挑战。虽然是新兴的跨媒体理解任务,但在安防监控、车辆追踪以及行人重识别等领域具有极高的应用前景并已有较多性能显著的方法提出。由于缺乏指代视频分割方法的研究综述,因此现有的指代视频分割方法被系统梳理和分析。具体地,根据研究思路的不同粗略地将解决方法分为四类:基于动态卷积、基于注意力机制、基于多层次信息学习和基于端到端序列预测的指代视频分割;对各类及各类内具体方法的性能进行定量和定性的分析;总结现有工作的不足以及未来可进行改进的思路。 展开更多
关键词 跨模态检索 指代视频分割 跨模态理解
在线阅读 下载PDF
基于时空解耦和区域鲁棒性增强的半监督视频目标分割方法
18
作者 陈鹏宇 聂秀山 +1 位作者 李南君 李拓 《计算机应用》 北大核心 2025年第5期1379-1386,共8页
针对半监督视频目标分割(VOS)领域中基于记忆的方法存在由于目标交互造成的物体遮挡以及背景中类似对象或噪声的干扰等问题,提出一种基于时空解耦和区域鲁棒性增强的半监督VOS方法。首先,构建一个结构化Transformer架构去除所有像素共... 针对半监督视频目标分割(VOS)领域中基于记忆的方法存在由于目标交互造成的物体遮挡以及背景中类似对象或噪声的干扰等问题,提出一种基于时空解耦和区域鲁棒性增强的半监督VOS方法。首先,构建一个结构化Transformer架构去除所有像素共有的特征信息,突出每个像素之间的差异,深入挖掘视频帧中目标的关键特征;其次,解耦当前帧与长期记忆帧之间的相似性,区分为时空相关性和目标重要性2个关键维度,使得对像素级时空特征和目标特征的分析更精确,从而解决由目标交互造成的物体遮挡问题;最后,设计一个区域条形注意力(RSA)模块,利用长期记忆中的目标位置信息增强对前景区域的关注度并抑制背景噪声。实验结果表明,所提方法在DAVIS 2017验证集上比重新训练的AOT(Associating Objects with Transformers)模型的J&F指标高1.7个百分点,在YouTube-VOS2019验证集上比重新训练的AOT模型的总分高1.6个百分点。可见所提方法可有效解决半监督VOS存在的问题。 展开更多
关键词 视频目标分割 时空解耦 半监督学习 TRANSFORMER 条形注意力
在线阅读 下载PDF
基于离散余弦变换特征融合的无监督视频目标分割
19
作者 王玉琛 樊佳庆 宋慧慧 《计算机与数字工程》 2025年第2期395-402,共8页
无监督视频目标分割任务旨在对没有人工提供第一帧的目标分割真值掩膜的情况下,对视频中的前景对象进行定位和分割。现有的方法主要关注提高分割精度上,而忽略了内存和计算成本。通常,现有的方法只在空间域内根据重要性对特征进行增强,... 无监督视频目标分割任务旨在对没有人工提供第一帧的目标分割真值掩膜的情况下,对视频中的前景对象进行定位和分割。现有的方法主要关注提高分割精度上,而忽略了内存和计算成本。通常,现有的方法只在空间域内根据重要性对特征进行增强,忽略了特征在频域中的差异性。此外,现有方法也没有充分利用全局语义信息来引导视频目标的分割。为解决上述问题,论文提出一种基于离散余弦变换特征融合的轻量级无监督视频目标分割网络。首先,使用轻量的骨干网络同时提取外观与运动特征;接着,设计了离散余弦变换特征融合模块,用于对外观与运动特征的融合与增强;然后,利用大核卷积全局语义引导模块对大核卷积分解,在降低计算量的同时,保持提取全局语义信息的能力;最后,在全局语义信息的引导下逐级聚合频域增强后的多级特征,最终得到精确的分割结果。通过上述设计,论文方法最终只有14.7 M参数量。论文在DAVIS2016、FBMS和DAVSOD数据集上进行了大量的实验评测,实验结果充分表明,论文方法在J&F、MAE和Fm等多个指标上均取得了良好的性能;同时,保持了高效的推理速度。 展开更多
关键词 无监督视频目标分割 离散余弦变换 注意力机制 频域分析
在线阅读 下载PDF
Segmentation of Moving Objects Using Mathematical Morphology
20
作者 LU Guan-ming BI Hou-jie(Department of Information Engineering. Naming University of Posts and Telecommunications. Naming, 210003, P.R.China ) 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 1999年第2期63-66,共4页
This paper proposes a motion-based region growing segmentation scheme, which incorporatesluminance and motion information simultaneously and uses morphological tools such as open-close byreconstruction and the region-... This paper proposes a motion-based region growing segmentation scheme, which incorporatesluminance and motion information simultaneously and uses morphological tools such as open-close byreconstruction and the region-growing version of the watershed algorithm. The main advantage of this scheme is thatthe resultant objects ore characterized by a coherent motion and foe moving object boundaries are precisely located.Simulation results demonstrate the effiency of the Proposed scheme. 展开更多
关键词 mathematical morphology motion estimation image segmentation object-based video coding
原文传递
上一页 1 2 14 下一页 到第
使用帮助 返回顶部