期刊文献+
共找到561篇文章
< 1 2 29 >
每页显示 20 50 100
Visualization Analysis of Multi-Domain Access Control Policy Integration Based on Tree-Maps and Semantic Substrates 被引量:2
1
作者 Li Pan Qian Xu 《Intelligent Information Management》 2012年第5期188-193,共6页
The complexity of multi-domain access control policy integration makes it difficult to understand and manage the policy conflict information. The policy information visualization technology can express the logical rel... The complexity of multi-domain access control policy integration makes it difficult to understand and manage the policy conflict information. The policy information visualization technology can express the logical relation of the complex information intuitively which can effectively improve the management ability of the multi-domain policy integration. Based on the role-based access control model, this paper proposed two policy analyzing methods on the separated domain statistical information of multi-domain policy integration conflicts and the policy element levels of inter-domain and element mapping of cross-domain respectively. In addition, the corresponding visualization tool is developed. We use the tree-maps algorithm to statistically analyze quantity and type of the policy integration conflicts. On that basis, the semantic substrates algorithm is applied to concretely analyze the policy element levels of inter-domain and role and permission mapping of cross-domain. Experimental result shows tree-maps and semantic substrates can effectively analyze the conflicts of multi-domain policy integration and have a good application value. 展开更多
关键词 Cross-Domain Information Exchange visualIZATION ANALYSIS Tree-Maps semantIC SUBSTRATES
在线阅读 下载PDF
MG-SLAM: RGB-D SLAM Based on Semantic Segmentation for Dynamic Environment in the Internet of Vehicles 被引量:1
2
作者 Fengju Zhang Kai Zhu 《Computers, Materials & Continua》 2025年第2期2353-2372,共20页
The Internet of Vehicles (IoV) has become an important direction in the field of intelligent transportation, in which vehicle positioning is a crucial part. SLAM (Simultaneous Localization and Mapping) technology play... The Internet of Vehicles (IoV) has become an important direction in the field of intelligent transportation, in which vehicle positioning is a crucial part. SLAM (Simultaneous Localization and Mapping) technology plays a crucial role in vehicle localization and navigation. Traditional Simultaneous Localization and Mapping (SLAM) systems are designed for use in static environments, and they can result in poor performance in terms of accuracy and robustness when used in dynamic environments where objects are in constant movement. To address this issue, a new real-time visual SLAM system called MG-SLAM has been developed. Based on ORB-SLAM2, MG-SLAM incorporates a dynamic target detection process that enables the detection of both known and unknown moving objects. In this process, a separate semantic segmentation thread is required to segment dynamic target instances, and the Mask R-CNN algorithm is applied on the Graphics Processing Unit (GPU) to accelerate segmentation. To reduce computational cost, only key frames are segmented to identify known dynamic objects. Additionally, a multi-view geometry method is adopted to detect unknown moving objects. The results demonstrate that MG-SLAM achieves higher precision, with an improvement from 0.2730 m to 0.0135 m in precision. Moreover, the processing time required by MG-SLAM is significantly reduced compared to other dynamic scene SLAM algorithms, which illustrates its efficacy in locating objects in dynamic scenes. 展开更多
关键词 visual SLAM dynamic scene semantic segmentation GPU acceleration key segmentation frame
在线阅读 下载PDF
基于语义分割模型的城市街景空间评价与资本化效应研究——以杭州市主城区为例
3
作者 张凌 陈庚 张钊 《浙江大学学报(理学版)》 北大核心 2026年第1期13-26,共14页
对居住环境品质的追求体现了新时代居民对美好生活的向往,住宅周边的街景逐渐成为购房者关注的重点。当前,针对住宅邻域环境资本化效应的研究大多专注于讨论公共服务设施的配置,为进一步研究住宅邻域街景视觉特征对住宅价格的影响,采用... 对居住环境品质的追求体现了新时代居民对美好生活的向往,住宅周边的街景逐渐成为购房者关注的重点。当前,针对住宅邻域环境资本化效应的研究大多专注于讨论公共服务设施的配置,为进一步研究住宅邻域街景视觉特征对住宅价格的影响,采用基于深度学习的语义分割模型(segment anything model,SAM),提取住宅周边街道空间的视觉特征,并基于特征价格模型,从多个维度分析街景视觉特征对住宅价格的影响。结果表明:(1)街景视觉特征对住宅价格具有显著影响,且不同购买力、不同城市区域的购房者对城市街景的支付意愿存在差异;(2)街景视觉特征指标与住宅价格呈非线性关系,除天空开阔程度外,其他街景特征均存在阈值,在阈值附近居民表现出强烈的支付意愿;(3)街景视觉特征指标与传统住宅特征存在交互资本化效应,街景视觉特征指标对传统住宅特征起补偿作用。 展开更多
关键词 街景图像 语义分割 邻域视觉环境 住宅价格 资本化效应
在线阅读 下载PDF
基于位置和语义分离注意力机制的轻量视频目标跟踪算法
4
作者 王珺 李昆仑 +3 位作者 张伊菲 朱其振 刘磊 王帅 《计算机工程》 北大核心 2026年第1期228-241,共14页
随着深度大模型技术的不断发展,基于孪生网络的视频目标跟踪算法主干网络也不断深化,参数量不断增多,导致模型训练时间和成本的成倍增长,对模型在边缘设备上的部署造成了困难。因此,针对提升轻量级小模型对目标位置和语义信息提取能力... 随着深度大模型技术的不断发展,基于孪生网络的视频目标跟踪算法主干网络也不断深化,参数量不断增多,导致模型训练时间和成本的成倍增长,对模型在边缘设备上的部署造成了困难。因此,针对提升轻量级小模型对目标位置和语义信息提取能力的问题,提出基于位置和语义分离注意力机制的轻量视频目标跟踪算法。首先对归一化注意力机制进行改进并结合水平和竖直方向卷积构建位置注意力,嵌入到主干网络的浅层特征,实现对目标位置信息的提取。然后联合通道方向归一化注意力与压缩-激励网络(SENet)注意力,并将其与主干网络的深层特征进行融合实现对目标语义信息的提取。与之前的注意力机制不同,分别利用网络中浅层特征有利于空间信息的提取和深层特征有利于语义特征提取的性质将位置注意力和语义注意力分离,在不明显增加网络参数量的情况下,提升算法对目标位置和语义信息的提取能力。在通用视频目标跟踪数据集上的实验结果表明,所提算法能够提升基于轻量级孪生网络跟踪算法的精度和成功率。 展开更多
关键词 视频目标跟踪 孪生网络 注意力机制 语义注意力 位置注意力
在线阅读 下载PDF
基于BEV感知的视觉平面图定位
5
作者 陈集伟 陈泽彬 谭光 《计算机科学》 北大核心 2026年第1期216-223,共8页
视觉平面图定位任务通过视觉观测数据与场景平面图表示的匹配实现位姿估计。实际应用中,有效融合视觉观测与平面图之间的几何和语义关联对提升定位精度至关重要。然而,现有方法存在两个主要局限:一是未能充分挖掘相机视野内的语义信息;... 视觉平面图定位任务通过视觉观测数据与场景平面图表示的匹配实现位姿估计。实际应用中,有效融合视觉观测与平面图之间的几何和语义关联对提升定位精度至关重要。然而,现有方法存在两个主要局限:一是未能充分挖掘相机视野内的语义信息;二是缺乏几何与语义线索的联合匹配机制。针对上述问题,提出基于鸟瞰图(Bird Eye View,BEV)感知的视觉平面图定位框架,其包含3个核心模块:首先,BEV语义建图模块通过多模态图像投影变换构建局部场景的BEV语义表征,实现观测数据的结构化表示;其次,预期观测生成模块在平面图空间内生成预期观测数据库,通过可微分渲染方法实现观测数据的快速生成;最后,多层次匹配定位模块提出几何-语义联合匹配机制,通过层次化匹配策略融合BEV观测中的几何布局和语义类别信息,实现与平面图的精确匹配。实验结果表明,该框架在公开数据集Structured3D和自建仿真环境数据集IndoorEnv上的定位召回率分别从0.32%和4.82%提升到了3.12%和58.77%,显著优于现有基线方法Laser和F3Loc,从而验证了所提方法在复杂室内场景中的有效性和鲁棒性。 展开更多
关键词 BEV感知 平面图定位 视觉定位 几何-语义联合匹配
在线阅读 下载PDF
基于双重信息对齐的多目标生成式对抗攻击
6
作者 钱亚冠 余芹芹 +4 位作者 沙嘉强 鲍琦琦 顾钊铨 纪守领 王滨 《计算机研究与发展》 北大核心 2026年第3期752-767,共16页
深度神经网络在很多应用领域取得了显著成功。然而,近年来的研究表明,它们容易受到对抗攻击的威胁。尤其是有目标的对抗攻击,能够精确控制未知模型的输出,对数据隐私和系统安全构成严重挑战。生成式攻击方法因其高效生成对抗样本的能力... 深度神经网络在很多应用领域取得了显著成功。然而,近年来的研究表明,它们容易受到对抗攻击的威胁。尤其是有目标的对抗攻击,能够精确控制未知模型的输出,对数据隐私和系统安全构成严重挑战。生成式攻击方法因其高效生成对抗样本的能力,近年来逐渐应用于有目标攻击的研究中。然而,现有的生成式攻击方法通常针对单一目标类别生成对抗样本,在多目标任务中表现出计算效率低下、灵活性不足和扩展性有限等问题。针对这些不足,提出了一种基于双重信息的多目标生成式攻击(multitarget generative attack based on dual-information,MTGA-DI)方法。该方法通过设计一个条件生成模型,充分融合目标类别的语义和视觉信息,不仅具备多目标攻击能力,还显著提升了对抗样本的迁移性和鲁棒性。实验结果表明,与现有多目标攻击方法相比,MTGA-DI在标准训练模型和鲁棒模型上的性能更优,在应对基于输入预处理的防御模型时也展现出更强的适应能力。 展开更多
关键词 对抗攻击 有目标攻击 生成模型 语义信息 视觉信息 深度学习
在线阅读 下载PDF
激活数据要素潜能的档案数据关联挖掘与可视化研究
7
作者 孙绍媛 《山西档案》 北大核心 2026年第3期109-112,共4页
在数字经济背景下,档案正从行政资源向数据要素转型,仍面临语义割裂与利用方式单一等制约。为充分激活档案数据潜能,遵循“价值积聚—价值激活—价值实现”逻辑路径,提出基于语义化重组的数据关联挖掘方法及其时空可视化实现途径,构建... 在数字经济背景下,档案正从行政资源向数据要素转型,仍面临语义割裂与利用方式单一等制约。为充分激活档案数据潜能,遵循“价值积聚—价值激活—价值实现”逻辑路径,提出基于语义化重组的数据关联挖掘方法及其时空可视化实现途径,构建档案数据价值的系统性框架,并阐释档案数据从资源态向资产态、资本态跃迁的内在机理,为推动档案事业深度融入国家大数据战略提供理论支撑。 展开更多
关键词 档案数据要素 语义化重组 知识图谱 数据挖掘 可视化设计
在线阅读 下载PDF
基于集成学习的神经母细胞瘤语义分割及半透明可视化
8
作者 潘姣 季申予 +2 位作者 李亮 田中觉 王笑琨 《工程科学学报》 北大核心 2026年第2期360-369,共10页
神经母细胞瘤是一种形态复杂多变的肿瘤,肿瘤的位置、形状和大小差异显著,且常伴随重要解剖结构的包绕,肿瘤与周围组织的边界模糊,导致术前评估与手术规划面临巨大挑战.为提升术前诊疗的智能化与可视化水平,本文提出了一种基于集成学习... 神经母细胞瘤是一种形态复杂多变的肿瘤,肿瘤的位置、形状和大小差异显著,且常伴随重要解剖结构的包绕,肿瘤与周围组织的边界模糊,导致术前评估与手术规划面临巨大挑战.为提升术前诊疗的智能化与可视化水平,本文提出了一种基于集成学习的神经母细胞瘤语义分割及半透明三维可视化方法.在语义分割部分,本文基于预训练的nnU-Net架构构建了能够使用多模态医学图像作为输入的分割框架,并在推理阶段引入了一种基于验证集Dice分数的加权投票集成策略.与nnU-Net默认的等权平均集成不同,该策略根据模型性能分配融合权重,使表现更优的模型在最终预测中占据更大权重,从而在保持整体稳定性的同时提升了分割精度.本方法在SPPIN 2023挑战赛提供的儿童神经母细胞瘤数据集上开展了对比实验,该方法在Dice系数、Hausdorff距离与体积相似性等指标上均优于主流方法.此外,为进一步验证投票集成策略的有效性,我们在BraTS2021给出的脑肿瘤数据集上进行了消融实验,证明了投票策略确实有效.在肿瘤可视化部分,本文使用了一种基于随机点采样的半透明三维可视化方法,通过将分割后的结果进行点云化,并进行多子集点云的统计融合,在无需深度排序的条件下实现快速渲染,实现了肿瘤和周围器官的半透明可视化.本文提出的可视化方案可以提升术前空间理解效率,为复杂病例的术前辅助决策提供直观、精准的视觉支持,具备良好的临床应用前景. 展开更多
关键词 语义分割 半透明可视化 神经母细胞瘤 集成学习 多模态核磁共振
在线阅读 下载PDF
基于语义分割的动态视觉SLAM算法研究
9
作者 赵学培 《机械工程与自动化》 2026年第1期31-34,共4页
针对动态环境下的物体移动对系统位姿估计精度的影响,为提升无人机对真实环境的感知水平和定位精度,提出一种融合语义信息的动态视觉SLAM算法。基于经典的视觉SLAM框架,引入Mask R-CNN语义分割对含有动态物体的图像进行动态目标滤除,消... 针对动态环境下的物体移动对系统位姿估计精度的影响,为提升无人机对真实环境的感知水平和定位精度,提出一种融合语义信息的动态视觉SLAM算法。基于经典的视觉SLAM框架,引入Mask R-CNN语义分割对含有动态物体的图像进行动态目标滤除,消除动态目标对SLAM精度的影响。搭建了由PixhawkV4飞控、JETSON Xavier NX板载计算机、D435i深度相机作为视觉传感器组成的四旋翼无人机测试系统。在公开数据集与定制数据集上进行算法实验与分析,并对所提算法与基于特征点法的Orbslam3算法进行实验对比分析,验证改进算法的准确性及有效性。在定制数据集中进行算法测试实验,其结果符合实际飞行轨迹,表明所提算法可以显著提高SLAM系统在动态场景中的定位精度。 展开更多
关键词 动态环境 位姿估计 视觉SLAM 语义分割
在线阅读 下载PDF
基于视觉语义分割的服务机器人室内目标抓取路径生成方法
10
作者 高兴兴 《计算机应用文摘》 2026年第4期65-67,共3页
针对服务机器人在复杂室内环境中自主作业的需求,文章提出一种基于视觉语义分割的室内目标抓取路径生成方法。首先,构建“分割—识别—规划”的统一流程架构,采用深度学习模型对室内图像进行像素级语义分割,生成带有物体类别与空间关系... 针对服务机器人在复杂室内环境中自主作业的需求,文章提出一种基于视觉语义分割的室内目标抓取路径生成方法。首先,构建“分割—识别—规划”的统一流程架构,采用深度学习模型对室内图像进行像素级语义分割,生成带有物体类别与空间关系的场景语义图;其次,依据分割结果精准定位目标物品的可抓取点位,并实时识别动态障碍物区域;最后,结合改进的路径规划算法,生成兼顾安全性和效率的机械臂动作序列。 展开更多
关键词 视觉语义分割 服务机器人 路径生成 目标抓取 室内环境
在线阅读 下载PDF
Answer Semantics-enhanced Medical Visual Question Answering
11
作者 Yuliang Liang Enneng Yang +4 位作者 Guibing Guo Wei Cai Linying Jiang Jianzhe Zhao Xingwei Wang 《Machine Intelligence Research》 2025年第6期1127-1137,共11页
Medical visual question answering(Med-VQA)is a task that aims to answer clinical questions given a medical image.Existing literature generally treats it as a classic classification task based on interaction features o... Medical visual question answering(Med-VQA)is a task that aims to answer clinical questions given a medical image.Existing literature generally treats it as a classic classification task based on interaction features of the image and question.However,such a paradigm ignores the valuable semantics of candidate answers as well as their relations.From the real-world dataset,we observe that:1)The text of candidate answers has a strong intrinsic correlation with medical images;2)Subtle differences among multiple candidate answers are crucial for identifying the correct one.Therefore,we propose an answer semantics enhanced(ASE)method to integrate the semantics of answers and capture their subtle differences.Specifically,we enhance the semantic correlation of image-question-answer triplets by aligning images and question-answer tuples within the feature fusion module.Then,we devise a contrastive learning loss to highlight the semantic differences between the correct answer and other answers.Finally,extensive experiments demonstrate the effectiveness of our method. 展开更多
关键词 Medical visual question answering(Med-VQA) semantic knowledge vision-language model contrastive learning question answering
原文传递
Visual SLAM Based on Object Detection Network:A Review 被引量:2
12
作者 Jiansheng Peng Dunhua Chen +3 位作者 Qing Yang Chengjun Yang Yong Xu Yong Qin 《Computers, Materials & Continua》 SCIE EI 2023年第12期3209-3236,共28页
Visual simultaneous localization and mapping(SLAM)is crucial in robotics and autonomous driving.However,traditional visual SLAM faces challenges in dynamic environments.To address this issue,researchers have proposed ... Visual simultaneous localization and mapping(SLAM)is crucial in robotics and autonomous driving.However,traditional visual SLAM faces challenges in dynamic environments.To address this issue,researchers have proposed semantic SLAM,which combines object detection,semantic segmentation,instance segmentation,and visual SLAM.Despite the growing body of literature on semantic SLAM,there is currently a lack of comprehensive research on the integration of object detection and visual SLAM.Therefore,this study aims to gather information from multiple databases and review relevant literature using specific keywords.It focuses on visual SLAM based on object detection,covering different aspects.Firstly,it discusses the current research status and challenges in this field,highlighting methods for incorporating semantic information from object detection networks into mileage measurement,closed-loop detection,and map construction.It also compares the characteristics and performance of various visual SLAM object detection algorithms.Lastly,it provides an outlook on future research directions and emerging trends in visual SLAM.Research has shown that visual SLAM based on object detection has significant improvements compared to traditional SLAM in dynamic point removal,data association,point cloud segmentation,and other technologies.It can improve the robustness and accuracy of the entire SLAM system and can run in real time.With the continuous optimization of algorithms and the improvement of hardware level,object visual SLAM has great potential for development. 展开更多
关键词 Object detection visual SLAM visual odometry loop closure detection semantic map
在线阅读 下载PDF
Large-scale 3D Semantic Mapping Using Stereo Vision 被引量:1
13
作者 Yi Yang Fan Qiu +3 位作者 Hao Li Lu Zhang Mei-Ling Wang Meng-Yin Fu 《International Journal of Automation and computing》 EI CSCD 2018年第2期194-206,共13页
In recent years, there have been a lot of interests in incorporating semantics into simultaneous localization and mapping (SLAM) systems. This paper presents an approach to generate an outdoor large-scale 3D dense s... In recent years, there have been a lot of interests in incorporating semantics into simultaneous localization and mapping (SLAM) systems. This paper presents an approach to generate an outdoor large-scale 3D dense semantic map based on binocular stereo vision. The inputs to system are stereo color images from a moving vehicle. First, dense 3D space around the vehicle is constructed, and tile motion of camera is estimated by visual odometry. Meanwhile, semantic segmentation is performed through the deep learning technology online, and the semantic labels are also used to verify tim feature matching in visual odometry. These three processes calculate the motion, depth and semantic label of every pixel in the input views. Then, a voxel conditional random field (CRF) inference is introduced to fuse semantic labels to voxel. After that, we present a method to remove the moving objects by incorporating the semantic labels, which improves the motion segmentation accuracy. The last is to generate tile dense 3D semantic map of an urban environment from arbitrary long image sequence. We evaluate our approach on KITTI vision benchmark, and the results show that the proposed method is effective. 展开更多
关键词 semantic map stereo vision motion segmentation visual odometry simultaneous localization and mapping (SLAM).
原文传递
Linking a Game Engine Environment to Architectural Information on the Semantic Web 被引量:2
14
作者 Pieter Pauwels Ronald De Meyer Jan Van Campenhout 《Journal of Civil Engineering and Architecture》 2011年第9期787-798,共12页
Because of the importance of graphics and information within the domain of architecture, engineering and construction (AEC), an appropriate combination of visualization technology and information management technolo... Because of the importance of graphics and information within the domain of architecture, engineering and construction (AEC), an appropriate combination of visualization technology and information management technology is of utter importance in the development of appropriately supporting design and construction applications. Virtual environments, however, tend not to make this information available. The sparse number of applications that present additional information furthermore tend to limit their scope to pure construction information and do not incorporate information from loosely related knowledge domains, such as cultural heritage or architectural history information. We therefore started an investigation of two of the newest developments in these domains, namely game engine technology and semantic web technology. This paper documents part of this research, containing a review and comparison of the most prominent game engines and documenting our architectural semantic web. A short test-case illustrates how both can be combined to enhance information visualization for architectural design and construction. 展开更多
关键词 3D BIM construction industry game engines INFORMATION semantic web virtual environments visualization.
在线阅读 下载PDF
A PERSONALIZED IMAGE RETRIEVAL BASED ON VISUAL PERCEPTION 被引量:1
15
作者 Zhang Jing Shen Lansun David Dagan Feng 《Journal of Electronics(China)》 2008年第1期129-133,共5页
A new scheme named personalized image retrieval technique based on visual perception is proposed in this letter, whose motive is to narrow the semantic gap by directly perceiving user's visual information. It uses... A new scheme named personalized image retrieval technique based on visual perception is proposed in this letter, whose motive is to narrow the semantic gap by directly perceiving user's visual information. It uses visual attention model to segment image regions and eye-tracking technique to record fixations. Visual perception is obtained by analyzing the fixations in regions to measure gaze interests. Integrating visual perception into attention model is to detect the Regions Of Interest (ROIs), whose features are extracted and analyzed, then feedback interests to optimize the results and construct user profiles. 展开更多
关键词 Personalized image retrieval visual perception semantic gap Regions Of Interest (ROIs)
在线阅读 下载PDF
A language to analyze,describe,and explore collections of visual art
16
作者 Hermann Pflüger 《Visual Computing for Industry,Biomedicine,and Art》 EI 2021年第1期29-49,共21页
A vast quantity of art in existence today is inaccessible to individuals.If people want to know the different types of art that exist,how individual works are connected,and how works of art are interpreted and discuss... A vast quantity of art in existence today is inaccessible to individuals.If people want to know the different types of art that exist,how individual works are connected,and how works of art are interpreted and discussed in the context of other works,they must utilize means other than simply viewing the art.Therefore,this paper proposes a language to analyze,describe,and explore collections of visual art(LadeCA).LadeCA combines human interpretation and automatic analyses of images,allowing users to assess collections of visual art without viewing every image in them.This paper focuses on the lexical base of LadeCA.It also outlines how collections of visual art can be analyzed,described,and explored using a LadeCA vocabulary.Additionally,the relationship between LadeCA and indexing systems,such as ICONCLASS or AAT,is demonstrated,and ways in which LadeCA and indexing systems can complement each other are highlighted. 展开更多
关键词 Collections of visual art visual computing Personalized digital libraries Digital humanities semantics
在线阅读 下载PDF
Visualizing risk factors of dementia from scholarly literature using knowledge maps and next-generation data models
17
作者 Kiran Fahd Sitalakshmi Venkatraman 《Visual Computing for Industry,Biomedicine,and Art》 EI 2021年第1期165-182,共18页
Scholarly communication of knowledge is predominantly document-based in digital repositories,and researchers find it tedious to automatically capture and process the semantics among related articles.Despite the presen... Scholarly communication of knowledge is predominantly document-based in digital repositories,and researchers find it tedious to automatically capture and process the semantics among related articles.Despite the present digital era of big data,there is a lack of visual representations of the knowledge present in scholarly articles,and a time-saving approach for a literature search and visual navigation is warranted.The majority of knowledge display tools cannot cope with current big data trends and pose limitations in meeting the requirements of automatic knowledge representation,storage,and dynamic visualization.To address this limitation,the main aim of this paper is to model the visualization of unstructured data and explore the feasibility of achieving visual navigation for researchers to gain insight into the knowledge hidden in scientific articles of digital repositories.Contemporary topics of research and practice,including modifiable risk factors leading to a dramatic increase in Alzheimer’s disease and other forms of dementia,warrant deeper insight into the evidence-based knowledge available in the literature.The goal is to provide researchers with a visual-based easy traversal through a digital repository of research articles.This paper takes the first step in proposing a novel integrated model using knowledge maps and next-generation graph datastores to achieve a semantic visualization with domain-specific knowledge,such as dementia risk factors.The model facilitates a deep conceptual understanding of the literature by automatically establishing visual relationships among the extracted knowledge from the big data resources of research articles.It also serves as an automated tool for a visual navigation through the knowledge repository for faster identification of dementia risk factors reported in scholarly articles.Further,it facilitates a semantic visualization and domain-specific knowledge discovery from a large digital repository and their associations.In this study,the implementation of the proposed model in the Neo4j graph data repository,along with the results achieved,is presented as a proof of concept.Using scholarly research articles on dementia risk factors as a case study,automatic knowledge extraction,storage,intelligent search,and visual navigation are illustrated.The implementation of contextual knowledge and its relationship for a visual exploration by researchers show promising results in the knowledge discovery of dementia risk factors.Overall,this study demonstrates the significance of a semantic visualization with the effective use of knowledge maps and paves the way for extending visual modeling capabilities in the future. 展开更多
关键词 Big data Data visualization Knowledge maps DEMENTIA Non-relational database Graph database Neo4j semantic visualization
在线阅读 下载PDF
A Real-Time Semantic Segmentation Method Based on Transformer for Autonomous Driving
18
作者 Weiyu Hao Jingyi Wang Huimin Lu 《Computers, Materials & Continua》 SCIE EI 2024年第12期4419-4433,共15页
While traditional Convolutional Neural Network(CNN)-based semantic segmentation methods have proven effective,they often encounter significant computational challenges due to the requirement for dense pixel-level pred... While traditional Convolutional Neural Network(CNN)-based semantic segmentation methods have proven effective,they often encounter significant computational challenges due to the requirement for dense pixel-level predictions,which complicates real-time implementation.To address this,we introduce an advanced real-time semantic segmentation strategy specifically designed for autonomous driving,utilizing the capabilities of Visual Transformers.By leveraging the self-attention mechanism inherent in Visual Transformers,our method enhances global contextual awareness,refining the representation of each pixel in relation to the overall scene.This enhancement is critical for quickly and accurately interpreting the complex elements within driving sce-narios—a fundamental need for autonomous vehicles.Our experiments conducted on the DriveSeg autonomous driving dataset indicate that our model surpasses traditional segmentation methods,achieving a significant 4.5%improvement in Mean Intersection over Union(mIoU)while maintaining real-time responsiveness.This paper not only underscores the potential for optimized semantic segmentation but also establishes a promising direction for real-time processing in autonomous navigation systems.Future work will focus on integrating this technique with other perception modules in autonomous driving to further improve the robustness and efficiency of self-driving perception frameworks,thereby opening new pathways for research and practical applications in scenarios requiring rapid and precise decision-making capabilities.Further experimentation and adaptation of this model could lead to broader implications for the fields of machine learning and computer vision,particularly in enhancing the interaction between automated systems and their dynamic environments. 展开更多
关键词 visual transformer semantic segmentation autonomous driving
在线阅读 下载PDF
Next Generation Semantic and Spatial Joint Perception——Neural Metric-Semantic Understanding
19
作者 ZHU Fang 《ZTE Communications》 2021年第1期61-71,共11页
Efficient perception of the real world is a long-standing effort of computer vision.Mod⁃ern visual computing techniques have succeeded in attaching semantic labels to thousands of daily objects and reconstructing dens... Efficient perception of the real world is a long-standing effort of computer vision.Mod⁃ern visual computing techniques have succeeded in attaching semantic labels to thousands of daily objects and reconstructing dense depth maps of complex scenes.However,simultaneous se⁃mantic and spatial joint perception,so-called dense 3D semantic mapping,estimating the 3D ge⁃ometry of a scene and attaching semantic labels to the geometry,remains a challenging problem that,if solved,would make structured vision understanding and editing more widely accessible.Concurrently,progress in computer vision and machine learning has motivated us to pursue the capability of understanding and digitally reconstructing the surrounding world.Neural metric-se⁃mantic understanding is a new and rapidly emerging field that combines differentiable machine learning techniques with physical knowledge from computer vision,e.g.,the integration of visualinertial simultaneous localization and mapping(SLAM),mesh reconstruction,and semantic un⁃derstanding.In this paper,we attempt to summarize the recent trends and applications of neural metric-semantic understanding.Starting with an overview of the underlying computer vision and machine learning concepts,we discuss critical aspects of such perception approaches.Specifical⁃ly,our emphasis is on fully leveraging the joint semantic and 3D information.Later on,many im⁃portant applications of the perception capability such as novel view synthesis and semantic aug⁃mented reality(AR)contents manipulation are also presented.Finally,we conclude with a dis⁃cussion of the technical implications of the technology under a 5G edge computing scenario. 展开更多
关键词 visual computing semantic and spatial joint perception dense 3D semantic map⁃ping neural metric-semantic understanding
在线阅读 下载PDF
以语言为媒介的遥感图像跨时空领域自适应语义分割 被引量:1
20
作者 陶超 郭鑫 +2 位作者 胡柯彦 沈羽翔 王昊 《中国图象图形学报》 北大核心 2025年第9期3153-3170,共18页
目的随着视觉大模型的发展,利用多源无标注遥感影像预训练学习全局视觉特征,并在局部目标任务上进行迁移微调,已成为遥感影像领域自适应的一种新范式。然而,现有的全局预训练策略主要聚焦于学习低级的通用视觉特征,难以捕捉复杂、高层... 目的随着视觉大模型的发展,利用多源无标注遥感影像预训练学习全局视觉特征,并在局部目标任务上进行迁移微调,已成为遥感影像领域自适应的一种新范式。然而,现有的全局预训练策略主要聚焦于学习低级的通用视觉特征,难以捕捉复杂、高层次的语义关联。此外,微调过程中使用的少量标注样本往往只反映目标域的特定场景,无法充分激活全局模型中与目标域匹配的领域知识。因此,面对复杂多变的遥感影像跨时空领域偏移,现有方法得到的全局模型与目标任务之间仍然存在巨大的语义鸿沟。为应对这一挑战,本文提出一种语言文本引导的“全局模型预训练—局部模型微调”的领域自适应框架。方法提出框架针对遥感数据的时空异质性特点,借助大型视觉语言助手LLaVA(large language and vision assistant)生成包含季节、地理区域及地物分布等时空信息的遥感影像文本描述。通过语言文本引导的学习帮助全局模型挖掘地物的时空分布规律,增强局部任务微调时相关领域知识的激活。结果在对比判别式、掩码生成式和扩散生成式3种不同全局预训练策略上设置了3组“全局—局部”跨时空领域自适应语义分割实验来验证提出框架的有效性。以全局→局部(长沙)为例,使用语言文本引导相比于无文本引导在3种不同预训练策略上分别提升了8.7%、4.4%和2.9%。同样地,提出框架在全局→局部(湘潭)和全局→局部(武汉)上也都有性能提升。结论证明了语言文本对准确理解跨时空遥感影像中的语义内容具有积极影响。与无文本引导的学习方法相比,提出框架显著提升了模型的迁移性能。 展开更多
关键词 遥感影像 语义分割 领域自适应 视觉语言模型 时空异质性
原文传递
上一页 1 2 29 下一页 到第
使用帮助 返回顶部