期刊文献+
共找到164篇文章
< 1 2 9 >
每页显示 20 50 100
Deep Multi-Scale and Attention-Based Architectures for Semantic Segmentation in Biomedical Imaging
1
作者 Majid Harouni Vishakha Goyal +2 位作者 Gabrielle Feldman Sam Michael Ty C.Voss 《Computers, Materials & Continua》 2025年第10期331-366,共36页
Semantic segmentation plays a foundational role in biomedical image analysis, providing precise information about cellular, tissue, and organ structures in both biological and medical imaging modalities. Traditional a... Semantic segmentation plays a foundational role in biomedical image analysis, providing precise information about cellular, tissue, and organ structures in both biological and medical imaging modalities. Traditional approaches often fail in the face of challenges such as low contrast, morphological variability, and densely packed structures. Recent advancements in deep learning have transformed segmentation capabilities through the integration of fine-scale detail preservation, coarse-scale contextual modeling, and multi-scale feature fusion. This work provides a comprehensive analysis of state-of-the-art deep learning models, including U-Net variants, attention-based frameworks, and Transformer-integrated networks, highlighting innovations that improve accuracy, generalizability, and computational efficiency. Key architectural components such as convolution operations, shallow and deep blocks, skip connections, and hybrid encoders are examined for their roles in enhancing spatial representation and semantic consistency. We further discuss the importance of hierarchical and instance-aware segmentation and annotation in interpreting complex biological scenes and multiplexed medical images. By bridging methodological developments with diverse application domains, this paper outlines current trends and future directions for semantic segmentation, emphasizing its critical role in facilitating annotation, diagnosis, and discovery in biomedical research. 展开更多
关键词 Biomedical semantic segmentation multi-scale feature fusion fine-and coarse-scale features convolution operations shallow and deep blocks skip connections
在线阅读 下载PDF
ST-SIGMA:Spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting 被引量:6
2
作者 Yang Fang Bei Luo +3 位作者 Ting Zhao Dong He Bingbing Jiang Qilie Liu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2022年第4期744-757,共14页
Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving(AD)system.However,most proposed methods aim at addressing one of the two challenges... Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving(AD)system.However,most proposed methods aim at addressing one of the two challenges mentioned above with a single model.To tackle this dilemma,this paper proposes spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting(STSIGMA),an efficient end-to-end method to jointly and accurately perceive the AD environment and forecast the trajectories of the surrounding traffic agents within a unified framework.ST-SIGMA adopts a trident encoder-decoder architecture to learn scene semantics and agent interaction information on bird’s-eye view(BEV)maps simultaneously.Specifically,an iterative aggregation network is first employed as the scene semantic encoder(SSE)to learn diverse scene information.To preserve dynamic interactions of traffic agents,ST-SIGMA further exploits a spatio-temporal graph network as the graph interaction encoder.Meanwhile,a simple yet efficient feature fusion method to fuse semantic and interaction features into a unified feature space as the input to a novel hierarchical aggregation decoder for downstream prediction tasks is designed.Extensive experiments on the nuScenes data set have demonstrated that the proposed ST-SIGMA achieves significant improvements compared to the state-of-theart(SOTA)methods in terms of scene perception and trajectory forecasting,respectively.Therefore,the proposed approach outperforms SOTA in terms of model generalisation and robustness and is therefore more feasible for deployment in realworld AD scenarios. 展开更多
关键词 feature fusion graph interaction hierarchical aggregation scene perception scene semantics trajectory forecasting
在线阅读 下载PDF
Document Clustering Using Semantic Cliques Aggregation
3
作者 Ajit Kumar I-Jen Chiang 《Journal of Computer and Communications》 2015年第12期28-40,共13页
The search engines are indispensable tools to find information amidst massive web pages and documents. A good search engine needs to retrieve information not only in a shorter time, but also relevant to the users’ qu... The search engines are indispensable tools to find information amidst massive web pages and documents. A good search engine needs to retrieve information not only in a shorter time, but also relevant to the users’ queries. Most search engines provide short time retrieval to user queries;however, they provide a little guarantee of precision even to the highly detailed users’ queries. In such cases, documents clustering centered on the subject and contents might improve search results. This paper presents a novel method of document clustering, which uses semantic clique. First, we extracted the Features from the documents. Later, the associations between frequently co-occurring terms were defined, which were called as semantic cliques. Each connected component in the semantic clique represented a theme. The documents clustered based on the theme, for which we designed an aggregation algorithm. We evaluated the aggregation algorithm effectiveness using four kinds of datasets. The result showed that the semantic clique based document clustering algorithm performed significantly better than traditional clustering algorithms such as Principal Direction Divisive Partitioning (PDDP), k-means, Auto-Class, and Hierarchical Clustering (HAC). We found that the Semantic Clique Aggregation is a potential model to represent association rules in text and could be immensely useful for automatic document clustering. 展开更多
关键词 Document Clustering semantic CLIQUE ASSOCIATION aggregation THEME
在线阅读 下载PDF
BSDNet:Semantic Information Distillation-Based for Bilateral-Branch Real-Time Semantic Segmentation on Street Scene Image
4
作者 Huan Zeng Jianxun Zhang +1 位作者 Hongji Chen Xinwei Zhu 《Computers, Materials & Continua》 2025年第11期3879-3896,共18页
Semantic segmentation in street scenes is a crucial technology for autonomous driving to analyze the surrounding environment.In street scenes,issues such as high image resolution caused by a large viewpoints and diffe... Semantic segmentation in street scenes is a crucial technology for autonomous driving to analyze the surrounding environment.In street scenes,issues such as high image resolution caused by a large viewpoints and differences in object scales lead to a decline in real-time performance and difficulties in multi-scale feature extraction.To address this,we propose a bilateral-branch real-time semantic segmentationmethod based on semantic information distillation(BSDNet)for street scene images.The BSDNet consists of a Feature Conversion Convolutional Block(FCB),a Semantic Information Distillation Module(SIDM),and a Deep Aggregation Atrous Convolution Pyramid Pooling(DASP).FCB reduces the semantic gap between the backbone and the semantic branch.SIDM extracts high-quality semantic information fromthe Transformer branch to reduce computational costs.DASP aggregates information lost in atrous convolutions,effectively capturingmulti-scale objects.Extensive experiments conducted on Cityscapes,CamVid,and ADE20K,achieving an accuracy of 81.7% Mean Intersection over Union(mIoU)at 70.6 Frames Per Second(FPS)on Cityscapes,demonstrate that our method achieves a better balance between accuracy and inference speed. 展开更多
关键词 Street scene understanding real-time semantic segmentation knowledge distillation multi-scale feature extraction
在线阅读 下载PDF
A 3D semantic segmentation network for accurate neuronal soma segmentation
5
作者 Li Ma Qi Zhong +2 位作者 Yezi Wang Xiaoquan Yang Qian Du 《Journal of Innovative Optical Health Sciences》 2025年第1期67-83,共17页
Neuronal soma segmentation plays a crucial role in neuroscience applications.However,the fine structure,such as boundaries,small-volume neuronal somata and fibers,are commonly present in cell images,which pose a chall... Neuronal soma segmentation plays a crucial role in neuroscience applications.However,the fine structure,such as boundaries,small-volume neuronal somata and fibers,are commonly present in cell images,which pose a challenge for accurate segmentation.In this paper,we propose a 3D semantic segmentation network for neuronal soma segmentation to address this issue.Using an encoding-decoding structure,we introduce a Multi-Scale feature extraction and Adaptive Weighting fusion module(MSAW)after each encoding block.The MSAW module can not only emphasize the fine structures via an upsampling strategy,but also provide pixel-wise weights to measure the importance of the multi-scale features.Additionally,a dynamic convolution instead of normal convolution is employed to better adapt the network to input data with different distributions.The proposed MSAW-based semantic segmentation network(MSAW-Net)was evaluated on three neuronal soma images from mouse brain and one neuronal soma image from macaque brain,demonstrating the efficiency of the proposed method.It achieved an F1 score of 91.8%on Fezf2-2A-CreER dataset,97.1%on LSL-H2B-GFP dataset,82.8%on Thy1-EGFP-Mline dataset,and 86.9%on macaque dataset,achieving improvements over the 3D U-Net model by 3.1%,3.3%,3.9%,and 2.3%,respectively. 展开更多
关键词 Neuronal soma segmentation semantic segmentation network multi-scale feature extraction adaptive weighting fusion
原文传递
Global context-aware multi-scale feature iterative refinement for aviation-road traffic semantic segmentation
6
作者 Mengyue ZHANG Shichun YANG +1 位作者 Xinjie FENG Yaoguang CAO 《Chinese Journal of Aeronautics》 2026年第2期429-441,共13页
Semantic segmentation for mixed scenes of aerial remote sensing and road traffic is one of the key technologies for visual perception of flying cars.The State-of-the-Art(SOTA)semantic segmentation methods have made re... Semantic segmentation for mixed scenes of aerial remote sensing and road traffic is one of the key technologies for visual perception of flying cars.The State-of-the-Art(SOTA)semantic segmentation methods have made remarkable achievements in both fine-grained segmentation and real-time performance.However,when faced with the huge differences in scale and semantic categories brought about by the mixed scenes of aerial remote sensing and road traffic,they still face great challenges and there is little related research.Addressing the above issue,this paper proposes a semantic segmentation model specifically for mixed datasets of aerial remote sensing and road traffic scenes.First,a novel decoding-recoding multi-scale feature iterative refinement structure is proposed,which utilizes the re-integration and continuous enhancement of multi-scale information to effectively deal with the huge scale differences between cross-domain scenes,while using a fully convolutional structure to ensure the lightweight and real-time requirements.Second,a welldesigned cross-window attention mechanism combined with a global information integration decoding block forms an enhanced global context perception,which can effectively capture the long-range dependencies and multi-scale global context information of different scenes,thereby achieving fine-grained semantic segmentation.The proposed method is tested on a large-scale mixed dataset of aerial remote sensing and road traffic scenes.The results confirm that it can effectively deal with the problem of large-scale differences in cross-domain scenes.Its segmentation accuracy surpasses that of the SOTA methods,which meets the real-time requirements. 展开更多
关键词 Aviation-road traffic Flying cars Global context-aware multi-scale feature iterative refinement semantic segmentation
原文传递
A Lightweight Road Scene Semantic Segmentation Algorithm 被引量:1
7
作者 Jiansheng Peng Qing Yang Yaru Hou 《Computers, Materials & Continua》 SCIE EI 2023年第11期1929-1948,共20页
In recent years,with the continuous deepening of smart city construction,there have been significant changes and improvements in the field of intelligent transportation.The semantic segmentation of road scenes has imp... In recent years,with the continuous deepening of smart city construction,there have been significant changes and improvements in the field of intelligent transportation.The semantic segmentation of road scenes has important practical significance in the fields of automatic driving,transportation planning,and intelligent transportation systems.However,the current mainstream lightweight semantic segmentation models in road scene segmentation face problems such as poor segmentation performance of small targets and insufficient refinement of segmentation edges.Therefore,this article proposes a lightweight semantic segmentation model based on the LiteSeg model improvement to address these issues.The model uses the lightweight backbone network MobileNet instead of the LiteSeg backbone network to reduce the network parameters and computation,and combines the Coordinate Attention(CA)mechanism to help the network capture long-distance dependencies.At the same time,by combining the dependencies of spatial information and channel information,the Spatial and Channel Network(SCNet)attention mechanism is proposed to improve the feature extraction ability of the model.Finally,a multiscale transposed attention encoding(MTAE)module was proposed to obtain features of different resolutions and perform feature fusion.In this paper,the proposed model is verified on the Cityscapes dataset.The experimental results show that the addition of SCNet and MTAE modules increases the mean Intersection over Union(mIoU)of the original LiteSeg model by 4.69%.On this basis,the backbone network is replaced with MobileNet,and the CA model is added at the same time.At the cost of increasing the minimum model parameters and computing costs,the mIoU of the original LiteSeg model is increased by 2.46%.This article also compares the proposed model with some current lightweight semantic segmentation models,and experiments show that the comprehensive performance of the proposed model is the best,especially in achieving excellent results in small object segmentation.Finally,this article will conduct generalization testing on the KITTI dataset for the proposed model,and the experimental results show that the proposed algorithm has a certain degree of generalization. 展开更多
关键词 semantic segmentation LIGHTWEIGHT road scenes multi-scale transposition attention encoding(MTAE)
在线阅读 下载PDF
A Multi-Scale Network with the Encoder-Decoder Structure for CMR Segmentation 被引量:1
8
作者 Chaoyang Xia Jing Peng +1 位作者 Zongqing Ma Xiaojie Li 《Journal of Information Hiding and Privacy Protection》 2019年第3期109-117,共9页
Cardiomyopathy is one of the most serious public health threats.The precise structural and functional cardiac measurement is an essential step for clinical diagnosis and follow-up treatment planning.Cardiologists are ... Cardiomyopathy is one of the most serious public health threats.The precise structural and functional cardiac measurement is an essential step for clinical diagnosis and follow-up treatment planning.Cardiologists are often required to draw endocardial and epicardial contours of the left ventricle(LV)manually in routine clinical diagnosis or treatment planning period.This task is time-consuming and error-prone.Therefore,it is necessary to develop a fully automated end-to-end semantic segmentation method on cardiac magnetic resonance(CMR)imaging datasets.However,due to the low image quality and the deformation caused by heartbeat,there is no effective tool for fully automated end-to-end cardiac segmentation task.In this work,we propose a multi-scale segmentation network(MSSN)for left ventricle segmentation.It can effectively learn myocardium and blood pool structure representations from 2D short-axis CMR image slices in a multi-scale way.Specifically,our method employs both parallel and serial of dilated convolution layers with different dilation rates to capture multi-scale semantic features.Moreover,we design graduated up-sampling layers with subpixel layers as the decoder to reconstruct lost spatial information and produce accurate segmentation masks.We validated our method using 164 T1 Mapping CMR images and showed that it outperforms the advanced convolutional neural network(CNN)models.In validation metrics,we archived the Dice Similarity Coefficient(DSC)metric of 78.96%. 展开更多
关键词 Cardiac magnetic resonance imaging multi-scale semantic segmentation convolutional neural networks
暂未订购
Development and Research on the Internet Resources Aggregation Platform
9
作者 XU Youwei TANG Shengqun YANG Yan XU Yang LI Ling 《Wuhan University Journal of Natural Sciences》 CAS 2007年第2期243-248,共6页
We proposed an Intemet resource aggregation platform based on semantic web. The platform includes an Web Ontology Language(OWL) ontology design toolkit(VO-Editor) and a selective inference algorithm engine so that... We proposed an Intemet resource aggregation platform based on semantic web. The platform includes an Web Ontology Language(OWL) ontology design toolkit(VO-Editor) and a selective inference algorithm engine so that it can visually editing ontology and using novel selective reasoning for information aggregation. We introduce the VO-Editor and the principle of selective inference algorithm. At last a case of budget travel system is used to interpret the approach of Internet resources aggregation by this platform. 展开更多
关键词 semantic web ONTOLOGY information aggregation INFERENCE Web Ontology Language
在线阅读 下载PDF
Multi-scale enhancement and aggregation network for singleimage deraining
10
作者 Rui Zhang Yuetong Liu +3 位作者 Huijian Han Yong Zheng Tao Zhang Yunfeng Zhang 《Computational Visual Media》 2025年第1期213-226,共14页
Rain streaks in an image appear in different sizes and orientations,resulting in severe blurring and visual quality degradation.Previous CNNbased algorithms have achieved encouraging deraining results although there a... Rain streaks in an image appear in different sizes and orientations,resulting in severe blurring and visual quality degradation.Previous CNNbased algorithms have achieved encouraging deraining results although there are certain limitations in the description of rain streaks and the restoration of scene structures in different environments.In this paper,we propose an efficient multi-scale enhancement and aggregation network(MEAN)to solve the single-image deraining problem.Considering the importance of large receptive fields and multi-scale features,we introduce a multi-scale enhanced unit(MEU)to capture longrange dependencies and exploit features at different scales to depict rain.Simultaneously,an attentive aggregation unit(AAU)is designed to utilize the informative features in spatial and channel dimensions,thereby aggregating effective information to eliminate redundant features for rich scenario details.To improve the deraining performance of the encoder–decoder network,we utilized an AAU to filter the information in the encoder network and concatenated the useful features to the decoder network,which is conducive to predicting high-quality clean images.Experimental results on synthetic datasets and real-world samples show that the proposed method achieves a significant deraining performance compared to state-of-the-art approaches. 展开更多
关键词 single-image deraining multi-scale enhan-cement and aggregation(MEA) encoder-decoder network
原文传递
顾及距离衰减效应的地理知识图谱补全方法
11
作者 赫晓慧 李爽 +1 位作者 孔锦澜 田智慧 《地球信息科学学报》 北大核心 2026年第2期273-286,共14页
【目的】地理知识图谱(GeoKG)通过知识图谱的形式化技术,将地理知识表示为计算机可解释、可复用、可推理的知识网络。但由于地理信息分布的稀疏性以及更新的落后性,地理知识图谱往往是不完整的,制约着其应用广度和深度,需要地理知识图... 【目的】地理知识图谱(GeoKG)通过知识图谱的形式化技术,将地理知识表示为计算机可解释、可复用、可推理的知识网络。但由于地理信息分布的稀疏性以及更新的落后性,地理知识图谱往往是不完整的,制约着其应用广度和深度,需要地理知识图谱补全方法来解决其不完整的问题。然而,现有补全方法未充分考虑到地理知识图谱中的语义信息以及地理实体间的交互遵循距离衰减效应,致使嵌入空间难以充分还原地理实体和关系的真实分布,从而限制了补全性能的提升。【方法】本文提出了一种顾及距离衰减效应的地理知识图谱补全方法DDGKGC(Distance-Decaying Effect-Aware Geographic Knowledge Graph Completion method)。该方法首先通过语义信息聚合模块和距离衰减效应感知模块,捕获实体和关系间的语义信息和距离信息;然后,通过基于双注意力机制的表示学习模块自适应地学习实体和关系的邻域信息,得到实体和关系的嵌入表示,最后通过ConvE得分函数进行评分预测,并使用预测结果来完成地理知识图谱补全任务。【结果】为全面评估模型性能,本文在自构建数据集Multi-Geo、CityDirection、CountyDistance及公开数据集Countries-S3上进行了对比实验、消融实验和多维度分析验证。实验结果表明,DDGKGC在MRR、Hits@1、Hits@3、Hits@10等多项指标上表现出色,尤其在全面反映模型性能的MRR指标上相较于对比方法在4个数据集上分别提升4%、3.1%、1.8%和5.2%。此外,通过多维度分析验证评估,证明了DDGKGC能够更合理地建模地理实体关系间的空间和语义关联,从而提升补全结果的准确性与地理合理性。【结论】本文提出的顾及距离衰减效应的地理知识图谱补全方法,不仅有效提升了地理知识图谱补全任务的性能,还展现出良好的泛化能力与应用潜力,同时也为地理知识图谱的深化应用提供了可靠支撑。 展开更多
关键词 地理知识图谱 地理知识图谱补全 距离衰减效应 语义信息聚合 实体关系表示 注意力机制
原文传递
多尺度聚合协同轴向语义引导的实体关系联合抽取方法
12
作者 钱清 陈辉程 +2 位作者 崔允贺 唐瑞雪 付金玫 《计算机科学》 北大核心 2026年第3期97-106,共10页
近年来,基于表填充的实体关系联合抽取方法取得了显著效果,但现有研究尚未考虑到词对间的边界关联性建模,以及构建词对语义相似性问题。为解决上述问题,提出了一种基于多尺度聚合协同轴向语义引导的实体关系联合抽取模型。首先,设计的... 近年来,基于表填充的实体关系联合抽取方法取得了显著效果,但现有研究尚未考虑到词对间的边界关联性建模,以及构建词对语义相似性问题。为解决上述问题,提出了一种基于多尺度聚合协同轴向语义引导的实体关系联合抽取模型。首先,设计的多尺度语义聚合模块通过并行多个不同尺寸的深度卷积提取不同排列下词对间的边界关联信息,从而丰富词对语义,识别隐形实体。其次,轴向语义引导模块通过行列带状卷积从轴向上进行卷积注意力校准,强化词对关键语义表征,从而改善词对间语义相似问题。最后,在数据集NYT*,WebNLG*,NYT和WebNLG上进行实验,该方法分别取得了93.2%,94.5%,93.2%和91.4%的F1得分,相较于基线模型分别提高了0.1个百分点、0.6个百分点、0.4个百分点和1.0个百分点,表明其能够捕获词对边界关联以及精细化词对语义,提升了实体关系联合抽取的性能。 展开更多
关键词 自然语言处理 实体关系联合抽取 多尺度语义聚合 轴向语义引导 卷积注意力
在线阅读 下载PDF
融合轨迹时空感知与自适应语义聚焦的视频描述方法
13
作者 肖景富 姜文晖 +2 位作者 方玉明 方承炀 赵小伟 《中国图象图形学报》 北大核心 2026年第2期556-572,共17页
目的 视频内容描述任务旨在自动生成自然语言句子,精准表达视频视觉语义信息。尽管编码器—解码器方法在视觉表达与语言生成上已有进展,但视频编码器难以建模目标级运动与事件,解码器也难以实现跨模态语义对齐,限制了生成文本质量。为此... 目的 视频内容描述任务旨在自动生成自然语言句子,精准表达视频视觉语义信息。尽管编码器—解码器方法在视觉表达与语言生成上已有进展,但视频编码器难以建模目标级运动与事件,解码器也难以实现跨模态语义对齐,限制了生成文本质量。为此,提出融合轨迹时空感知与自适应语义聚焦的方法,以增强目标运动建模能力并改善多模态语义对齐。方法 首先,提出基于点轨迹的视觉特征聚合方法,通过时空建模生成兼具空间外观与时间连续性的轨迹特征,并与局部运动特征融合,以增强模型在运动和形变场景下的目标追踪能力和语义连贯性;同时,设计无监督自适应关键轨迹聚焦学习方法,利用密集点轨迹动态信息,通过注意力权重自适应筛选关键轨迹并引入聚焦损失,引导模型优先关注关键语义区域、抑制背景干扰,从而提升跨模态语义关联能力。结果 在MSRVTT(Microsoft research video to text)和MSVD(Microsoft research video description corpus)两个公开数据集上进行实验,所提方法在CIDEr(consensus-based image description evaluation)指标上分别取得61.2和130.1的得分,显著优于现有主流方法,验证了所提方法在描述准确性与语义丰富性方面的有效性。定性分析表明,该方法在提升描述的时序连贯性和语义表达能力方面表现优异。结论 本文方法有效提升了视频描述模型在复杂动态环境下的目标语义连续性建模能力,并通过无监督的自适应关键轨迹聚焦学习方法改善了注意力机制对视频与文本语义关联的能力。 展开更多
关键词 视频内容描述 多模态语义关联 时空点轨迹 特征聚合 自适应语义聚焦
原文传递
Tea Leaf Disease Diagnosis Based on Improved Lightweight U-Net3+
14
作者 HU Yumeng GUAN Feifan +5 位作者 XIE Dongchen MA Ping YU Youben ZHOU Jie NIE Yanming HUANG Lüwen 《智慧农业(中英文)》 2026年第1期15-27,共13页
[Objective]Leaf diseases significantly affect both the yield and quality of tea throughout the year.To address the issue of inadequate segmentation finesse in the current tea spot segmentation models,a novel diagnosis... [Objective]Leaf diseases significantly affect both the yield and quality of tea throughout the year.To address the issue of inadequate segmentation finesse in the current tea spot segmentation models,a novel diagnosis of the severity of tea spots was proposed in this research,designated as MDC-U-Net3+,to enhance segmentation accuracy on the base framework of U-Net3+.[Methods]Multi-scale feature fusion module(MSFFM)was incorporated into the backbone network of U-Net3+to obtain feature information across multiple receptive fields of diseased spots,thereby reducing the loss of features within the encoder.Dual multi-scale attention(DMSA)was incorporated into the skip connection process to mitigate the segmentation boundary ambiguity issue.This integration facilitates the comprehensive fusion of fine-grained and coarse-grained semantic information at full scale.Furthermore,the segmented mask image was subjected to conditional random fields(CRF)to enhance the optimization of the segmentation results[Results and Discussions]The improved model MDC-U-Net3+achieved a mean pixel accuracy(mPA)of 94.92%,accompanied by a mean Intersection over Union(mIoU)ratio of 90.9%.When compared to the mPA and mIoU of U-Net3+,MDC-U-Net3+model showed improvements of 1.85 and 2.12 percentage points,respectively.These results illustrated a more effective segmentation performance than that achieved by other classical semantic segmentation models.[Conclusions]The methodology presented herein could provide data support for automated disease detection and precise medication,consequently reducing the losses associated with tea diseases. 展开更多
关键词 disease diagnosis semantic segmentation U-Net3+ multi-scale feature fusion attention mechanism conditional random fields
在线阅读 下载PDF
Semantic Interoperability Aggregation in Service Requirements Refinement 被引量:7
15
作者 何克清 王健 梁鹏 《Journal of Computer Science & Technology》 SCIE EI CSCD 2010年第6期1103-1117,共15页
Semantic refinement of stakeholders' requirements is a fundamental issue in requirements engineering. Facing with the on-demand collaboration problem among the heterogeneous, autonomous, and dynamic service resources... Semantic refinement of stakeholders' requirements is a fundamental issue in requirements engineering. Facing with the on-demand collaboration problem among the heterogeneous, autonomous, and dynamic service resources in the Web, service requirements refinement becomes extremely important, and the key issue in service requirements refinement is semantic interoperability aggregation. A method for creating connecting ontologies driven by requirement sign ontology is proposed. Based on connecting ontologies, a method for semantic interoperability aggregation in requirements refinement is proposed. In addition, we discover that the necessary condition for semantic interoperability is semantic similarity, and the sufficient condition is the coverability of the agreed mediation ontology. Based on this viewpoint, a metric framework for calculating semantic interoperability capability is proposed. This methodology can provide a semantic representation mechanism for refining users' requirements; meanwhile, since users' requirements in the Web usually originate from different domains, it can also provide semantic interoperability guidance for networked service discovery, and is an effective approach for the realization of on-demand service integration. The methodology will be beneficial in service-oriented software engineering and cloud computing. 展开更多
关键词 connecting ontologies requirements refinement semantic interoperability aggregation
原文传递
高效稀疏特征聚合的点云语义分割方法 被引量:1
16
作者 胡立坤 王小勇 黄润辉 《广西大学学报(自然科学版)》 北大核心 2025年第3期558-569,共12页
针对现有大规模场景点云语义分割方法效率低、难以满足实时性和大规模场景边界分割精度低的问题,提出一种高效稀疏特征聚合的点云语义分割方法。该方法以锥形栅格表述输入点云,设计高效稀疏特征聚合模块学习上下文语义特征,解决了特征... 针对现有大规模场景点云语义分割方法效率低、难以满足实时性和大规模场景边界分割精度低的问题,提出一种高效稀疏特征聚合的点云语义分割方法。该方法以锥形栅格表述输入点云,设计高效稀疏特征聚合模块学习上下文语义特征,解决了特征提取计算量大、内存效率低的问题;通过邻域内语义标签单一性设计边界损失函数,解决物体边界模糊问题。实验表明:该方法在SemanticKITTI和nuScenes数据集上的语义分割平均交并比(mIoU)分别达到66.9%和74.1%,相比算法VCL分别提高了3.3、3.6个百分点;在SemanticKITTI验证集上推理速度达到19.2 Hz,远超该数据集点云采集频率10 Hz,满足实时性要求。本文方法能够更高效地提取稀疏语义特征,并能对物体边界进行准确分割。 展开更多
关键词 稀疏特征聚合 边界损失 语义分割 点云
在线阅读 下载PDF
基于邻居信息聚合的无配对跨模态检索重排序
17
作者 沃焱 梁展扬 《华南理工大学学报(自然科学版)》 北大核心 2025年第11期18-26,共9页
重排序方法作为一种后处理技术,在跨模态检索任务中展现出了显著的效果,它通过挖掘、处理初始排序列表之间的信息,有效提高了检索的准确性。当前主流的跨模态检索重排序方法是在数据集有配对的情况下对初始列表进行重排序,灵活性差,使... 重排序方法作为一种后处理技术,在跨模态检索任务中展现出了显著的效果,它通过挖掘、处理初始排序列表之间的信息,有效提高了检索的准确性。当前主流的跨模态检索重排序方法是在数据集有配对的情况下对初始列表进行重排序,灵活性差,使用时需对原来的框架进行修改并重新训练,无法灵活地迁移到其他框架上;此外,它们无法应用于无配对情景。依赖于大规模配对数据集,跨模态检索目前取得了显著的进展,但忽视了实际场景中标注大规模数据集需耗费大量资源的问题。鉴于此,该文提出了一种基于邻居信息聚合的无配对跨模态检索重排序方法。该方法通过挖掘并利用样本的邻居信息,使错误的答案远离查询输入;通过搜索欧氏邻域中的局部邻居,并基于协同表达搜索全局邻居表达样本的邻居信息,将这两种邻居信息加以融合生成新特征,再重新计算与检索输入的语义相似性,完成重排序。将该方法置于多种跨模态检索框架作为后处理方法,并在MSCOCO数据集上进行实验,结果证明了该方法的有效性以及相对于其他重排序方法的优越性。 展开更多
关键词 跨模态检索 重排序方法 邻居信息聚合 全局语义邻居 局部语义邻居
在线阅读 下载PDF
基于语义先验和纹理增强引导的壁画修复算法
18
作者 陈永 赵梦雪 +1 位作者 杜婉君 张世龙 《湖南大学学报(自然科学版)》 北大核心 2025年第8期1-13,共13页
针对现有深度学习方法修复壁画过程中,未充分利用完好区域壁画语义和纹理等先验信息引导壁画修复,导致修复结果欠佳的问题,提出了一种基于语义先验和纹理增强引导的壁画修复算法.首先,设计语义先验学习模块,通过像素折叠操作将原始壁画... 针对现有深度学习方法修复壁画过程中,未充分利用完好区域壁画语义和纹理等先验信息引导壁画修复,导致修复结果欠佳的问题,提出了一种基于语义先验和纹理增强引导的壁画修复算法.首先,设计语义先验学习模块,通过像素折叠操作将原始壁画语义特征映射到语义先验学习器,利用原始语义特征引导残缺特征修复,逐渐缩减破损语义特征与原始语义特征的差异.然后,设计纹理增强模块,通过融合上下文信息模块增强纹理细节并将其融合,完成壁画纹理特征修复.最后,设计聚合引导模块,将语义先验修复结果和纹理增强结果进行融合并解码至原始分辨率,并通过与马尔可夫判别器对抗博弈,完成破损壁画的修复.敦煌壁画数字化分类修复实验表明:所提方法在主客观评价上均优于比较算法,取得了更好的修复结果. 展开更多
关键词 壁画修复 语义先验 像素折叠 纹理增强 聚合引导
在线阅读 下载PDF
基于点云加权邻域聚合的语义分割方法
19
作者 李新 孙钰奇 +1 位作者 宋刘广 曾佳全 《软件导刊》 2025年第3期162-169,共8页
点云数据本身无序且形状不规则,给现有语义分割算法带来了挑战,因为现有算法通常采用固定的特征聚合策略,导致对数据分布缺乏适应性。针对该问题,提出一种基于点云加权聚合的语义分割方法。首先,加权聚合模块通过一种可训练方式,为邻域... 点云数据本身无序且形状不规则,给现有语义分割算法带来了挑战,因为现有算法通常采用固定的特征聚合策略,导致对数据分布缺乏适应性。针对该问题,提出一种基于点云加权聚合的语义分割方法。首先,加权聚合模块通过一种可训练方式,为邻域点生成加权系数,从而实现邻域特征的加权聚合,显著增强邻域特征表达能力。其次,开发了一个基于距离加权的逆残差模块Weighted InvresMLP,进一步提高特征提取深度和效率。最后,在这些模块的基础上,设计了一个端到端的点云语义分割框架Weighted Local Aggregation Neural Network(WLA-Net)。在大规模公共数据集S3DIS和ScanNet上进行广泛实验后,证明所提出方法著提高了网络拟合能力,与其他方法相比具有更高的精度。 展开更多
关键词 点云数据 语义分割 特征聚合 加权逆残差
在线阅读 下载PDF
结合目标特征增强与语义-位置路径聚合的水下目标检测 被引量:1
20
作者 宋巍 倪舟 +2 位作者 梁纪辰 张明华 王建 《计算机工程与应用》 北大核心 2025年第15期93-110,共18页
针对水下图像质量差、目标多尺度和严重遮挡导致的漏检和误检等问题,提出一种结合目标信息增强与语义-位置路径聚合的水下目标检测模型。该模型以RT-DETR框架为基础,提出了边缘特征多尺度注入模块(multiscale injection for edge featur... 针对水下图像质量差、目标多尺度和严重遮挡导致的漏检和误检等问题,提出一种结合目标信息增强与语义-位置路径聚合的水下目标检测模型。该模型以RT-DETR框架为基础,提出了边缘特征多尺度注入模块(multiscale injection for edge features module,MSI-Edge),将边缘信息注入深层网络中,强化了模型对小目标的感知能力;同时,提出了全局-局部特征增强模块(global-local feature enhancement module,GLF-Enhance)来替代编码器中的传统多头自注意力机制,增强对目标全局和局部信息的学习能力,并加速模型推理;进而,设计了一种新的结合语义-位置路径聚合网络(semantic-location path aggregation network,SL-PAN),利用高层特征作为权重来指导低层特征中的语义信息学习,再使用低层特征作为权重来指导高层特征中的位置信息学习,从而有效缓解多尺度特征融合过程中信息传递退化的问题。在公开水下数据集上进行实验验证,相较基准模型RT-DETR(ResNet50主干网络),在URPC数据集上AP、AP^(50)、AP^(75)指标分别提升了约3.2、3.0和2.7个百分点;在DUO数据集上分别提升了2.9、2.7、3.0个百分点,同时有效降低了误检和漏检率。消融实验验证了各模块的有效性。整体性能与主流目标检测器及最新水下目标检测器相比,达到了较好水平。 展开更多
关键词 水下目标检测 语义-位置路径聚合网络 边缘特征多尺度注入 RT-DETR模型 全局-局部特征增强
在线阅读 下载PDF
上一页 1 2 9 下一页 到第
使用帮助 返回顶部