期刊文献+
共找到489篇文章
< 1 2 25 >
每页显示 20 50 100
Fusion Prototypical Network for 3D Scene Graph Prediction
1
作者 Jiho Bae Bogyu Choi +1 位作者 Sumin Yeon Suwon Lee 《Computer Modeling in Engineering & Sciences》 2025年第6期2991-3003,共13页
Scene graph prediction has emerged as a critical task in computer vision,focusing on transforming complex visual scenes into structured representations by identifying objects,their attributes,and the relationships amo... Scene graph prediction has emerged as a critical task in computer vision,focusing on transforming complex visual scenes into structured representations by identifying objects,their attributes,and the relationships among them.Extending this to 3D semantic scene graph(3DSSG)prediction introduces an additional layer of complexity because it requires the processing of point-cloud data to accurately capture the spatial and volumetric characteristics of a scene.A significant challenge in 3DSSG is the long-tailed distribution of object and relationship labels,causing certain classes to be severely underrepresented and suboptimal performance in these rare categories.To address this,we proposed a fusion prototypical network(FPN),which combines the strengths of conventional neural networks for 3DSSG with a Prototypical Network.The former are known for their ability to handle complex scene graph predictions while the latter excels in few-shot learning scenarios.By leveraging this fusion,our approach enhances the overall prediction accuracy and substantially improves the handling of underrepresented labels.Through extensive experiments using the 3DSSG dataset,we demonstrated that the FPN achieves state-of-the-art performance in 3D scene graph prediction as a single model and effectively mitigates the impact of the long-tailed distribution,providing a more balanced and comprehensive understanding of complex 3D environments. 展开更多
关键词 3D scene graph prediction prototypical network 3D scene understanding
在线阅读 下载PDF
An attention-based prototypical network for forest fire smoke few-shot detection 被引量:3
2
作者 Tingting Li Haowei Zhu +1 位作者 Chunhe Hu Junguo Zhang 《Journal of Forestry Research》 SCIE CAS CSCD 2022年第5期1493-1504,共12页
Existing almost deep learning methods rely on a large amount of annotated data, so they are inappropriate for forest fire smoke detection with limited data. In this paper, a novel hybrid attention-based few-shot learn... Existing almost deep learning methods rely on a large amount of annotated data, so they are inappropriate for forest fire smoke detection with limited data. In this paper, a novel hybrid attention-based few-shot learning method, named Attention-Based Prototypical Network, is proposed for forest fire smoke detection. Specifically, feature extraction network, which consists of convolutional block attention module, could extract high-level and discriminative features and further decrease the false alarm rate resulting from suspected smoke areas. Moreover, we design a metalearning module to alleviate the overfitting issue caused by limited smoke images, and the meta-learning network enables achieving effective detection via comparing the distance between the class prototype of support images and the features of query images. A series of experiments on forest fire smoke datasets and miniImageNet dataset testify that the proposed method is superior to state-of-the-art few-shot learning approaches. 展开更多
关键词 Forest fire smoke detection Few-shot learning Channel attention module Spatial attention module prototypical network
在线阅读 下载PDF
Prototypical Network Based on Manhattan Distance 被引量:1
3
作者 Zengchen Yu Ke Wang +2 位作者 Shuxuan Xie Yuanfeng Zhong Zhihan Lv 《Computer Modeling in Engineering & Sciences》 SCIE EI 2022年第5期655-675,共21页
Few-shot Learning algorithms can be effectively applied to fields where certain categories have only a small amount of data or a small amount of labeled data,such as medical images,terrorist surveillance,and so on.The... Few-shot Learning algorithms can be effectively applied to fields where certain categories have only a small amount of data or a small amount of labeled data,such as medical images,terrorist surveillance,and so on.The Metric Learning in the Few-shot Learning algorithmis classified by measuring the similarity between the classified samples and the unclassified samples.This paper improves the Prototypical Network in the Metric Learning,and changes its core metric function to Manhattan distance.The Convolutional Neural Network of the embedded module is changed,and mechanisms such as average pooling and Dropout are added.Through comparative experiments,it is found that thismodel can converge in a small number of iterations(below 15,000 episodes),and its performance exceeds algorithms such asMAML.Research shows that replacingManhattan distance with Euclidean distance can effectively improve the classification effect of the Prototypical Network,and mechanisms such as average pooling and Dropout can also effectively improve the model. 展开更多
关键词 Few-shot Learning prototypical Network Convolutional Neural Network Manhattan distance
在线阅读 下载PDF
Few-shot image recognition based on multi-scale features prototypical network
4
作者 LIU Jiatong DUAN Yong 《High Technology Letters》 EI CAS 2024年第3期280-289,共10页
In order to improve the models capability in expressing features during few-shot learning,a multi-scale features prototypical network(MS-PN)algorithm is proposed.The metric learning algo-rithm is employed to extract i... In order to improve the models capability in expressing features during few-shot learning,a multi-scale features prototypical network(MS-PN)algorithm is proposed.The metric learning algo-rithm is employed to extract image features and project them into a feature space,thus evaluating the similarity between samples based on their relative distances within the metric space.To sufficiently extract feature information from limited sample data and mitigate the impact of constrained data vol-ume,a multi-scale feature extraction network is presented to capture data features at various scales during the process of image feature extraction.Additionally,the position of the prototype is fine-tuned by assigning weights to data points to mitigate the influence of outliers on the experiment.The loss function integrates contrastive loss and label-smoothing to bring similar data points closer and separate dissimilar data points within the metric space.Experimental evaluations are conducted on small-sample datasets mini-ImageNet and CUB200-2011.The method in this paper can achieve higher classification accuracy.Specifically,in the 5-way 1-shot experiment,classification accuracy reaches 50.13%and 66.79%respectively on these two datasets.Moreover,in the 5-way 5-shot ex-periment,accuracy of 66.79%and 85.91%are observed,respectively. 展开更多
关键词 few-shot learning multi-scale feature prototypical network channel attention label-smoothing
在线阅读 下载PDF
Prototypicality Gradient and Similarity Measure: A Semiotic-Based Approach Dedicated to Ontology Personalization
5
作者 X. Aime F. Furst +1 位作者 P. Kuntz F. Trichet 《Intelligent Information Management》 2010年第2期65-79,共15页
This paper introduces a new approach dedicated to the Ontology Personalization. Inspired by works in Cognitive Psychology, our work is based on a process which aims at capturing the user-sensitive relevance of the cat... This paper introduces a new approach dedicated to the Ontology Personalization. Inspired by works in Cognitive Psychology, our work is based on a process which aims at capturing the user-sensitive relevance of the categorization process, that is the one which is really perceived by the end-user. Practically, this process consists in decorating the Specialization/Generalization links (i.e. the is-a links) of the hierarchy of concepts with 2 gradients. The goal of the first gradient, called Conceptual Prototypicality Gradient, is to capture the user-sensitive relevance of the categorization process, that is the one which is perceived by the end-user. As this gradient is defined according to the three aspects of the semiotic triangle (i.e. intentional, extensional and expressional dimension), we call it Semiotic based Prototypicality Gradient. The objective of the second gradient, called Lexical Prototypicality Gradient, is to capture the user-sensitive relevance of the lexicalization process, i.e. the definition of a set of terms used to denote a concept. These gradients enrich the initial formal semantics of an ontology by adding a pragmatics defined according to a context of use which depends on parameters like culture, educational background and/or emotional context of the end-user. This paper also introduces a new similarity measure also defined in the context of a semiotic-based approach. The first originality of this measure, called SEMIOSEM, is to consider the three semiotic dimensions of the conceptualization underlying an ontology. Thus, SEMIOSEM aims at aggregating and improving existing extensional-based and intentional-based measures. The second originality of this measure is to be context-sensitive, and in particular user-sensitive. This makes SEMIOSEM more flexible, more robust and more close to the end-user’s judgment than the other similarity measures which are usually only based on one aspect of a conceptualization and never take the end-user’s perceptions and purposes into account. 展开更多
关键词 Semantic Measure Conceptual prototypicalITY LEXICAL prototypicalITY GRADIENT Ontology PERSONALIZATION SEMIOTICS
暂未订购
A Prototypical Study of English Intonation
6
作者 梅丽 柴同文 《Sino-US English Teaching》 2006年第6期76-78,81,共4页
Starting from the traditional analysis of English intonation, the article discusses the Halliday's, Jackendoff's, and Brazil's views upon intonation. Then it explores the multiple meanings and its metonymic pattern... Starting from the traditional analysis of English intonation, the article discusses the Halliday's, Jackendoff's, and Brazil's views upon intonation. Then it explores the multiple meanings and its metonymic patterns of English high key in terms of the prototype theory. 展开更多
关键词 INTONATION FUNCTION PROTOTYPE
在线阅读 下载PDF
Prototypical motivation of polysemy
7
作者 YIN Ping 《Sino-US English Teaching》 2010年第4期50-53,共4页
This paper applies prototype theory to explain the motivation of polysemy. There are mainly 3 types of meaning model of polysemy, namely, radiation, concatenation and integrated model. According to prototype theory, i... This paper applies prototype theory to explain the motivation of polysemy. There are mainly 3 types of meaning model of polysemy, namely, radiation, concatenation and integrated model. According to prototype theory, in the semantic category formed by a polysemic word, category members are determined by prototype. They are the result of development from prototype to boundary. Connected by a network of overlapping similarities (i.e., family resemblances), category members present different degrees of prototypicality, but not all of them can represent the category. Only the prototype can fully embody the category. With the extension of the semantic category, the boundary of the category is fuzzy and begins to intersect another semantic category. 展开更多
关键词 prototype theory PROTOTYPE POLYSEMY
在线阅读 下载PDF
Prototypical clustered federated learning for heart rate prediction
8
作者 Yongjie YIN Hui RUAN +5 位作者 Yang CHEN Jiong CHEN Ziyue LI Xiang SU Yipeng ZHOU Qingyuan GONG 《Frontiers of Information Technology & Electronic Engineering》 2025年第10期1896-1912,共17页
Predicting future heart rate(HR)not only helps in detecting abnormal heart rhythms but also provides timely support for downstream health monitoring services.Existing methods for HR prediction encounter challenges,esp... Predicting future heart rate(HR)not only helps in detecting abnormal heart rhythms but also provides timely support for downstream health monitoring services.Existing methods for HR prediction encounter challenges,especially concerning privacy protection and data heterogeneity.To address these challenges,this paper proposes a novel HR prediction framework,PCFedH,which leverages personalized federated learning and prototypical contrastive learning to achieve stable clustering results and more accurate predictions.PCFedH contains two core modules:a prototypical contrastive learning-based federated clustering module,which characterizes data heterogeneity and enhances HR representation to facilitate more effective clustering,and a two-phase soft clustered federated learning module,which enables personalized performance improvements for each local model based on stable clustering results.Experimental results on two real-world datasets demonstrate the superiority of our approach over state-of-the-art methods,achieving an average reduction of 3.1%in the mean squared error across both datasets.Additionally,we conduct comprehensive experiments to empirically validate the effectiveness of the key components in the proposed method.Among these,the personalization component is identified as the most crucial aspect of our design,indicating its substantial impact on overall performance. 展开更多
关键词 Federated learning Heart rate prediction prototypical contrastive learning
原文传递
NPC:Negative Prototypical Contrasting for Label Disambiguation of Partial Label Learning
9
作者 Yu-Jie Jin Ya-Sha Wang Xu Chu 《Journal of Computer Science & Technology》 2025年第5期1386-1400,共15页
Partial label learning(PLL)learns under label ambiguity where each training instance is annotated with a set of candidate labels,among which only one is the ground-truth label.Recent advances showed that PLL can be pr... Partial label learning(PLL)learns under label ambiguity where each training instance is annotated with a set of candidate labels,among which only one is the ground-truth label.Recent advances showed that PLL can be promoted by combining label disambiguation with representation learning coherently,which achieved state-of-the-art performance.However,most of the existing deep PLL methods over-emphasize pulling the inaccurate pseudo-label-induced positive samples and fail to achieve a balance between the intra-class compactness and the inter-class separability,thus leading to a sub-optimal representation space.In this paper,we solve this issue by taking into account the pure negative supervision information which can be extracted perfectly from the non-candidate label set.Methodologically,we propose a novel framework Negative Prototypical Contrasting(NPC).The optimization objective of NPC contrasts each instance with its candidate prototypes against its negative prototypes,aiming at a sufficiently distinguishable representation space.Based on the learned representations,the label disambiguation process is performed in a moving-average style.Theoretically,we show that the objective of NPC is equivalent to solving a constrained maximum likelihood optimization.We also justify applying the moving average from the stochastic expectation-maximization perspective.Empirically,extensive experiments demonstrate that the proposed NPC method achieves state-of-the-art classification performance on various datasets,and even competes with its supervised counterparts. 展开更多
关键词 machine learning partial label learning weak supervision negative prototype expectation maximization
原文传递
Generating prototypical residential building geometry models using a new hybrid approach 被引量:1
10
作者 Yuanli Ma Wu Deng +3 位作者 Jing Xie Tim Heath Yeyu Xiang Yuanda Hong 《Building Simulation》 SCIE EI CSCD 2022年第1期17-28,共12页
Building prototyping has regularly been used in building performance analyses with statistically feasible models.The novelty of this research involves a new hybrid approach combining stratified sampling and k-means cl... Building prototyping has regularly been used in building performance analyses with statistically feasible models.The novelty of this research involves a new hybrid approach combining stratified sampling and k-means clustering to establish building geometry prototypes.The research focuses on residential buildings in Ningbo,China.Seventeen small residential districts(SRDs)containing 367 residential buildings were systemically selected for survey and data collection.The stratified sampling used building construction year as the main parameter to generate stratification.Floor numbers,shape coefficients,floor areas,and window-to-wall ratios were used as the four observations for k-means clustering.Based on this new approach,nine building geometry prototypes were identified and modelled.These statistically representative prototypes provide building geometrical information and characteristic-based evaluations for subsequent building performance analysis. 展开更多
关键词 building prototyping geometry models new hybrid approach Ningbo China
原文传递
Image-Based Air Quality Estimation by Few-Shot Learning
11
作者 Duc Cuong Pham Tien Duc Ngo Hoai Nam Vu 《Computers, Materials & Continua》 2025年第8期2959-2974,共16页
Air quality estimation assesses the pollution level in the air,supports public health warnings,and is a valuable tool in environmental management.Although air sensors have proven helpful in this task,sensors are often... Air quality estimation assesses the pollution level in the air,supports public health warnings,and is a valuable tool in environmental management.Although air sensors have proven helpful in this task,sensors are often expensive and difficult to install,while cameras are becoming more popular and accessible,from which images can be collected as data for deep learning models to solve the above task.This leads to another problem:several labeled images are needed to achieve high accuracy when deep-learningmodels predict air quality.In this research,we have threemain contributions:(1)Collect and publish an air quality estimation dataset,namely PTIT_AQED,including environmental image data and air quality;(2)Propose a deep learning model to predict air quality with few data,called PTIT_FAQE(PTIT Few-shot air quality estimation).We build PTIT_FAQE based on EfficientNet-a CNN architecture that ensures high performance in deep learning applications and Few-shot Learning with Prototypical Networks.This helps the model use only a fewtraining data but still achieve high accuracy in air quality estimation.And(3)conduct experiments to prove the superiority of PTIT_FAQE compared to other studies on both PTIT_AQED and APIN datasets.The results show that our model achieves an accuracy of 0.9278 and an F1-Score of 0.9139 on the PTIT_AQED dataset and an accuracy of 0.9467 and an F1-Score of 0.9371 on the APIN dataset,which demonstrate a significant performance improvement compared to previous studies.We also conduct detailed experiments to evaluate the impact of each component on model performance. 展开更多
关键词 Air quality estimation few-shot learning prototypical networks deep learning
在线阅读 下载PDF
基于Prototype反向蒸馏的无监督多类别异常检测
12
作者 何立仁 彭博 池明旻 《计算机科学》 北大核心 2025年第2期202-211,共10页
无监督异常检测因只需要正常样本进行训练而被广泛应用于工业质检等领域。直接将现有的单类别异常检测方法应用到多类别异常检测中会导致性能显著下降,其中基于知识蒸馏的异常检测方法将预训练的教师模型关于正常样本的特征知识蒸馏到... 无监督异常检测因只需要正常样本进行训练而被广泛应用于工业质检等领域。直接将现有的单类别异常检测方法应用到多类别异常检测中会导致性能显著下降,其中基于知识蒸馏的异常检测方法将预训练的教师模型关于正常样本的特征知识蒸馏到学生模型中,然而它们在多类别异常检测中存在无法保证学生模型只学习到正常样本知识的问题。文中提出一种基于反向知识蒸馏框架的无监督多类别异常检测方法(Prototype based Reverse Distillation,PRD),其通过Multi-class Normal Prototype模块和Sparse Prototype Recall训练策略来学习教师模型关于多类别正常样本特征的Prototype,并以此来过滤学生模型的输入特征,从而确保学生模型只学习到教师模型关于正常样本的特征知识。PRD在多种工业异常检测数据集上性能均超越了现有的SOTA方法,定性、定量和消融实验验证了PRD整体框架和内部模块的有效性。 展开更多
关键词 异常检测 无监督学习 Prototype学习 知识蒸馏 预训练特征
在线阅读 下载PDF
Scope,nature,and exploration significance of Ordos Basin during geological historical periods,NW China 被引量:1
13
作者 HE Dengfa CHENG Xiang +10 位作者 ZHANG Guowei ZHAO Wenzhi ZHAO Zhe LIU Xinshe BAO Hongping FAN Liyong ZOU Song KAI Baize MAO Danfeng XU Yanhua CHENG Changyu 《Petroleum Exploration and Development》 2025年第4期855-871,共17页
Based on the analysis of surface geological survey,exploratory well,gravity-magnetic-electric and seismic data,and through mapping the sedimentary basin and its peripheral orogenic belts together,this paper explores s... Based on the analysis of surface geological survey,exploratory well,gravity-magnetic-electric and seismic data,and through mapping the sedimentary basin and its peripheral orogenic belts together,this paper explores systematically the boundary,distribution,geological structure,and tectonic attributes of the Ordos prototype basin in the geological historical periods.The results show that the Ordos block is bounded to the west by the Engorwusu Fault Zone,to the east by the Taihangshan Mountain Piedmont Fault Zone,to the north by the Solonker-Xilamuron Suture Zone,and to the south by the Shangnan-Danfeng Suture Zone.The Ordos Basin boundary was the plate tectonic boundary during the Middle Proterozoic to Paleozoic,and the intra-continental deformation boundary in the Meso-Cenozoic.The basin survived as a marine cratonic basin covering the entire Ordos block during the Middle Proterozoic to Ordovician,a marine-continental transitional depression basin enclosed by an island arc uplift belt at the plate margin during the Carboniferous to Permian,a unified intra-continental lacustrine depression basin in the Triassic,and an intra-continental cratonic basin circled by a rift system in the Cenozoic.The basin scope has been decreasing till the present.The large,widespread prototype basin controlled the exploration area far beyond the present-day sedimentary basin boundary,with multiple target plays vertically.The Ordos Basin has the characteristics of a whole petroleum(or deposition)system.The Middle Proterozoic wide-rift system as a typical basin under the overlying Phanerozoic basin and the Cambrian-Ordovician passive margin basin and intra-cratonic depression in the deep-sited basin will be the important successions for oil and gas exploration in the coming years. 展开更多
关键词 basin boundary prototype basin tectonic attribute energy and ore deposit superimposed basin whole petroleum system oil and gas exploration area Ordos Basin
在线阅读 下载PDF
Design and low-power test of an HOM-damped normal-conducting cavity for WALS
14
作者 Cheng Wang Jian-Hao Tan +8 位作者 Ding-Hui Su Zi-He Gao Yu-Sen Guo Cheng-Cheng Xiao Yu-Xin Zhang Yuan-Cun Nie Wen-Cheng Fang Jian-Hua He Zhen-Tang Zhao 《Nuclear Science and Techniques》 2025年第6期80-90,共11页
Radio frequency(RF)cavities for advanced storage rings,also known as diffraction-limited storage rings,are under development.To this end,a competitive and promising approach involves normal-conducting continuous wave ... Radio frequency(RF)cavities for advanced storage rings,also known as diffraction-limited storage rings,are under development.To this end,a competitive and promising approach involves normal-conducting continuous wave technology.The design and preliminary test of a 499.654 MHz RF cavity for the Wuhan Advanced Light Source(WALS)based on specific beam parameters were conducted at the SSRF.Multi-objective evolutionary algorithms have been utilized to optimize RF properties,such as the power loss and power density,resulting in better performance in the continuous wave mode.Further improvements were made to suppress multipacting effects in the working area.To operate stably with the beam,higher-order mode dampers were applied to better address the coupling bunch instability than in previous designs,along with thermal analysis to achieve the desired RF performance.Comprehensive simulation studies demonstrated the stable operation of the RF cavity at the defined beam parameters in the WALS design.A prototype RF cavity was then developed,and the RF performance results in a low-power test showed good agreement with the design and simulation,exhibiting readiness for high-power experiments and operation. 展开更多
关键词 Continuous Wave MOEA Hom-damping Mechanical design Prototype testing
在线阅读 下载PDF
Research on the Protection Strategies of Traditional Villages from the Perspective of Architectural Typology: Taking Nuogang Village and Wengji Village in the Jingmai Mountain as Examples
15
作者 LAN Lan MA Yiqun LI Mingrui 《Journal of Landscape Research》 2025年第3期73-76,80,共5页
There are many traditional villages with well-preserved architectural types and images in the Jingmai Mountain,Yunnan Province.Through field investigations in traditional villages in the research area,this study appli... There are many traditional villages with well-preserved architectural types and images in the Jingmai Mountain,Yunnan Province.Through field investigations in traditional villages in the research area,this study applied the architectural typology,analyzed Nuogang Village of the Dai Nationality and Wengji Village of the Bulang Nationality from 3 perspectives of“point,line and surface”,explored the characteristics of village,architecture and landscape,and extracted the“prototypes”,tried to figure out the problems of the villages and then propose corresponding protection strategies,so as to support the preservation,renovation,improvement and utilization of traditional villages. 展开更多
关键词 Traditional village Architectural typology Jingmai Mountain PROTOTYPE Protection and development
在线阅读 下载PDF
Lanthanide nitric oxides(LnNO,Ln=La-Lu)present unique trend in bonding structure and oxidation states of Ln
16
作者 Zhi-Yu Wei Shu-Xian Hu 《Chinese Journal of Structural Chemistry》 2025年第9期11-13,共3页
Scientists have devoted considerable effort overs several decades to reduce automobile exhaust emissions,and one practical and important strategy is the catalytic conversion of nitric oxide(NO)[1].Previous studies hav... Scientists have devoted considerable effort overs several decades to reduce automobile exhaust emissions,and one practical and important strategy is the catalytic conversion of nitric oxide(NO)[1].Previous studies have shown that lanthanide(Ln)metals can catalytically reduce NO.Thus,the reactions of NO with Ln to form lanthanide-nitric oxide(LnNO)complexes have been designed and served as the simplest prototype molecules for studying NO chemisorption on metal surfaces[2]. 展开更多
关键词 LANTHANIDE prototype molecules nitric oxide metal surfaces catalytic conversion automobile exhaust emissions catalytic reduction CHEMISORPTION
原文传递
Synthesis of and Experiment on a Morphing Nose Cone Driven by a Biomimetic 4-3R1U&3R Parallel Mechanism
17
作者 Hui Yang Zhonghao Huang +3 位作者 Yan Wang Yongsheng Zhao Yanpu Yao Shangling Qiao 《Chinese Journal of Mechanical Engineering》 2025年第4期518-534,共17页
Aircraft have received much attention because of their capability to adapt to various flight environments and complex missions.The nose cone is one of the key elements in optimising the aerodynamic shape of aircraft.A... Aircraft have received much attention because of their capability to adapt to various flight environments and complex missions.The nose cone is one of the key elements in optimising the aerodynamic shape of aircraft.A morphing nose cone(MNC)driven by a biomimetic 4-3R1U&3R sparallel mechanism is proposed in this study.Based on screw theory,the parallel mechanism’s configuration is determined,and the structure’s full-cycle degrees of freedom are concurrently confirmed.Examples in the paper demonstrate the viability of the structure by configuration synthesis,and diagrams also show the chains.This MNC is modelled after the structural design of the cicada’s abdomen and can be extended,contracted and bent.It can actively adjust its shape in response to change in the flight environments,thereby aerodynamic performance and enhancing the aircraft’s multi-mission capabilities.A scaled-down prototype is created to verify the deformation capacity of the MNC meeting the engineering requirements.Results show that the extension ratio is 36.7%,and the bending angle is 21.7°,which is better than expected.The relative error value is within a reasonable range and the extension process is incredibly stable.This research proposes new perspectives for the design of MNCs. 展开更多
关键词 Configuration synthesis Cicada abdomen Parallel mechanism Prototype experiment
在线阅读 下载PDF
PPFormer:Patch Prototype Transformer for Semantic Segmentation
18
作者 Shanyuan Liu Yonggang Lu 《Journal of Beijing Institute of Technology》 2025年第4期405-417,共13页
Since the introduction of vision Transformers into the computer vision field,many vision tasks such as semantic segmentation tasks,have undergone radical changes.Although Transformer enhances the correlation of each l... Since the introduction of vision Transformers into the computer vision field,many vision tasks such as semantic segmentation tasks,have undergone radical changes.Although Transformer enhances the correlation of each local feature of an image object in the hidden space through the attention mechanism,it is difficult for a segmentation head to accomplish the mask prediction for dense embedding of multi-category and multi-local features.We present patch prototype vision Transformer(PPFormer),a Transformer architecture for semantic segmentation based on knowledge-embedded patch prototypes.1)The hierarchical Transformer encoder can generate multi-scale and multi-layered patch features including seamless patch projection to obtain information of multiscale patches,and feature-clustered self-attention to enhance the interplay of multi-layered visual information with implicit position encodes.2)PPFormer utilizes a non-parametric prototype decoder to extract region observations which represent significant parts of the objects by unlearnable patch prototypes and then calculate similarity between patch prototypes and pixel embeddings.The proposed contrasting patch prototype alignment module,which uses new patch prototypes to update prototype bank,effectively maintains class boundaries for prototypes.For different application scenarios,we have launched PPFormer-S,PPFormer-M and PPFormer-L by expanding the scale.Experimental results demonstrate that PPFormer can outperform fully convolutional networks(FCN)-and attention-based semantic segmentation models on the PASCAL VOC 2012,ADE20k,and Cityscapes datasets. 展开更多
关键词 hierarchical backbones patch prototype nonparametric learning semantic segmentation
在线阅读 下载PDF
上一页 1 2 25 下一页 到第
使用帮助 返回顶部