280 articles found
Zero-Shot Based Spatial AI Algorithm for Up-to-Date 3D Vision Map Generations in Highly Complex Indoor Environments
1
Authors: Sehun Lee, Taehoon Kim, Junho Ahn. Computers, Materials & Continua, 2025, No. 8, pp. 3623-3648 (26 pages).
This paper proposes a zero-shot based spatial recognition AI algorithm by fusing and developing multidimensional vision identification technology adapted to the situation in large indoor and underground spaces. With the expansion of large shopping malls and underground urban spaces (UUS), there is an increasing need for new technologies that can quickly identify complex indoor structures and changes such as relocation, remodeling, and construction, for the safety and management of citizens through the provision of up-to-date indoor 3D site maps. The proposed algorithm uses data collected by an unmanned robot to create an up-to-date 3D site map of the indoor site and recognizes complex indoor spaces based on zero-shot learning. This research specifically addresses two major challenges: the difficulty of detecting walls and floors due to complex patterns, and the difficulty of spatial perception due to unknown obstacles. The proposed algorithm addresses the limitations of the existing foundation model, detects floors and obstacles without expensive sensors, and improves the accuracy of spatial recognition by combining floor detection, vanishing point detection, and fusion obstacle detection algorithms. The experimental results show that the algorithm effectively detects the floor and obstacles in various indoor environments, with F1 scores of 0.96 and 0.93 in the floor detection and obstacle detection experiments, respectively.
Keywords: Spatial AI; vision foundation model; zero-shot learning; image segmentation
Denoising graph neural network based on zero-shot learning for Gibbs phenomenon in high-order DG applications
2
Authors: Wei AN, Jiawen LIU, Wenxuan OUYANG, Haoyu RU, Xuejun LIU, Hongqiang LYU. Chinese Journal of Aeronautics, 2025, No. 3, pp. 234-248 (15 pages).
With the availability of high-performance computing technology and the development of advanced numerical simulation methods, Computational Fluid Dynamics (CFD) is becoming more and more practical and efficient in engineering. As one of the representative high-precision algorithms, the high-order Discontinuous Galerkin Method (DGM) has not only attracted widespread attention from scholars in the CFD research community, but has also been developed extensively. However, when DGM is extended to high-speed aerodynamic flow field calculations, non-physical numerical Gibbs oscillations near shock waves often significantly affect the numerical accuracy and can even cause the calculation to fail. Data-driven approaches based on machine learning techniques can be used to learn the characteristics of Gibbs noise, which motivates their use in high-speed DG applications. To achieve this goal, labeled data need to be generated in order to train the machine learning models. This paper proposes a new method for denoising modeling of the Gibbs phenomenon using a machine learning technique, the zero-shot learning strategy, to eliminate the need to acquire large amounts of CFD data. The model adopts a graph convolutional network combined with a graph attention mechanism to learn the denoising paradigm from synthetic Gibbs noise data and generalize to DGM numerical simulation data. Numerical simulation results show that the proposed Gibbs denoising model can suppress the numerical oscillation near shock waves in the high-order DGM. Our work automates the extension of DGM to high-speed aerodynamic flow field calculations with higher generalization and lower cost.
Keywords: Computational fluid dynamics; High-order discontinuous Galerkin method; Gibbs phenomenon; Graph neural networks; Zero-shot learning
Deep Reinforcement Learning for Zero-Shot Coverage Path Planning With Mobile Robots
3
Authors: José Pedro Carvalho, A. Pedro Aguiar. IEEE/CAA Journal of Automatica Sinica, 2025, No. 8, pp. 1594-1609 (16 pages).
The ability of mobile robots to plan and execute a path is foundational to various path-planning challenges, particularly Coverage Path Planning. While this task has typically been tackled with classical algorithms, these often struggle with flexibility and adaptability in unknown environments. On the other hand, recent advances in Reinforcement Learning offer promising approaches, yet a significant gap in the literature remains when it comes to generalization over a large number of parameters. This paper presents a unified, generalized framework for coverage path planning that leverages value-based deep reinforcement learning techniques. The novelty of the framework comes from the design of an observation space that accommodates different map sizes, an action masking scheme that guarantees safety and robustness while also serving as a learning-from-demonstration technique during training, and a unique reward function that yields value functions that are size-invariant. These are coupled with a curriculum learning-based training strategy and parametric environment randomization, enabling the agent to tackle complete or partial coverage path planning with perfect or incomplete knowledge while generalizing to different map sizes, configurations, sensor payloads, and sub-tasks. Our empirical results show that the algorithm can perform zero-shot learning scenarios at a near-optimal level in environments that follow a similar distribution as during training, outperforming a greedy heuristic by sixfold. Furthermore, in out-of-distribution environments, our method surpasses existing state-of-the-art algorithms in most zero-shot and all few-shot scenarios, paving the way for generalizable and adaptable path-planning algorithms.
Keywords: Autonomous robots; coverage path planning; deep reinforcement learning; mobile robot; partially observable Markov decision processes; path planning; zero-shot generalization
Select-and-Answer Prompting: Facilitating LLMs for Improving Zero-Shot Reasoning
4
Authors: WANG Yufang, TANG Xuesong, HAO Kuangrong. Journal of Donghua University (English Edition), 2025, No. 5, pp. 513-522 (10 pages).
Large language models (LLMs) have demonstrated remarkable generalization abilities across multiple tasks in natural language processing (NLP). For multi-step reasoning tasks, chain-of-thought (CoT) prompting facilitates step-by-step thinking, leading to improved performance. However, despite significant advancements in LLMs, current CoT prompting performs suboptimally on smaller-scale models that have fewer parameters. Additionally, the common paradigm of few-shot CoT prompting relies on a set of manual demonstrations, with performance contingent on the quality of these annotations and varying with task-specific requirements. To address these limitations, we propose a select-and-answer prompting method (SAP) to enhance language model performance on reasoning tasks without the need for manual demonstrations. This method comprises two primary steps: guiding the model to conduct a preliminary analysis and generate several candidate answers based on the prompt, and then allowing the model to provide a final answer derived from these candidate answers. The proposed prompting strategy is evaluated across two language models of varying sizes and six datasets. On ChatGLM-6B, SAP consistently outperforms few-shot CoT across all datasets. For GPT-3.5, SAP achieves comparable performance to few-shot CoT and outperforms zero-shot CoT in most cases. These experimental results indicate that SAP can significantly improve the accuracy of language models in reasoning tasks.
Keywords: zero-shot learning; large language model (LLM); reasoning problem; chain-of-thought (CoT) prompting
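Illustrative sketch for entry 4 (not taken from the paper): the two steps that the abstract describes, candidate generation followed by selection, might be wired together roughly as below. The `llm` argument is a hypothetical callable that maps a prompt string to a completion string, and the prompt wording is an assumption made for illustration only; the paper's actual prompts are not given in the abstract.

```python
from typing import Callable


def select_and_answer(question: str, llm: Callable[[str], str], n_candidates: int = 3) -> str:
    """Two-step zero-shot prompting: generate candidates, then select one.

    `llm` is a hypothetical callable (prompt in, completion out); the prompt
    wording below is an assumption, not the wording used in the paper.
    """
    # Step 1: ask for a preliminary analysis and several candidate answers.
    analysis_prompt = (
        f"Question: {question}\n"
        f"Analyze the problem briefly, then list {n_candidates} candidate answers."
    )
    candidates = llm(analysis_prompt)

    # Step 2: ask for a final answer derived from those candidates.
    selection_prompt = (
        f"Question: {question}\n"
        f"Candidate answers:\n{candidates}\n"
        "Select the best candidate and state the final answer."
    )
    return llm(selection_prompt)
```

No manual demonstrations appear in either prompt, which is what makes the procedure zero-shot; any model client can be passed in as `llm`.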
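Research on a Reverse-Projection-Based Zero-Shot Learning Algorithm for Object Classification (Cited by: 1)
5
Authors: Feng Peng (冯鹏), Tuo Hongya (庹红娅), Qiao Lingfeng (乔凌峰), Wang Jiexin (王洁欣), Jing Zhongliang (敬忠良). Application Research of Computers (《计算机应用研究》; CSCD, PKU Core), 2017, No. 11, pp. 3291-3294 (4 pages).
Zero-shot learning (ZSL) addresses the classification of categories for which no training samples exist. The core of traditional regression methods is to project visual features into the semantic space, which does not make full use of the sample information contained in the visual features themselves and entails a heavy training computation. This paper proposes a reverse-projection-based ZSL object classification method that projects class prototypes into the visual space and uses the semantic nature of visual features to learn the mapping function; parameter optimization requires only an analytical solution. Experimental results on two benchmark datasets show that the proposed reverse-projection method substantially improves classification results over traditional regression methods and other existing methods, greatly reduces training time, and generalizes better to the classification of unseen classes.
Keywords: zero-shot learning; object classification; reverse projection; analytical solution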
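Illustrative sketch for entry 5 (not taken from the paper): one common way to obtain a mapping from semantic prototypes to visual features with an analytical solution is ridge regression, W = (S^T S + lam I)^-1 S^T X. Using ridge regression here is an assumption; the abstract states only that the parameters are obtained in closed form. All function names, the toy data, and the nearest-prototype classification rule are hypothetical.

```python
def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]


def solve(a, b):
    """Solve a @ w = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    m = [list(a[i]) + list(b[i]) for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        p = m[col][col]
        m[col] = [v / p for v in m[col]]
        for r in range(n):
            if r != col:
                f = m[r][col]
                m[r] = [vr - f * vc for vr, vc in zip(m[r], m[col])]
    return [row[n:] for row in m]


def fit_reverse_projection(prototypes, visual, lam=1e-3):
    """Closed-form ridge solution W = (S^T S + lam I)^-1 S^T X (assumed form).

    prototypes: one semantic vector per training row (S, n x a)
    visual:     the matching visual feature per row (X, n x d)
    """
    st = transpose(prototypes)
    gram = matmul(st, prototypes)
    for i in range(len(gram)):
        gram[i][i] += lam  # ridge term keeps the system well conditioned
    return solve(gram, matmul(st, visual))


def classify(x, unseen_prototypes, w):
    """Return the unseen class whose projected prototype is nearest to x."""
    def sq_dist(label):
        projected = matmul([unseen_prototypes[label]], w)[0]
        return sum((p - q) ** 2 for p, q in zip(projected, x))
    return min(unseen_prototypes, key=sq_dist)


# Toy data: two seen classes, two unseen classes described only by prototypes.
seen_prototypes = [[1.0, 0.0], [0.0, 1.0]]
seen_visual = [[2.0, 0.0], [0.0, 3.0]]
w = fit_reverse_projection(seen_prototypes, seen_visual, lam=1e-6)
unseen = {"u1": [1.0, 1.0], "u2": [1.0, -1.0]}
predicted = classify([1.9, 2.8], unseen, w)
```

Because the fit is a single linear solve rather than an iterative optimization, training cost is small, which matches the abstract's claim of reduced training time in spirit.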
Zero-shot Fine-grained Classification by Deep Feature Learning with Semantics (Cited by: 8)
6
Authors: Ao-Xue Li, Ke-Xin Zhang, Li-Wei Wang. International Journal of Automation and Computing (EI, CSCD), 2019, No. 5, pp. 563-574 (12 pages).
Fine-grained image classification, which aims to distinguish images with subtle distinctions, is a challenging task for two main reasons: lack of sufficient training data for every class and difficulty in learning discriminative features for representation. In this paper, to address the two issues, we propose a two-phase framework for recognizing images from unseen fine-grained classes, i.e., zero-shot fine-grained classification. In the first feature learning phase, we fine-tune deep convolutional neural networks using the hierarchical semantic structure among fine-grained classes to extract discriminative deep visual features. Meanwhile, a domain adaptation structure is introduced into the deep convolutional neural networks to avoid domain shift from training data to test data. In the second label inference phase, a semantic directed graph is constructed over attributes of fine-grained classes. Based on this graph, we develop a label propagation algorithm to infer the labels of images in the unseen classes. Experimental results on two benchmark datasets demonstrate that our model outperforms the state-of-the-art zero-shot learning models. In addition, the features obtained by our feature learning model also yield significant gains when they are used by other zero-shot learning models, which shows the flexibility of our model in zero-shot fine-grained classification.
Keywords: fine-grained image classification; zero-shot learning; deep feature learning; domain adaptation; semantic graph
A Dual Discriminator Method for Generalized Zero-Shot Learning
7
Authors: Tianshu Wei, Jinjie Huang. Computers, Materials & Continua (SCIE, EI), 2024, No. 4, pp. 1599-1612 (14 pages).
Zero-shot learning enables the recognition of new class samples by migrating models learned from semantic features and existing sample features to things that have never been seen before. The consistency of different types of features and the domain shift problem are two of the critical issues in zero-shot learning. To address both of these issues, this paper proposes a new modeling structure. The traditional approach maps semantic features and visual features into the same feature space; building on this, the proposed model uses a dual discriminator approach. This dual discriminator approach can further enhance the consistency between semantic and visual features. At the same time, it can also align unseen class semantic features and training set samples, providing a portion of information about the unseen classes. In addition, a new feature fusion method is proposed in the model. This method is equivalent to adding perturbation to the seen class features, which can reduce the degree to which the classification results of the model are biased towards the seen classes. At the same time, this feature fusion method can provide part of the information of the unseen classes, improving classification accuracy in generalized zero-shot learning and reducing domain bias. The proposed method is validated and compared with other methods on four datasets, and the experimental results show that it achieves promising results.
Keywords: Generalized zero-shot learning; modality consistent; discriminator; domain shift problem; feature fusion
A Novel Siamese Network for Few/Zero-Shot Handwritten Character Recognition Tasks
8
Authors: Nagwa Elaraby, Sherif Barakat, Amira Rezk. Computers, Materials & Continua (SCIE, EI), 2023, No. 1, pp. 1837-1854 (18 pages).
Deep metric learning is one of the recommended methods for the challenge of supporting few/zero-shot learning by deep networks. It depends on building a Siamese architecture of two homogeneous Convolutional Neural Networks (CNNs) for learning a distance function that can map input data from the input space to the feature space. Instead of determining the class of each sample, the Siamese architecture deals with the existence of only a few training samples by deciding whether the samples share the same class identity or not. The traditional structure for the Siamese architecture was built by forming two CNNs from scratch with randomly initialized weights and trained by binary cross-entropy loss. Building two CNNs from scratch is a trial-and-error, time-consuming process. In addition, training with binary cross-entropy loss sometimes leads to poor margins. In this paper, a novel Siamese network is proposed and applied to few/zero-shot Handwritten Character Recognition (HCR) tasks. The novelties of the proposed network are: 1) Utilizing transfer learning and using the pre-trained AlexNet as a feature extractor in the Siamese architecture. Fine-tuning a pre-trained network is typically faster and easier than building from scratch. 2) Training the Siamese architecture with contrastive loss instead of binary cross-entropy. Contrastive loss helps the network to learn a nonlinear mapping function that enables it to map the extracted features into the vector space in an optimal way. The proposed network is evaluated on the challenging Chars74K datasets by conducting two experiments. One tests the proposed network in few-shot learning, while the other tests it in zero-shot learning. The recognition accuracy of the proposed network reaches 85.6% and 82% in few- and zero-shot learning, respectively. In addition, a comparison between the performance of the proposed Siamese network and the traditional Siamese CNNs is conducted. The comparison results show that the proposed network achieves higher recognition results in less time. The proposed network reduces the training time from days to hours in both experiments.
Keywords: Handwritten character recognition (HCR); few-shot learning; zero-shot learning; deep metric learning; transfer learning; contrastive loss; Chars74K datasets
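Illustrative sketch for entry 8 (not taken from the paper): contrastive loss for a pair of embeddings is often written as d^2 for a same-class pair and max(0, margin - d)^2 for a different-class pair, where d is the Euclidean distance between the two embeddings. The exact variant used in the paper (for example, whether a factor of 1/2 is applied) is not stated in the abstract, so the form and the margin value below are assumptions.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive_loss(emb_a, emb_b, same_class, margin=1.0):
    """Pairwise contrastive loss (assumed form, no 1/2 factor).

    Same-class pairs are pulled together (loss grows with distance);
    different-class pairs are pushed apart until they are at least
    `margin` away, after which they contribute no loss.
    """
    d = euclidean(emb_a, emb_b)
    if same_class:
        return d ** 2
    return max(0.0, margin - d) ** 2
```

Unlike binary cross-entropy on a similarity score, the margin term stops penalizing dissimilar pairs once they are far enough apart, which is the property the abstract credits for better-separated features.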
Explanatory Multi-Scale Adversarial Semantic Embedding Space Learning for Zero-Shot Recognition
9
Authors: Huiting Li. Open Journal of Applied Sciences, 2022, No. 3, pp. 317-335 (19 pages).
The goal of zero-shot recognition is to classify classes that have never been seen before, which requires building a bridge between seen and unseen classes through a semantic embedding space. Therefore, semantic embedding space learning plays an important role in zero-shot recognition. In existing works, the semantic embedding space is mainly formed from user-defined attribute vectors. However, the discriminative information included in a user-defined attribute vector is limited. In this paper, we propose to learn an extra latent attribute space automatically to produce a more generalized and discriminative semantic embedding space. To prevent the bias problem, both the user-defined attribute vector and the latent attribute space are optimized by adversarial learning with auto-encoders. We also propose to reconstruct semantic patterns produced by explanatory graphs, which can make the semantic embedding space more sensitive to useful semantic information and less sensitive to useless information. The proposed method is evaluated on the AwA2 and CUB datasets. The results show that our proposed method achieves superior performance.
Keywords: Zero-shot recognition; semantic embedding space; adversarial learning; explanatory graph
FM2S: Towards Spatially-correlated Noise Modeling in Zero-shot Fluorescence Microscopy Image Denoising
10
Authors: Jizhihui Liu, Qixun Teng, Qing Ma, Junhui Hou, Junjun Jiang. Machine Intelligence Research, 2026, No. 1, pp. 200-213 (14 pages).
Fluorescence microscopy image (FMI) denoising faces critical challenges because of the compound mixed Poisson-Gaussian noise with strong spatial correlation and the impracticality of acquiring paired noisy/clean data in dynamic biomedical scenarios. While supervised methods trained on synthetic noise (e.g., Gaussian/Poisson) suffer from out-of-distribution generalization issues, existing self-supervised approaches degrade under real FMI noise because of oversimplified noise assumptions and computationally intensive deep architectures. In this work, we propose fluorescence micrograph to self (FM2S), a zero-shot denoiser that achieves efficient FMI denoising through three key innovations: 1) a noise injection module that ensures training data sufficiency through adaptive Poisson-Gaussian synthesis while preserving the spatial correlation and global statistics of FMI noise for robust model generalization; 2) a two-stage proactive learning strategy that first recovers structural priors via predenoised targets and then refines high-frequency details through noise distribution alignment; 3) an ultralight-weight network (3.5k parameters) enabling rapid convergence with 270× faster training and inference than state-of-the-art (SOTA) methods. Extensive experiments across FMI datasets demonstrate FM2S's superiority: it outperforms CVF-SID by 1.4 dB in peak signal-to-noise ratio (PSNR) on average while requiring 0.1% of the parameters of AP-BSN. Notably, FM2S maintains stable performance across varying noise levels, indicating its practicality for microscopy platforms with diverse sensor characteristics. The code and datasets can be found at https://github.com/Danielement321/FM2S.
Keywords: Image denoising; fluorescence microscopy; zero-shot learning; noise modelling; ultralight-weight network
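Illustrative sketch for entry 10 (not taken from the paper or its repository): PSNR is conventionally defined as 10 * log10(peak^2 / MSE), so a gain in dB translates directly into a ratio of mean squared errors. The helper names and the 8-bit peak value of 255 are assumptions chosen for illustration.

```python
import math


def mse(reference, estimate):
    """Mean squared error between two equal-length pixel sequences."""
    return sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)


def psnr(reference, estimate, peak=255.0):
    """Peak signal-to-noise ratio in dB; infinite for identical inputs."""
    err = mse(reference, estimate)
    if err == 0:
        return math.inf
    return 10.0 * math.log10(peak ** 2 / err)


def mse_ratio_for_gain(gain_db):
    """Factor by which MSE shrinks for a given PSNR gain in dB."""
    return 10.0 ** (gain_db / 10.0)
```

Under this definition, the 1.4 dB average gain quoted in the abstract corresponds to a mean squared error roughly 1.38 times smaller, since 10^(1.4/10) is about 1.38.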
GMCoT: a graph-augmented multimodal chain-of-thought reasoning framework for multi-label zero-shot learning
11
Authors: Xiang WEN, Haobo WANG, Ke CHEN, Tianlei HU, Gang CHEN. Frontiers of Information Technology & Electronic Engineering, 2025, No. 12, pp. 2623-2637 (15 pages).
In recent years, multi-label zero-shot learning (ML-ZSL) has garnered increasing attention because of its wide range of potential applications, such as image annotation, text classification, and bioinformatics. The central challenge in ML-ZSL lies in predicting multiple labels for unseen classes without requiring any labeled training data, which contrasts with conventional supervised learning paradigms. However, existing methods face several significant challenges. These include the substantial semantic gap between different modalities, which impedes effective knowledge transfer, and the intricate relationships among multiple labels, which are difficult to model accurately. To overcome these challenges, we propose a graph-augmented multimodal chain-of-thought (GMCoT) reasoning approach. The proposed method combines the strengths of multimodal large language models with graph-based structures, significantly enhancing the reasoning process involved in multi-label prediction. First, a novel multimodal chain-of-thought reasoning framework is presented which imitates human-like step-by-step reasoning to produce multi-label predictions. Second, a technique is presented for integrating label graphs into the reasoning process. This technique enables the capture of complex semantic relationships among labels, thereby improving the accuracy and consistency of multi-label generation. Comprehensive experiments on benchmark datasets demonstrate that the proposed GMCoT approach outperforms state-of-the-art methods in ML-ZSL.
Keywords: Chain-of-thought; Multi-label zero-shot learning; Multimodal reasoning; Large language model
BDA: Bi-directional attention for zero-shot learning
12
Authors: Junseok Lee, Jinming Cao, Yifang Yin, Jihie Kim, Roger Zimmermann, Seongsik Park. Computational Visual Media, 2025, No. 5, pp. 983-1003 (21 pages).
Zero-shot learning (ZSL) is an important and rapidly growing area of machine learning that aims to recognize new classes without prior training data. Despite its significance, ZSL has faced challenges with overfitting in embedding-based methods and limitations in traditional one-directional attention (ODA) based approaches. To bridge these gaps, this paper proposes the use of bi-directional attention (BDA) to integrate insights from both embedding and attention-based approaches. The proposed BDA system consists of a bi-directional attention network (BDAN) and a synthesized visual embedding network (SVEN) that facilitates visual-semantic interaction for ZSL classification. More specifically, the BDAN employs region self-attention (RSA), semantic synthesis attention (SSA), and visual synthesis attention (VSA) to overcome the overfitting issue in embedding methods and enhance transferability, to associate visual features with semantic property information, and to learn locally improved visual features. Extensive testing on the CUB, SUN, and AWA2 datasets confirms the superiority of our proposed method over traditional approaches.
Keywords: zero-shot learning (ZSL); bi-directional attention (BDA); transferability; interaction
MAPS: A Multi-task Framework with Anchor Point Sampling for Zero-shot Entity Linking
13
Authors: Chao Chen, Pengfei Luo, Changkai Feng, Tian Wu, Wenbin Jiang, Tong Xu. Data Intelligence, 2025, No. 4, pp. 1085-1107 (23 pages).
Entity linking (EL) plays a crucial role in natural language processing (NLP) tasks by linking ambiguous entity mentions to relevant entities in a knowledge base. Due to the inconsistency in data distribution across diverse domains, it is difficult to accurately estimate the overall data distribution of the target domain, so generalization performance decreases significantly in zero-shot scenarios. Existing works primarily focus on sampling and on incorporating fine-grained information to deal with this issue. Unfortunately, they may face either the significant computational cost of negative samples in the sampling strategy, or shortcomings in the interaction between coarse- and fine-grained information. To tackle these challenges, in this paper we propose a Multi-Task Framework with Anchor Point Sampling (MAPS). Specifically, for the anchor point sampling (APS) part, considering fine-grained information, we pre-bind mention-entity pairs based on prior conditions (e.g., entity type) to introduce challenging negative samples and modify the conditional distribution. In this way, a trade-off between computational effectiveness and efficiency is reached. Moreover, we propose a novel multi-task framework that shares coarse-grained information at a lower level and uses multiple extractors to extract fine-grained information at a higher level. By combining the multi-task framework with various APS approaches, a comprehensive fusion of coarse- and fine-grained information is achieved. Experimental results on the benchmark dataset ZESHEL demonstrate that MAPS significantly outperforms the competitive baselines.
Keywords: zero-shot entity linking; Negative sampling; Multi-task learning
A Survey of Zero-Shot Object Detection
14
Authors: Weipeng Cao, Xuyang Yao, Zhiwu Xu, Ye Liu, Yinghui Pan, Zhong Ming. Big Data Mining and Analytics, 2025, No. 3, pp. 726-750 (25 pages).
Zero-Shot object Detection (ZSD), one of the most challenging problems in the field of object detection, aims to accurately identify new categories that are not encountered during training. Recent advancements in deep learning and increased computational power have led to significant improvements in object detection systems, achieving high recognition accuracy on benchmark datasets. However, these systems remain limited in real-world applications due to the scarcity of labeled training samples, making it difficult to detect unseen classes. To address this, researchers have explored various approaches, yielding promising progress. This article provides a comprehensive review of the current state of ZSD, distinguishing four related methods (zero-shot, open-vocabulary, open-set, and open-world approaches) based on task objectives and data usage. We highlight representative methods, discuss the technical challenges within each framework, and summarize the commonly used evaluation metrics, benchmark datasets, and experimental results. Our review aims to offer readers a clear overview of the latest developments and performance trends in ZSD.
Keywords: zero-shot object detection (ZSD); open-vocabulary object detection; open-set object detection; open-world object detection
Zero-Shot Knowledge-Based Visual Question Answering with Frozen Language Models
15
Authors: Jing Liu, Lizong Zhang, Chenpeng Cao, Yinong Shi, Chong Mu, Jiaxin Li. Big Data Mining and Analytics, 2025, No. 6, pp. 1418-1431 (14 pages).
Knowledge-based Visual Question Answering (VQA) is a challenging task that requires models to access external knowledge for reasoning. Large Language Models (LLMs) have recently been employed for zero-shot knowledge-based VQA due to their inherent knowledge storage and in-context learning capabilities. However, LLMs are commonly perceived as implicit knowledge bases, and their generative and in-context learning potential remains underutilized. Existing works demonstrate that the performance of in-context learning strongly depends on the quality and order of demonstrations in prompts. In light of this, we propose Knowledge Generation with Frozen Language Models (KGFLM), a novel method for generating explicit knowledge statements to improve zero-shot knowledge-based VQA. Our knowledge generation strategy aims to identify effective demonstrations and determine their optimal order, thereby activating the frozen LLM to produce more useful knowledge statements for better predictions. The generated knowledge statements can also serve as interpretable rationales. In our method, the selection and arrangement of demonstrations are based on the semantic similarity and quality of demonstrations for each question, without requiring additional annotations. Furthermore, a series of experiments are conducted on the A-OKVQA and OKVQA datasets. The results show that our method outperforms several strong zero-shot knowledge-based VQA methods.
Keywords: knowledge-based Visual Question Answering (VQA); zero-shot learning; Large Language Models (LLMs)
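A Bearing Fault Diagnosis Method Based on a Generative Zero-Shot Deep Learning Model
16
Authors: Liu Yuewen (刘月文), Liu Wenmiao (刘文淼), Li Yongting (李永亭), Qi Yongsheng (齐咏生), Liu Huiwen (刘慧文). Journal of Chinese Agricultural Mechanization (《中国农机化学报》; PKU Core), 2026, No. 1, pp. 201-209 (9 pages).
Deep-learning-based fault diagnosis models require large amounts of training data. In real operating conditions, however, the environment is harsh and complete fault data are difficult to obtain, which leads to poor training accuracy or even makes training impossible. A generative zero-shot learning model is therefore introduced. Generative models nevertheless have limitations: the generated features may be of poor quality and differ considerably from real features, which limits model performance. To address this problem, a generative zero-shot learning method combining complementary attributes and a regression module (CARMGZSL) is proposed and applied to bearing fault diagnosis. First, the continuous wavelet transform converts one-dimensional fault signals into time-frequency images, and a CNN extracts fault features. Next, a semantic attribute module is designed that defines different semantic attributes for different faults; a generative adversarial module adversarially trains on the semantic attributes and fault features of seen fault classes, generates features for unseen fault classes, and feeds them to a discriminator, which distinguishes them from real fault sample features. A regression module is then constructed that reconstructs the generated sample features into semantic attributes and feeds them back to the generator, making the generated features more realistic. Finally, a similarity measure judges the distance between unseen-class faults and generated unseen-class faults to complete fault identification. The algorithm is validated on the Case Western Reserve University bearing dataset. The results show that the method achieves zero-shot fault diagnosis of rolling bearings and, compared with other classical zero-shot diagnosis algorithms, reaches an average accuracy of 92.32% with better diagnostic performance.
Keywords: rolling bearing; zero-shot learning; fault diagnosis; generative adversarial network; semantic features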
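"Zero-Shot Language Learning": Can Large Language Models Acquire Contextual Emotion "Like Humans"?
17
Authors: Wu Shiyu (吴诗玉), Wang Yiyun (王亦赟). Acta Psychologica Sinica (《心理学报》; PKU Core), 2026, No. 2, pp. 308-322 (15 pages).
This study examines whether large language models (LLMs) can, under zero-shot conditions, incidentally acquire through reading the emotion of the contexts in which words appear, and evaluates the effects of emotional valence and contextual variability on vocabulary learning. Using a cross model-human comparison, four LLMs and three groups of learners studied, in the same materials, target words embedded in contexts of different emotions (positive/neutral/negative) and in repeated or varied contexts; multiple tests measured emotion transfer and the acquisition of word form and word meaning. The results show that LLMs pattern with humans: they transfer contextual emotion to target words and maintain emotional consistency in language generation. They also exhibit a "positive emotion advantage" and a "contextual variability advantage", and an interaction between contextual emotion and contextual variability appears in definition generation. The article proposes a "dual-mechanism framework", arguing that LLMs possess human-like affective-semantic learning ability at the functional level, but that their mechanism rests on statistical co-occurrence and vector optimization, unlike the embodied and social processing of humans. The study offers implications for affective computing, the ethics of human-computer interaction, and vocabulary teaching.
Keywords: large language models; zero-shot learning; emotion learning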
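SQL Generation Based on Structure Awareness and Monte Carlo Tree Search
18
Authors: Fu Yu (富宇), Li Haoran (李浩冉). Computer Technology and Development (《计算机技术与发展》), 2026, No. 3, pp. 118-123, 117 (7 pages).
The natural-language-to-SQL (Text-to-SQL) task aims to map user queries to executable SQL statements and is a core technology for interaction between natural language and databases. Current mainstream large language models often produce structural errors, semantic deviations, and execution failures when handling complex structures, multi-table joins, and nested logic, which limits their reliability and generalization. This paper therefore proposes Struct-MCTS, a Text-to-SQL generation framework based on structure awareness and Monte Carlo tree search (MCTS). The framework models the SQL generation process with fine-grained structured actions and combines parallel multi-model generation with collaborative debate to score candidate paths dynamically, improving the robustness and consistency of the generated results. Under zero-shot conditions, Struct-MCTS achieves leading execution accuracy on complex datasets such as Spider and BIRD, demonstrating strong generalization and practical potential.
Keywords: Text-to-SQL; large language models; structure awareness; Monte Carlo tree search; multi-model debate; zero-shot learning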
Learning from Scarcity: A Review of Deep Learning Strategies for Cold-Start Energy Time-Series Forecasting
19
Authors: Jihoon Moon. Computer Modeling in Engineering & Sciences, 2026, No. 1, pp. 26-76 (51 pages).
Predicting the behavior of renewable energy systems requires models capable of generating accurate forecasts from limited historical data, a challenge that becomes especially pronounced when commissioning new facilities where operational records are scarce. This review aims to synthesize recent progress in data-efficient deep learning approaches for addressing such "cold-start" forecasting problems. It primarily covers three interrelated domains (solar photovoltaic (PV), wind power, and electrical load forecasting) where data scarcity and operational variability are most critical, while also including representative studies on hydropower and carbon emission prediction to provide a broader systems perspective. To this end, we examined trends from over 150 predominantly peer-reviewed studies published between 2019 and mid-2025, highlighting advances in zero-shot and few-shot meta-learning frameworks that enable rapid model adaptation with minimal labeled data. Moreover, transfer learning approaches combined with spatiotemporal graph neural networks have been employed to transfer knowledge from existing energy assets to new, data-sparse environments, effectively capturing hidden dependencies among geographic features, meteorological dynamics, and grid structures. Synthetic data generation has further proven valuable for expanding training samples and mitigating overfitting in cold-start scenarios. In addition, large language models and explainable artificial intelligence (XAI), notably conversational XAI systems, have been used to interpret and communicate complex model behaviors in accessible terms, fostering operator trust from the earliest deployment stages. By consolidating methodological advances, unresolved challenges, and open-source resources, this review provides a coherent overview of deep learning strategies that can shorten the data-sparse ramp-up period of new energy infrastructures and accelerate the transition toward resilient, low-carbon electricity grids.
Keywords: Cold-start forecasting; zero-shot learning; few-shot meta-learning; transfer learning; spatiotemporal graph neural networks; energy time series; large language models; explainable artificial intelligence (XAI)
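Deep-Learning-Based Wildlife Image Recognition: Methods and Challenges
20
Authors: Li Yaodi (李尧迪), Tian Ye (田野), Zhang Changchun (张长春), Xie Jiangjian (谢将剑), Zhao Haitao (赵海涛), Zhang Junguo (张军国). Scientia Silvae Sinicae (《林业科学》; PKU Core), 2026, No. 1, pp. 207-222 (16 pages).
With the growing demand for wildlife conservation and ecological monitoring, deep-learning-based image recognition methods are increasingly widely used in wildlife research. This study first introduces commonly used public wildlife datasets, then reviews in detail the application of different deep learning techniques to wildlife image recognition, dividing recognition methods by task requirements into three levels (image level, object level, and pixel level) and focusing on the concrete implementation and technical details of the methods at each level. On this basis, it examines the core challenges facing wildlife image recognition. These include data-level problems such as uneven data quality, costly and inefficient annotation, and imbalanced sample distributions, as well as key technical difficulties from the model and algorithm perspective, including fine-grained detection, cross-domain distribution shift, class-incremental learning, zero-shot learning, and cross-modal learning. In response to these challenges, it summarizes current research progress and coping strategies and proposes possible future directions, aiming to provide theoretical support and methodological reference for building efficient, robust wildlife recognition systems suited to real monitoring scenarios.
Keywords: wildlife image recognition; deep learning; data imbalance; transfer learning; zero-shot learning; cross-modal learning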