Foundation models are reshaping artificial intelligence,yet their deployment in specialised domains such as agricultural question answering(AQA)still faces challenges including data scarcity and barriers to domainspec...Foundation models are reshaping artificial intelligence,yet their deployment in specialised domains such as agricultural question answering(AQA)still faces challenges including data scarcity and barriers to domainspecific knowledge.To systematically review recent progress in this area,this paper adopts a task–paradigmperspective and examines applications across three major AQA task families.For text-based QA,we analyse the strengths and limitations of retrieval-based,generative,and hybrid approaches built on large languagemodels,revealing a clear trend toward hybrid paradigms that balance precision and flexibility.For visual diagnosis,we discuss techniques such as crossmodal alignment and prompt-driven generation,which are pushing systems beyond simple pest and disease recognition toward deeper causal reasoning.Formultimodal reasoning,we show how the fusion of heterogeneous data—including text,images,speech,and sensor streams—enables comprehensive decision-making for diagnosis,monitoring,and yield prediction.To address the lack of unified benchmarks,we further propose a standardised evaluation protocol and a diagnostic taxonomy specifically designed to characterise agriculture-specific errors.Finally,we outline a concreteAQA roadmap that emphasises safety alignment,hallucination control,and lightweight deployment,aiming to guide future systems toward greater efficiency,trustworthiness,and sustainability.展开更多
In the context of power generation companies, vast amounts of specialized data and expert knowledge have been accumulated. However, challenges such as data silos and fragmented knowledge hinder the effective utilizati...In the context of power generation companies, vast amounts of specialized data and expert knowledge have been accumulated. However, challenges such as data silos and fragmented knowledge hinder the effective utilization of this information. This study proposes a novel framework for intelligent Question-and-Answer (Q&A) systems based on Retrieval-Augmented Generation (RAG) to address these issues. The system efficiently acquires domain-specific knowledge by leveraging external databases, including Relational Databases (RDBs) and graph databases, without additional fine-tuning for Large Language Models (LLMs). Crucially, the framework integrates a Dynamic Knowledge Base Updating Mechanism (DKBUM) and a Weighted Context-Aware Similarity (WCAS) method to enhance retrieval accuracy and mitigate inherent limitations of LLMs, such as hallucinations and lack of specialization. Additionally, the proposed DKBUM dynamically adjusts knowledge weights within the database, ensuring that the most recent and relevant information is utilized, while WCAS refines the alignment between queries and knowledge items by enhanced context understanding. Experimental validation demonstrates that the system can generate timely, accurate, and context-sensitive responses, making it a robust solution for managing complex business logic in specialized industries.展开更多
Medical visual question answering(MedVQA)faces unique challenges due to the high precision required for images and the specialized nature of the questions.These challenges include insufficient feature extraction capab...Medical visual question answering(MedVQA)faces unique challenges due to the high precision required for images and the specialized nature of the questions.These challenges include insufficient feature extraction capabilities,a lack of textual priors,and incomplete information fusion and interaction.This paper proposes an enhanced bootstrapping language-image pre-training(BLIP)model for MedVQA based on multimodal feature augmentation and triple-path collaborative attention(FCA-BLIP)to address these issues.First,FCA-BLIP employs a unified bootstrap multimodal model architecture that integrates ResNet and bidirectional encoder representations from Transformer(BERT)models to enhance feature extraction capabilities.It enables a more precise analysis of the details in images and questions.Next,the pre-trained BLIP model is used to extract features from image-text sample pairs.The model can understand the semantic relationships and shared information between images and text.Finally,a novel attention structure is developed to fuse the multimodal feature vectors,thereby improving the alignment accuracy between modalities.Experimental results demonstrate that the proposed method performs well in clinical visual question-answering tasks.For the MedVQA task of staging diabetic macular edema in fundus imaging,the proposed method outperforms the existing major models in several performance metrics.展开更多
The medical education of the Song dynasty constitutes a pivotal aspect within the broader framework of ancient Chinese medical education. The advent of the imperial examination system coincided with the emergence of a...The medical education of the Song dynasty constitutes a pivotal aspect within the broader framework of ancient Chinese medical education. The advent of the imperial examination system coincided with the emergence of a medical examination system, which served as the cornerstone for the subsequent evolution of medical education. According to historical records, the Song government established dedicated medical departments, along with comprehensive systems encompassing medical professors, students, and examinations. By examining extant medical historical documents, such as Tai Yi Ju Zhu Ke Cheng Wen Ge(《太医局诸科程文格》 Examination Answers and Standards of the Imperial Medical Bureau), researchers and readers can obtain a comprehensive understanding of the medical system that prevailed in the Song dynasty. While the intricate details of medical education during this era are not explicitly documented in historical records, modern researchers have the opportunity to uncover the entire view of medical education, particularly the medical examination system, through rigorous analysis of these extant historical medical documents. Such studies offer valuable insights into the developmental trajectory of the ancient Chinese medical examination system and provide crucial references for contemporary medical education. By conducting in-depth literature research and analysis of Tai Yi Ju Zhu Ke Cheng Wen Ge, this study endeavors to reconstruct the authentic scenario of medical examinations in the Song dynasty, as presented in the document, for the benefit of modern readers and researchers.展开更多
基金supported by the Ningxia Natural Science Foundation(2025AAC050001)the Scientific Research Startup Project for Full-Time Introduced High-Level Talents in Ningxia(2024BEH04130)+2 种基金the National Natural Science Foundation of China(32460444)the Ningxia Hui Autonomous Region Key Research and Development Program(2024BBF0101302,2023BDE02001)Supported by the Special Fund for Basic Research Business of Central Universities of North Minzu University(2025BG234,2023ZRLG12).
文摘Foundation models are reshaping artificial intelligence,yet their deployment in specialised domains such as agricultural question answering(AQA)still faces challenges including data scarcity and barriers to domainspecific knowledge.To systematically review recent progress in this area,this paper adopts a task–paradigmperspective and examines applications across three major AQA task families.For text-based QA,we analyse the strengths and limitations of retrieval-based,generative,and hybrid approaches built on large languagemodels,revealing a clear trend toward hybrid paradigms that balance precision and flexibility.For visual diagnosis,we discuss techniques such as crossmodal alignment and prompt-driven generation,which are pushing systems beyond simple pest and disease recognition toward deeper causal reasoning.Formultimodal reasoning,we show how the fusion of heterogeneous data—including text,images,speech,and sensor streams—enables comprehensive decision-making for diagnosis,monitoring,and yield prediction.To address the lack of unified benchmarks,we further propose a standardised evaluation protocol and a diagnostic taxonomy specifically designed to characterise agriculture-specific errors.Finally,we outline a concreteAQA roadmap that emphasises safety alignment,hallucination control,and lightweight deployment,aiming to guide future systems toward greater efficiency,trustworthiness,and sustainability.
文摘In the context of power generation companies, vast amounts of specialized data and expert knowledge have been accumulated. However, challenges such as data silos and fragmented knowledge hinder the effective utilization of this information. This study proposes a novel framework for intelligent Question-and-Answer (Q&A) systems based on Retrieval-Augmented Generation (RAG) to address these issues. The system efficiently acquires domain-specific knowledge by leveraging external databases, including Relational Databases (RDBs) and graph databases, without additional fine-tuning for Large Language Models (LLMs). Crucially, the framework integrates a Dynamic Knowledge Base Updating Mechanism (DKBUM) and a Weighted Context-Aware Similarity (WCAS) method to enhance retrieval accuracy and mitigate inherent limitations of LLMs, such as hallucinations and lack of specialization. Additionally, the proposed DKBUM dynamically adjusts knowledge weights within the database, ensuring that the most recent and relevant information is utilized, while WCAS refines the alignment between queries and knowledge items by enhanced context understanding. Experimental validation demonstrates that the system can generate timely, accurate, and context-sensitive responses, making it a robust solution for managing complex business logic in specialized industries.
基金Supported by the Program for Liaoning Excellent Talents in University(No.LR15045)the Liaoning Provincial Science and Technology Department Applied Basic Research Plan(No.101300243).
文摘Medical visual question answering(MedVQA)faces unique challenges due to the high precision required for images and the specialized nature of the questions.These challenges include insufficient feature extraction capabilities,a lack of textual priors,and incomplete information fusion and interaction.This paper proposes an enhanced bootstrapping language-image pre-training(BLIP)model for MedVQA based on multimodal feature augmentation and triple-path collaborative attention(FCA-BLIP)to address these issues.First,FCA-BLIP employs a unified bootstrap multimodal model architecture that integrates ResNet and bidirectional encoder representations from Transformer(BERT)models to enhance feature extraction capabilities.It enables a more precise analysis of the details in images and questions.Next,the pre-trained BLIP model is used to extract features from image-text sample pairs.The model can understand the semantic relationships and shared information between images and text.Finally,a novel attention structure is developed to fuse the multimodal feature vectors,thereby improving the alignment accuracy between modalities.Experimental results demonstrate that the proposed method performs well in clinical visual question-answering tasks.For the MedVQA task of staging diabetic macular edema in fundus imaging,the proposed method outperforms the existing major models in several performance metrics.
文摘The medical education of the Song dynasty constitutes a pivotal aspect within the broader framework of ancient Chinese medical education. The advent of the imperial examination system coincided with the emergence of a medical examination system, which served as the cornerstone for the subsequent evolution of medical education. According to historical records, the Song government established dedicated medical departments, along with comprehensive systems encompassing medical professors, students, and examinations. By examining extant medical historical documents, such as Tai Yi Ju Zhu Ke Cheng Wen Ge(《太医局诸科程文格》 Examination Answers and Standards of the Imperial Medical Bureau), researchers and readers can obtain a comprehensive understanding of the medical system that prevailed in the Song dynasty. While the intricate details of medical education during this era are not explicitly documented in historical records, modern researchers have the opportunity to uncover the entire view of medical education, particularly the medical examination system, through rigorous analysis of these extant historical medical documents. Such studies offer valuable insights into the developmental trajectory of the ancient Chinese medical examination system and provide crucial references for contemporary medical education. By conducting in-depth literature research and analysis of Tai Yi Ju Zhu Ke Cheng Wen Ge, this study endeavors to reconstruct the authentic scenario of medical examinations in the Song dynasty, as presented in the document, for the benefit of modern readers and researchers.