108,213 articles found
Strong Laws of Large Numbers for Sequences of Blockwise m-Dependent and Sub-Orthogonal Random Variables under Sublinear Expectations (Cited by: 1)
1
Authors: Jialiang FU. Journal of Mathematical Research with Applications, 2026, No. 1, pp. 103-118 (16 pages)
In this paper, we establish some strong laws of large numbers for non-independent random variables under the framework of sublinear expectations. One of our main results is for blockwise m-dependent random variables, and another is for sub-orthogonal random variables. Both extend the strong law of large numbers for independent random variables under sublinear expectations to the non-independent case.
Keywords: sublinear expectations; strong law of large numbers; blockwise m-dependent; sub-orthogonal random variables
Agri-Eval: Multi-level Large Language Model Valuation Benchmark for Agriculture
2
Authors: WANG Yaojun, GE Mingliang, XU Guowei, ZHANG Qiyu, BIE Yuhui. 《农业机械学报》 (PKU Core Journal), 2026, No. 1, pp. 290-299 (10 pages)
Model evaluation using benchmark datasets is an important method to measure the capability of large language models (LLMs) in specific domains, and it is mainly used to assess the knowledge and reasoning abilities of LLMs. Therefore, in order to better assess the capability of LLMs in the agricultural domain, Agri-Eval was proposed as a benchmark for assessing the knowledge and reasoning ability of LLMs in agriculture. The assessment dataset used in Agri-Eval covered seven major disciplines in the agricultural domain: crop science, horticulture, plant protection, animal husbandry, forest science, aquaculture science, and grass science, and contained a total of 2283 questions. Among domestic general-purpose LLMs, DeepSeek R1 performed best with an accuracy rate of 75.49%. Among international general-purpose LLMs, Gemini 2.0 pro exp 0205 stood out as the top performer, achieving an accuracy rate of 74.28%. As a vertical LLM for agriculture, Shennong V2.0 outperformed all the LLMs in China, and its answer accuracy on agricultural knowledge exceeded that of all the existing general-purpose LLMs. The launch of Agri-Eval helps LLM developers to comprehensively evaluate a model's capability in the field of agriculture through a variety of tasks and tests, promoting the development of LLMs in the field of agriculture.
Keywords: large language models; assessment systems; agricultural knowledge; agricultural datasets
Preferences of Chinese Dermatologists for Large Language Model Responses in Clinical Psoriasis Scenarios: A Nationwide Cross-Sectional Survey in China
3
Authors: Jungang Yang, Jingkai Xu, Xuejiao Song, Chengxu Li, Lili Chen, Lingbo Bi, Tingting Jiang, Xianbo Zuo, Yong Cui. Health Care Science, 2026, No. 1, pp. 40-48 (9 pages)
Background: Large language models (LLMs) have shown considerable promise in supporting clinical decision-making. However, their adoption and evaluation in dermatology remain limited. This study aimed to explore the preferences of Chinese dermatologists regarding LLM-generated responses in clinical psoriasis scenarios and to assess how they prioritize key quality dimensions, including accuracy, traceability, and logicality. Methods: A cross-sectional, web-based survey was conducted between December 25, 2024, and January 22, 2025, following the Checklist for Reporting Results of Internet E-Surveys guidelines. A total of 1247 valid responses were collected from practicing dermatologists across 33 of China's provincial-level administrative divisions. Participants evaluated responses to five categories of clinical questions (etiology, clinical presentation, differential diagnosis, treatment, and case study) generated by five LLMs: ChatGPT-4o, Kimi.ai, Doubao, ZuoYiGPT, and Lingyi-agent. Statistical associations between participant characteristics and model preferences were examined using chi-square tests. Results: ChatGPT-4o (Model 1) emerged as the most preferred model across all clinical tasks, consistently receiving the highest number of votes in case study (n = 740), clinical presentation (n = 666), differential diagnosis (n = 707), etiology (n = 602), and treatment (n = 656). Significant variation in model preference by professional title was observed only for the differential diagnosis task (χ² = 21.13, df = 12, p = 0.0485), while no significant differences were found across hospital tiers (p > 0.05). In terms of evaluation dimensions, accuracy was most frequently rated as "very important" (n = 635). A significant association existed between hospital tier and the most valued dimension (χ² = 27.667, df = 9, p = 0.0011), with dermatologists in primary hospitals prioritizing traceability more than their peers in higher-tier hospitals. No significant associations were found across professional titles (p = 0.127). Conclusions: Chinese dermatologists show a strong preference for ChatGPT-4o over domestic LLMs in psoriasis-related clinical tasks. While accuracy remains the primary criterion, traceability and logicality are also critical, particularly for clinicians in lower-tier hospitals. These findings suggest that future clinical LLMs should prioritize not only content accuracy but also source transparency and structural clarity to meet the diverse needs of different clinical settings.
Keywords: dermatology; large language model; model evaluation
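As an illustrative aside on the chi-square statistics quoted in this abstract, the reported p-value for the differential diagnosis task can be checked by hand. The sketch below is not the authors' analysis code; it is a minimal standard-library check that relies on the closed-form upper tail of the chi-square distribution, which holds only for even degrees of freedom. The function name `chi2_sf_even_df` is a hypothetical helper, and the second statistic in the abstract (df = 9, odd) is outside its scope.

```python
import math


def chi2_sf_even_df(x: float, df: int) -> float:
    """Upper-tail probability P(X >= x) for a chi-square variable.

    Uses the closed form exp(-x/2) * sum_{i<df/2} (x/2)^i / i!,
    which is valid only when df is a positive even integer.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form requires a positive even df")
    half = x / 2.0
    term = 1.0   # current term (x/2)^i / i!, starting at i = 0
    total = 1.0  # running sum of the terms
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total


# Values taken from the abstract: chi-square = 21.13 with df = 12.
p_value = chi2_sf_even_df(21.13, 12)
print(f"p = {p_value:.4f}")
```

The result should land close to the reported p = 0.0485, just under the conventional 0.05 threshold.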
A Survey on Medical Competence Evaluation Benchmarks for Large Language Models
4
Authors: Qiting Wang, Huiru Zou, Haobin Zhang, Yongshun Huang, Junzhang Tian, Weibin Cheng. Health Care Science, 2026, No. 1, pp. 4-18 (15 pages)
Large language models (LLMs) show considerable potential to revolutionize healthcare through their performance across diverse clinical applications. Given the inherent constraints of LLMs and the critical nature of medical practice, a rigorous and systematic evaluation of their medical competence is imperative. This study presents a comprehensive review of the established methodologies and benchmarks for evaluating the medical competence of LLMs, encompassing a thorough analysis of current assessment practices across medical knowledge, clinical practice competence, and ethical-safety considerations. By integrating clinician competency assessment frameworks into LLM evaluation, we propose a structured tri-dimensional framework that systematically organizes existing evaluation approaches according to medical theoretical knowledge, clinical practice ability, and ethical-safety considerations. Furthermore, this research provides critical insights into future developmental trajectories while establishing foundational frameworks and standardization protocols for the integration of LLMs into medical practice.
Keywords: benchmark; large language model; medical competence
The Combined Immune Effects of Perfluorooctanoic Acid (PFOA) and Perfluorobutanoic Acid (PFBA) on Intestinal Microbiota of Large Yellow Croaker (Larimichthys crocea)
5
Authors: XUE Yadong, HAN Ping, LIU Xiumei, CHEN Jianming, YUAN Mingzhe, WANG Xubo. Journal of Ocean University of China, 2026, No. 1, pp. 312-322 (11 pages)
Polyfluoroalkyl substances (PFAS) have emerged as persistent environmental contaminants because of their chemical stability, degradation resistance and bioaccumulation potential. However, current studies mainly focus on the toxicity of single PFAS such as perfluorooctanoic acid (PFOA) and perfluorobutanoic acid (PFBA), and knowledge of their combined effects is relatively limited. In this study, we explored the immune response of the gut in large yellow croaker (Larimichthys crocea) under the combined stress of PFOA and PFBA. Histological analyses revealed that the combined exposure induced intestinal vacuolization and decreased the length of intestinal villi. It also significantly activated pro-inflammatory pathways, with marked upregulation of tnfα, il1β, il6 and myd88 expression, particularly after 14 days of exposure. Gut microbiota analysis revealed substantial dysbiosis, including 1) reduced alpha diversity, 2) increased abundance of potentially pathogenic taxa (Proteobacteria and Spirochaetota), and 3) depletion of beneficial Firmicutes. PICRUSt-based functional prediction indicated temporal metabolic shifts, with upregulation of DNA repair pathways at day 3 and enhanced bacterial motility protein activity at days 7 and 14 post-exposure. Pearson correlation analysis further indicated that these immune genes had significant positive correlations with Vibrio and Brevinema, and negative correlations with Streptococcus. The present study provides novel insights into microbiome-mediated immunomodulation in large yellow croaker exposed to combined PFAS, which will be helpful for healthy farming of economically important marine species.
Keywords: large yellow croaker; gut; combined stress; immune response
Multiphysics Implicit Coupling Method for Fluid, Particles, and Large-Deformation Structures
6
Authors: Xiangxiang Wang, Hualong Xie, Yue Yu, Min Li, Yubin Wang, Fei Xing. Computer Modeling in Engineering & Sciences, 2026, No. 2, pp. 367-401 (35 pages)
This study presents an implicit multiphysics coupling method integrating Computational Fluid Dynamics (CFD), the Multiphase Particle-in-Cell (MPPIC) model, and the Finite Element Method (FEM), implemented with OpenFOAM, CalculiX, and preCICE to simulate fluid-particle-structure interactions with large deformations. Mesh motion in the fluid field is handled using the radial basis function (RBF) method. The particle phase is modeled by MPPIC, where fluid-particle interaction is described through momentum exchange, and inter-particle collisions are characterized by collision stress. The structural field is solved by nonlinear FEM to capture large deformations induced by geometric nonlinearity. Coupling among fields is realized through a partitioned, parallel, and non-intrusive iterative strategy, ensuring stable transfer and convergence of interface forces and displacements. Notably, the influence of particles on the structure is not direct but mediated by the fluid, while structural motion directly affects particle dynamics. The results demonstrate that the proposed approach effectively captures multiphysics interaction processes and provides a valuable reference for numerical modeling of coupled fluid-particle-structure systems.
Keywords: fluid-particle-structure interaction; large deformation; partitioned method; non-intrusive coupling
When Large Language Models and Machine Learning Meet Multi-Criteria Decision Making: Fully Integrated Approach for Social Media Moderation
7
Authors: Noreen Fuentes, Janeth Ugang, Narcisan Galamiton, Suzette Bacus, Samantha Shane Evangelista, Fatima Maturan, Lanndon Ocampo. Computers, Materials & Continua, 2026, No. 1, pp. 2137-2162 (26 pages)
This study demonstrates a novel integration of large language models, machine learning, and multicriteria decision-making to investigate self-moderation in small online communities, a topic under-explored compared to user behavior and platform-driven moderation on social media. The proposed methodological framework (1) utilizes large language models for social media post analysis and categorization, (2) employs k-means clustering for content characterization, and (3) incorporates the TODIM (Tomada de Decisão Interativa Multicritério) method to determine moderation strategies based on expert judgments. In general, the fully integrated framework leverages the strengths of these intelligent systems for a more systematic evaluation of large-scale decision problems. When applied to social media moderation, this approach promotes nuanced and context-sensitive self-moderation by taking into account factors such as cultural background and geographic location. The application of this framework is demonstrated within Facebook groups. Eight distinct content clusters encompassing safety, harassment, diversity, and misinformation are identified. The analysis revealed a preference for content removal across all clusters, suggesting a cautious approach towards potentially harmful content. However, the framework also highlights the use of other moderation actions, like account suspension, depending on the content category. These findings contribute to the growing body of research on self-moderation and offer valuable insights for creating safer and more inclusive online spaces within smaller communities.
Keywords: self-moderation; user-generated content; k-means clustering; TODIM; large language models
Assessing Large Language Models for Early Article Identification in Otolaryngology—Head and Neck Surgery Systematic Reviews
8
Authors: Ajibola B. Bakare, Young Lee, Jhuree Hong, Claus-Peter Richter, Jonathan P. Kuriakose. Health Care Science, 2026, No. 1, pp. 19-28 (10 pages)
Background: Assess ChatGPT and Bard's effectiveness in the initial identification of articles for Otolaryngology—Head and Neck Surgery systematic literature reviews. Methods: Three PRISMA-based systematic reviews (Jabbour et al. 2017, Wong et al. 2018, and Wu et al. 2021) were replicated using ChatGPT v3.5 and Bard. Outputs (author, title, publication year, and journal) were compared to the original references and cross-referenced with medical databases for authenticity and recall. Results: Several themes emerged when comparing Bard and ChatGPT across the three reviews. Bard generated more outputs and had greater recall in Wong et al.'s review, with a broader date range in Jabbour et al.'s review. In Wu et al.'s review, ChatGPT-2 had higher recall and identified more authentic outputs than Bard-2. Conclusion: Large language models (LLMs) failed to fully replicate peer-reviewed methodologies, producing outputs with inaccuracies but identifying relevant, especially recent, articles missed by the references. While human-led PRISMA-based reviews remain the gold standard, refining LLMs for literature reviews shows potential.
Keywords: artificial intelligence; Bard; ChatGPT; large language models; systematic review
Command-agent: Reconstructing warfare simulation and command decision-making using large language models
9
Authors: Mengwei Zhang, Minchi Kuang, Heng Shi, Jihong Zhu, Jingyu Zhu, Xiao Jiang. Defence Technology(防务技术), 2026, No. 2, pp. 294-313 (20 pages)
War rehearsals have become increasingly important in national security due to the growing complexity of international affairs. However, traditional rehearsal methods, such as military chess simulations, are inefficient and inflexible, with particularly pronounced limitations in command and decision-making. The overwhelming volume of information and high decision complexity hinder the realization of autonomous and agile command and control. To address this challenge, an intelligent warfare simulation framework named Command-Agent is proposed, which deeply integrates large language models (LLMs) with digital twin battlefields. By constructing a highly realistic battlefield environment through real-time simulation and multi-source data fusion, the natural language interaction capabilities of LLMs are leveraged to lower the command threshold and to enable autonomous command through the Observe-Orient-Decide-Act (OODA) feedback loop. Within the Command-Agent framework, a multi-model collaborative architecture is further adopted to decouple the decision-generation and command-execution functions of LLMs. By combining specialized models such as DeepSeek-R1 and MCTool, the limitations of single-model capabilities are overcome. MCTool is a lightweight execution model fine-tuned for military Function Calling tasks. The framework also introduces a Vector Knowledge Base to mitigate hallucinations commonly exhibited by LLMs. Experimental results demonstrate that Command-Agent not only enables natural language-driven simulation and control but also deeply understands commander intent. Leveraging the multi-model collaborative architecture, during red-blue UAV confrontations involving 2 to 8 UAVs, the integrated score is improved by an average of 41.8% compared to the single-agent system (MCTool), accompanied by a 161.8% optimization in the battle loss ratio. Furthermore, when compared with multi-agent systems lacking the knowledge base, the inclusion of the Vector Knowledge Base further improves overall performance by 16.8%. In comparison with the general model (Qwen2.5-7B), the fine-tuned MCTool leads by 5% in execution efficiency. Therefore, the proposed Command-Agent introduces a novel perspective to the military command system and offers a feasible solution for intelligent battlefield decision-making.
Keywords: digital twin battlefield; large language models; multi-agent system; military command
Prompt Injection Attacks on Large Language Models: A Survey of Attack Methods, Root Causes, and Defense Strategies
10
Authors: Tongcheng Geng, Zhiyuan Xu, Yubin Qu, W. Eric Wong. Computers, Materials & Continua, 2026, No. 4, pp. 134-185 (52 pages)
Large language models (LLMs) have revolutionized AI applications across diverse domains. However, their widespread deployment has introduced critical security vulnerabilities, particularly prompt injection attacks that manipulate model behavior through malicious instructions. Following Kitchenham's guidelines, this systematic review synthesizes 128 peer-reviewed studies from 2022 to 2025 to provide a unified understanding of this rapidly evolving threat landscape. Our findings reveal a swift progression from simple direct injections to sophisticated multimodal attacks, achieving over 90% success rates against unprotected systems. In response, defense mechanisms show varying effectiveness: input preprocessing achieves 60%–80% detection rates and advanced architectural defenses demonstrate up to 95% protection against known patterns, though significant gaps persist against novel attack vectors. We identified 37 distinct defense approaches across three categories, but standardized evaluation frameworks remain limited. Our analysis attributes these vulnerabilities to fundamental LLM architectural limitations, such as the inability to distinguish instructions from data and attention mechanism vulnerabilities. This highlights critical research directions such as formal verification methods, standardized evaluation protocols, and architectural innovations for inherently secure LLM designs.
Keywords: prompt injection attacks; large language models; defense mechanisms; security evaluation
Turbulence-induced disturbances and their evolution to stall onset in a compressor cascade using large eddy simulation
11
Authors: Tianyu PAN, Teng LI, Zhaoqi YAN, Qiushi LI. Chinese Journal of Aeronautics, 2026, No. 2, pp. 1-19 (19 pages)
This study investigates the turbulence-induced disturbances and the stall precursor triggering mechanism in a NACA65-18(10) cascade based on large eddy simulations. The results indicate that the disturbances exist under various operating conditions along the performance curve. The shear layer is the physical structure responsible for the generation, propagation, and dissipation of disturbances. When operating near stall, the separation on the suction surface intensifies, and strong unsteady backflow occurs at the trailing edge of the passage. Under the influence of inlet disturbances, unsteady behaviors between passages form specific phase differences, leading the entire system to oscillate in a first-order mode. As the flow develops from near-stall to stall, axial momentum decreases further, reducing the main flow's ability to drive blockages downstream through convection. Consequently, the blockage accumulates during the circumferential propagation process until the stall onset. Based on the above mechanism, this study proposes factors describing the size of the backflow zone, shedding frequency, and convection velocity to characterize blockage dynamics, identifying critical values that represent the stall onset.
Keywords: stall onset; pre-stall; disturbances in cascade; stall indicator; large eddy simulation
Explosive lunar fission above a large low-velocity province
12
Authors: Matthew R. Edwards. Acta Geochimica, 2026, No. 1, pp. 15-29 (15 pages)
The giant impact hypothesis for the Moon's origin has had difficulty explaining the nearly identical isotopic compositions of Moon rocks and rocks from Earth's silicate mantle and crust. These similarities are instead more compatible with the Darwin-Wise hypothesis that the Moon arose by fission of a rapidly spinning Earth. To overcome problems with the fission model concerning structural stability and angular momentum conservation, some authors suggested that lunar fission was feasible on a more slowly rotating Earth if assisted by a nuclear explosion near the core-mantle boundary. In this light we consider the possible roles of the large low-velocity provinces (LLVPs). These long-lived structures have been implicated in diverse geophysical processes ranging from deep mantle plumes to continental breakup and mass extinction events. While the LLVPs have been seen as possible remnants of the giant impactor, we propose that one of them was the site of lunar ejection. Internal heating of the liquid core is suggested to have given rise to an equatorial belt just under the core-mantle boundary analogous to the one recently detected by Ma and Tkalcic [Sci Adv 10(35): eadn5562, 2024]. Upwellings of heat and volatiles from this belt then generated two antipodal, equatorial bulges: the precursors of the Pacific and African LLVPs. Prior to the emergence of plate tectonics, core heat was mainly dissipated by networks of deep mantle plumes extending above the proto-LLVPs. These plume networks represent conduits of weakened mantle through which proto-lunar materials could later rise in a focused ejection. Continuing heat buildup in the core eventually triggered a cataclysmic explosion in the Pacific proto-LLVP, possibly analogous to a planetary-scale kimberlite eruption. This explosion launched LLVP and overlying mantle material into a low Earth orbit, where it coalesced to form the Moon. Some possible sources of additional energy to power the explosion are considered, including nuclear fission, bolide impacts and a hypothetical gravitational decay process culminating in a 'A event'.
Keywords: large low-velocity provinces; deep mantle plumes; lunar fission model; kimberlite
CIT-Rec: Enhancing Sequential Recommendation System with Large Language Models
13
Authors: Ziyu Li, Zhen Chen, Xuejing Fu, Tong Mo, Weiping Li. Computers, Materials & Continua, 2026, No. 3, pp. 2328-2343 (16 pages)
Recommendation systems are key to boosting user engagement, satisfaction, and retention, particularly on media platforms where personalized content is vital. Sequential recommendation systems learn from user-item interactions to predict future items of interest. However, many current methods rely on unique user and item IDs, limiting their ability to represent users and items effectively, especially in zero-shot learning scenarios where training data is scarce. With the rapid development of Large Language Models (LLMs), researchers are exploring their potential to enhance recommendation systems. However, there is a semantic gap between the linguistic semantics of LLMs and the collaborative semantics of recommendation systems, where items are typically indexed by IDs. Moreover, most research focuses on item representations, neglecting personalized user modeling. To address these issues, we propose a sequential recommendation framework using LLMs, called CIT-Rec, a model that integrates Collaborative semantics for user representation and Image and Text information for item representation to enhance Recommendations. Specifically, by aligning intuitive image information with text containing semantic features, we can represent items more accurately, improving item representation quality. We focus not only on item representations but also on user representations. To capture users' personalized preferences more precisely, we use traditional sequential recommendation models to train on users' historical interaction data, effectively capturing behavioral patterns. Finally, by combining LLMs and traditional sequential recommendation models, we allow the LLM to understand linguistic semantics while capturing collaborative semantics. Extensive evaluations on real-world datasets show that our model outperforms baseline methods, effectively combining user interaction history with item visual and textual modalities to provide personalized recommendations.
Keywords: large language models; vision language models; sequential recommendation; instruction tuning
OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation
14
Authors: Jinzheng Yu, Yang Xu, Haozhen Li, Junqi Li, Ligu Zhu, Hao Shen, Lei Shi. Computers, Materials & Continua, 2026, No. 4, pp. 1403-1427 (25 pages)
Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises. While large language models (LLMs) enable automated report generation, this specific domain lacks formal task definitions and corresponding benchmarks. To bridge this gap, we define the Automated Online Public Opinion Report Generation (OPOR-Gen) task and construct OPOR-Bench, an event-centric dataset with 463 crisis events across 108 countries (comprising 8.8K news articles and 185K tweets). To evaluate report quality, we propose OPOR-Eval, a novel agent-based framework that simulates human expert evaluation. Validation experiments show OPOR-Eval achieves a high Spearman's correlation (ρ = 0.70) with human judgments, though challenges in temporal reasoning persist. This work establishes an initial foundation for advancing automated public opinion reporting research.
Keywords: online public opinion reports; crisis management; large language models; agent-based evaluation
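For readers unfamiliar with the Spearman correlation quoted in this abstract, the following is a minimal sketch of how agreement between an automated evaluator and human judges can be measured. It is not the OPOR-Eval implementation: the helper names are hypothetical and the scores in the usage lines are invented purely for illustration. The coefficient is computed as the Pearson correlation of average ranks, which handles tied scores.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared = (pos + end) / 2.0 + 1.0
        for k in range(pos, end + 1):
            ranks[order[k]] = shared
        pos = end + 1
    return ranks


def spearman_rho(xs, ys):
    """Spearman's rho: Pearson correlation computed on the rank vectors."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    rx, ry = average_ranks(xs), average_ranks(ys)
    mean_x, mean_y = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one sequence is constant")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical report scores (made up for illustration, not from the paper).
agent_scores = [3.5, 4.0, 2.0, 4.5, 3.0, 1.5]
human_scores = [3.0, 4.5, 2.5, 4.0, 2.0, 1.0]
print(f"rho = {spearman_rho(agent_scores, human_scores):.2f}")
```

A rank-based measure fits this setting because automated and human scores may use different scales; only the ordering of reports has to agree.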
LLMKB: Large Language Models with Knowledge Base Augmentation for Conversational Recommendation
15
Authors: FANG Xiu, QIU Sijia, SUN Guohao, LU Jinhu. Journal of Donghua University (English Edition), 2026, No. 1, pp. 91-103 (13 pages)
Conversational recommender systems (CRSs) focus on refining preferences and providing personalized recommendations through natural language interactions and dialogue history. Large language models (LLMs) have shown outstanding performance across various domains, prompting researchers to investigate their applicability in recommendation systems. However, due to the lack of task-specific knowledge and an inefficient feature extraction process, LLMs still have suboptimal performance in recommendation tasks. Therefore, external knowledge sources, such as knowledge graphs (KGs) and knowledge bases (KBs), are often introduced to address the issue of data sparsity. Compared to KGs, KBs possess higher retrieval efficiency, making them more suitable for scenarios where LLMs serve as recommenders. To this end, we introduce a novel framework integrating LLMs with KBs for enhanced retrieval generation, namely LLMKB. LLMKB initially leverages structured knowledge to create mapping dictionaries, extracting entity-relation information from heterogeneous knowledge to construct KBs. Then, LLMKB achieves the embedding calibration between user information representations and documents in KBs through retrieval model fine-tuning. Finally, LLMKB employs retrieval-augmented generation to produce recommendations based on fused text inputs, followed by post-processing. Experimental results on two public CRS datasets demonstrate the effectiveness of our framework. Our code is publicly available at https://anonymous.4open.science/r/LLMKB-6FD0.
Keywords: recommender system; large language model (LLM); knowledge base (KB)
A Deep Learning–Based Bias Correction Model for Tropical Cyclone Track and Intensity towards Forecasting of the TianXing Large Weather Model
16
作者 Shijin YUAN Xingzhou WANG +3 位作者 Bin MU Guansong WANG Zeyi NIU Hao LI 《Advances in Atmospheric Sciences》 2026年第3期612-630,共19页
Accurate forecasting of tropical cyclone (TC) tracks and intensities is essential. Although the TianXing large weather model, a six-hourly forecasting model surpassing operational forecasts, exhibits superior performance, its TC forecasts still require enhancement. Prediction errors persist due to biases in the training data and smoothing effects in data-driven methods. To address this, we introduce CycloneBCNet, a deep-learning model designed to correct TianXing's TC forecast biases by leveraging spatial and temporal data. CycloneBCNet utilizes the SimVP (simpler yet better video prediction) framework with spatial attention to highlight cyclone core regions in forecast fields. It also incorporates TC trend information (center position, maximum wind speed, and minimum sea level pressure) via an LSTM (long short-term memory) module. These TC vectors are derived from post-processed TianXing forecasts. By fusing features from forecast fields and TC vectors, CycloneBCNet corrects biases across multiple lead times. At a 96-h lead time, the track error reduces from 162.4 to 86.4 km, the wind speed error from 17.2 to 6.69 m s⁻¹, and the pressure error from 22.2 to 9.36 hPa. Interpretability analysis shows that CycloneBCNet adjusts its attention across forecast lead times. Intensity corrections prioritize inner-core dynamics, particularly the eye and eyewall, while track corrections shift from lower-level variables and the cyclone's core to broader environmental factors and mid- to upper-level features as the forecast duration increases. These findings demonstrate that CycloneBCNet effectively captures key TC dynamics consistent with meteorological principles, including the dominance of near-surface conditions for intensity and the increasing influence of steering currents on track prediction.
Keywords: tropical cyclone; TianXing large weather model; bias correction; interpretability analysis; deep learning-based model
Beyond Accuracy: Evaluating and Explaining the Capability Boundaries of Large Language Models in Syntax-Preserving Code Translation
17
Authors: Yaxin Zhao, Qi Han, Hui Shu, Yan Guang. Computers, Materials & Continua, 2026, Issue 2, pp. 1371-1394 (24 pages)
Large Language Models (LLMs) are increasingly applied in the field of code translation. However, existing evaluation methodologies suffer from two major limitations: (1) the high overlap between test data and pretraining corpora, which introduces significant bias in performance evaluation; and (2) mainstream metrics focus primarily on surface-level accuracy, failing to uncover the underlying factors that constrain model capabilities. To address these issues, this paper presents TCode (Translation-Oriented Code Evaluation benchmark), a complexity-controllable, contamination-free benchmark dataset for code translation, alongside a dedicated static feature sensitivity evaluation framework. The dataset is carefully designed to control complexity along multiple dimensions, including syntactic nesting and expression intricacy, enabling both broad coverage and fine-grained differentiation of sample difficulty. This design supports precise evaluation of model capabilities across a wide spectrum of translation challenges. The proposed evaluation framework introduces a correlation-driven analysis mechanism based on static program features, enabling predictive modeling of translation success from two perspectives: Code Form Complexity (e.g., code length and character density) and Semantic Modeling Complexity (e.g., syntactic depth, control-flow nesting, and type system complexity). Empirical evaluations across representative LLMs, including Qwen2.5-72B and Llama3.3-70B, demonstrate that although state-of-the-art models achieve over 80% compilation success on simple samples, their accuracy drops sharply to below 40% on complex cases. Further correlation analysis indicates that Semantic Modeling Complexity alone is correlated with up to 60% of the variance in translation success, with static program features exhibiting nonlinear threshold effects that highlight clear capability boundaries. This study departs from the traditional accuracy-centric evaluation paradigm and, for the first time, systematically characterizes the capabilities of large language models in translation tasks through the lens of program static features. The findings provide actionable insights for model refinement and training strategy development.
Keywords: large language models (LLMs); code translation; compiler testing; program analysis; complexity-based evaluation
Clinical decision and prescription generation for diarrhea in traditional Chinese medicine based on large language model
18
Authors: Jiaze Wu, Hao Liang, Haoran Dai, Hongliang Rui, Baoli Liu. Digital Chinese Medicine, 2026, Issue 1, pp. 13-30 (18 pages)
Objective: To develop a clinical decision and prescription generation system (CDPGS) specifically for diarrhea in traditional Chinese medicine (TCM), utilizing a specialized large language model (LLM), Qwen-TCM-Dia, to standardize diagnostic processes and prescription generation. Methods: Two primary datasets were constructed: an evaluation benchmark and a fine-tuning dataset consisting of fundamental diarrhea knowledge, medical records, and chain-of-thought (CoT) reasoning datasets. After an initial evaluation of 16 open-source LLMs across inference time, accuracy, and output quality, Qwen2.5 was selected as the base model due to its superior overall performance. We then employed a two-stage low-rank adaptation (LoRA) fine-tuning strategy, integrating continued pre-training on domain-specific knowledge with instruction fine-tuning using CoT-enriched medical records. This approach was designed to embed the clinical logic (symptoms → pathogenesis → therapeutic principles → prescriptions) into the model's reasoning capabilities. The resulting fine-tuned model, specialized for TCM diarrhea, was designated as Qwen-TCM-Dia. Model performance was evaluated for disease diagnosis and syndrome type differentiation using accuracy, precision, recall, and F1-score. Furthermore, the quality of the generated prescriptions was compared with that of established open-source TCM LLMs. Results: Qwen-TCM-Dia achieved peak performance compared to both the base Qwen2.5 model and five other open-source TCM LLMs. It achieved 97.05% accuracy and 91.48% F1-score in disease diagnosis, and 74.54% accuracy and 74.21% F1-score in syndrome type differentiation. Compared with existing open-source TCM LLMs (BianCang, HuangDi, LingDan, TCMLLM-PR, and ZhongJing), Qwen-TCM-Dia exhibited higher fidelity in reconstructing the "symptoms → pathogenesis → therapeutic principles → prescriptions" logic chain. It provided complete prescriptions, whereas other models often omitted dosages or generated mismatched prescriptions. Conclusion: By integrating continued pre-training, CoT reasoning, and a two-stage fine-tuning strategy, this study establishes a CDPGS for diarrhea in TCM. The results demonstrate the synergistic effect of strengthening domain representation through pre-training and activating logical reasoning via CoT. This research not only provides critical technical support for the standardized diagnosis and treatment of diarrhea but also offers a scalable paradigm for the digital inheritance of expert TCM experience and the intelligent transformation of TCM.
Keywords: diarrhea; traditional Chinese medicine; large language model; clinical decision and prescription generation; natural language processing
Rock Magnetic Characterization of the Seismogenic Environment of the Large Earthquake within Wenchuan Earthquake Fault Scientific Drilling Borehole 2 Cores
19
Authors: ZHANG Lei, LI Haibing, SUN Zhiming, CAO Yong, XU Peng, LI Chunrui, WANG Huan, ZHENG Yong, SI Jialiang. Acta Geologica Sinica (English Edition), 2026, Issue 1, pp. 251-264 (14 pages)
The Yingxiu-Beichuan fault zone (YBFZ) has long been active and experienced repeated large earthquakes. The physicochemical properties of the deep fault zone (>1000 m) are the key to understanding the deformation mechanism of large earthquakes. This study uses rock magnetic, microstructural, and geochemical analyses of representative samples exposed in FZ1681 within the Wenchuan Earthquake Fault Scientific Drilling borehole 2 (WFSD-2) cores. Fault gouge and fault breccia have higher magnetic susceptibility values than wall rocks, and they contain abundant paramagnetic minerals and small quantities of magnetite and monoclinic pyrrhotite. The magnetite and monoclinic pyrrhotite in the fault gouge were mainly formed by coseismic frictional heating, indicating that large earthquakes with frictional heating temperatures of ~500-900 ℃ once occurred in the YBFZ. The seismogenic and coseismic environment was reducing with a relatively high sulfur content. The monoclinic pyrrhotite in the fault breccia was formed mainly by low-temperature hydrothermal fluid. This indicates that the fault zone experienced reducing and low-temperature (<400 ℃) hydrothermal fluid with a relatively high sulfur content after the earthquake. The YBFZ, which experiences frequent large earthquakes, is a weakly oxidizing environment at different depths, but the effect of the low-temperature hydrothermal fluid is weaker at depth.
Keywords: fault gouge; rock magnetism; large earthquake; Wenchuan Earthquake Fault Scientific Drilling; Longmen Shan Thrust Belt
Comparing of small and large optimal tapered cascades for supplying enriched uranium for fresh fuel production in the equilibrium cycle of a nuclear power reactor
20
Authors: S.L. Mirmohammadi, J. Safdari, A.A. Ghorbanpour Khamseh. Nuclear Science and Techniques, 2026, Issue 3, pp. 208-234 (27 pages)
One of the main issues in designing optimum tapered cascades for uranium enrichment for annual fuel production in a power reactor is whether to employ large (fat) or small (thin) cascades. What are the permissible and optimal ranges of the number of machines that can be used in a cascade? For the first time, the permissible and optimal ranges of the number of gas centrifuges that can be utilized in a cascade were investigated using two types of centrifuges, and the performance of small and large tapered cascades was discussed. The particle swarm optimization (PSO) algorithm was used to optimize the tapered cascades. The results show: (1) For the first centrifuge, 41 cascades (91 ≤ n ≤ 4897) and for the second centrifuge, 49 cascades (18 ≤ n ≤ 3839) with small and large sizes can be used in enrichment facilities, and the best cascade for them has 530 (with 23 stages) and 39 (with 7 stages) centrifuges, respectively. (2) For both centrifuges, when n ≥ 600 (where n is the number of centrifuges), the changes in large-cascade performance are relatively insignificant. (3) For both types of gas centrifuges, the annual loss of separation power in enrichment facilities is approximately 1.25%-4.82% of the total separation work required.
Keywords: small tapered cascade (thin); large tapered cascade (fat); enriched uranium fuel; power reactor; PSO algorithm