期刊文献+
共找到932,566篇文章
< 1 2 250 >
每页显示 20 50 100
A decision framework for rural domestic sewage treatment models and process:Evidence from Inner Mongolia Autonomous Region,China 被引量:1
1
作者 Ying Yan Pengyu Li +5 位作者 Zixuan Wang Yubo Tan Tianlong Zheng Jianguo Liu Xiaoxia Yang Junxin Liu 《Journal of Environmental Sciences》 2026年第1期302-311,共10页
Rural domestic sewage treatment is critical for environmental protection.This study defines the spatial pattern of villages from the perspective of rural sewage treatment and develops an integrated decision-making sys... Rural domestic sewage treatment is critical for environmental protection.This study defines the spatial pattern of villages from the perspective of rural sewage treatment and develops an integrated decision-making system to propose a sewage treatment mode and scheme suitable for local conditions.By considering the village spatial layout and terrain factors,a decision tree model of residential density and terrain type was constructed with accuracies of 76.47%and 96.00%,respectively.Combined with binary classification probability unit regression,an appropriate sewage treatment mode for the village was determined with 87.00%accuracy.The Analytic Hierarchy Process(AHP),combined with the Technique for Order Preference(TOPSIS)by Similarity to an Ideal Solution model,formed the basis for optimal treatment process selection under different emission standards.Verification was conducted in 542 villages across three counties of the Inner Mongolia Autonomous Region,focusing on the standard effluent effect(0.3773),low investment cost(0.3196),and high standard effluent effect(0.5115)to determine the best treatment process for the same emission standard under different needs.The annual environmental and carbon emission benefits of sewage treatment in these villages were estimated.This model matches village density,geographic feature,and social development level,and provides scientific support and a theoretical basis for rural sewage treatment decision-making. 展开更多
关键词 Rural domestic sewage Sewage treatment model DECISION-MAKING Environmental-economic benefits Inner Mongolia
原文传递
Ecosystem service models are indeed being validated:A response to Pereira et al.(2025)
2
作者 James M.Bullock Danny A.P.Hooftman +1 位作者 John W.Redhead Simon Willcock 《Geography and Sustainability》 2026年第1期247-248,共2页
In their recent paper Pereira et al.(2025)claim that validation is overlooked in mapping and modelling of ecosystem services(ES).They state that“many studies lack critical evaluation of the results and no validation ... In their recent paper Pereira et al.(2025)claim that validation is overlooked in mapping and modelling of ecosystem services(ES).They state that“many studies lack critical evaluation of the results and no validation is provided”and that“the validation step is largely overlooked”.This assertion may have been true several years ago,for example,when Ochoa and Urbina-Cardona(2017)made a similar observation.However,there has been much work on ES model validation over the last decade. 展开更多
关键词 evaluation MAPPING modeling es model ecosystem services VALIDATION
在线阅读 下载PDF
CIT-Rec:Enhancing Sequential Recommendation System with Large Language Models
3
作者 Ziyu Li Zhen Chen +2 位作者 Xuejing Fu Tong Mo Weiping Li 《Computers, Materials & Continua》 2026年第3期2328-2343,共16页
Recommendation systems are key to boosting user engagement,satisfaction,and retention,particularly on media platforms where personalized content is vital.Sequential recommendation systems learn from user-item interact... Recommendation systems are key to boosting user engagement,satisfaction,and retention,particularly on media platforms where personalized content is vital.Sequential recommendation systems learn from user-item interactions to predict future items of interest.However,many current methods rely on unique user and item IDs,limiting their ability to represent users and items effectively,especially in zero-shot learning scenarios where training data is scarce.With the rapid development of Large Language Models(LLMs),researchers are exploring their potential to enhance recommendation systems.However,there is a semantic gap between the linguistic semantics of LLMs and the collaborative semantics of recommendation systems,where items are typically indexed by IDs.Moreover,most research focuses on item representations,neglecting personalized user modeling.To address these issues,we propose a sequential recommendation framework using LLMs,called CIT-Rec,a model that integrates Collaborative semantics for user representation and Image and Text information for item representation to enhance Recommendations.Specifically,by aligning intuitive image information with text containing semantic features,we can more accurately represent items,improving item representation quality.We focus not only on item representations but also on user representations.To more precisely capture users’personalized preferences,we use traditional sequential recommendation models to train on users’historical interaction data,effectively capturing behavioral patterns.Finally,by combining LLMs and traditional sequential recommendation models,we allow the LLM to understand linguistic semantics while capturing collaborative semantics.Extensive evaluations on real-world datasets show that our model outperforms baseline methods,effectively combining user interaction history with item visual and textual modalities to provide personalized recommendations. 展开更多
关键词 Large language models vision language models sequential recommendation instruction tuning
在线阅读 下载PDF
Research advances in animal models of high-altitude qi-deficiency and blood-stasis pattern
4
作者 Zhixing Wang Xin Shen +3 位作者 Baoying Shen Lijun Huang Jie Huang Chengcai Lai 《Journal of Traditional Chinese Medical Sciences》 2026年第1期19-26,共8页
This study summarizes the theoretical basis,modeling strategies,pathological mechanisms,and therapeutic advances related to high-altitude qi-deficiency and blood-stasis pattern.Traditional concepts such as“qi drives ... This study summarizes the theoretical basis,modeling strategies,pathological mechanisms,and therapeutic advances related to high-altitude qi-deficiency and blood-stasis pattern.Traditional concepts such as“qi drives blood”and“deficiency leads to stasis”closely align with modern evidence demonstrating that hypoxia disrupts energy metabolism,impairs microcirculation,and amplifies inflammation and oxidative stress.Current animal models commonly use hypobaric hypoxia combined with fatigue loading,dietary restriction,ice-water stimulation,or adrenaline injection to mimic the combined effects of qi deficiency,blood stasis,and hypoxic injury.These composite approaches reproduce systemic abnormalities,including reduced arterial oxygen partial pressure,increased blood viscosity,impaired cardiac and pulmonary function,microcirculatory obstruction,and mitochondrial dysfunction.Enhanced inflammatory signaling,oxidative stress,and disturbances in metabolic and epigenetic networks further characterize the pattern.The findings indicate that its pathogenesis arises from multi-system,multi-target interactions rather than a single pathway.Representative herbal formulas,such as Buyang Huanwu decoction,Xuefu Zhuyu decoction,and prescriptions rich in Astragalus membranaceus(Fisch.)Bunge(A.membranaceus,Huang qi)or Salvia miltiorrhiza Bunge(S.miltiorrhiza,Dan Shen)have demonstrated the ability to improve energy metabolism,attenuate endothelial injury,enhance microcirculation,and suppress inflammation through network-level regulation.Future research should focus on standardizing exposure parameters,developing quantitative syndrome evaluation systems,and integrating multi-omics,systems biology and artificial intelligence to improve model reproducibility and mechanistic precision.These efforts may help establish objective criteria for high-altitude qi-deficiency and blood-stasis pattern and support the development of targeted therapeutic strategies. 展开更多
关键词 High-altitude qi-deficiency and blood-stasis pattern Animal model Hypobaric hypoxia model establishment Evaluation system
暂未订购
Do Higher Horizontal Resolution Models Perform Better?
5
作者 Shoji KUSUNOKI 《Advances in Atmospheric Sciences》 2026年第1期259-262,共4页
Climate model prediction has been improved by enhancing model resolution as well as the implementation of sophisticated physical parameterization and refinement of data assimilation systems[section 6.1 in Wang et al.(... Climate model prediction has been improved by enhancing model resolution as well as the implementation of sophisticated physical parameterization and refinement of data assimilation systems[section 6.1 in Wang et al.(2025)].In relation to seasonal forecasting and climate projection in the East Asian summer monsoon season,proper simulation of the seasonal migration of rain bands by models is a challenging and limiting factor[section 7.1 in Wang et al.(2025)]. 展开更多
关键词 enhancing model resolution refinement data assimilation systems section climate model climate projection higher horizontal resolution seasonal forecasting simulation seasonal migration rain bands model resolution
在线阅读 下载PDF
SDNet:A self-supervised bird recognition method based on large language models and diffusion models for improving long-term bird monitoring
6
作者 Zhongde Zhang Nan Su +3 位作者 Chenxun Deng Yandong Zhao Weiping Liu Qiaoling Han 《Avian Research》 2026年第1期200-215,共16页
The collection and annotation of lar ge-scale bird datasets are resource-intensive and time-consuming processes that significantly limit the scalability and accuracy of biodiversity monitoring systems.While self-super... The collection and annotation of lar ge-scale bird datasets are resource-intensive and time-consuming processes that significantly limit the scalability and accuracy of biodiversity monitoring systems.While self-supervised learning(SSL)has emerged as a promising approach for leveraging unannotated data,current SSL methods face two critical challenges in bird species recognition:(1)long-tailed data distributions that result in poor performance on underrepresented species;and(2)domain shift issues caused by data augmentation strategies designed to mitigate class imbalance.Here we present SDNet,a novel SSL-based bird recognition framework that integrates diffusion models with large language models(LLMs)to overcome these limitations.SDNet employs LLMs to generate semantically rich textual descriptions for tail-class species by prompting the models with species taxonomy,morphological attributes,and habitat information,producing detailed natural language priors that capture fine-grained visual characteristics(e.g.,plumage patterns,body proportions,and distinctive markings).These textual descriptions are subsequently used by a conditional diffusion model to synthesize new bird image samples through cross-attention mechanisms that fuse textual embeddings with intermediate visual feature representations during the denoising process,ensuring generated images preserve species-specific morphological details while maintaining photorealistic quality.Additionally,we incorporate a Swin Transformer as the feature extraction backbone whose hierarchical window-based attention mechanism and shifted windowing scheme enable multi-scale local feature extraction that proves particularly effective at capturing finegrained discriminative patterns(such as beak shape and feather texture)while mitigating domain shift between synthetic and original images through consistent feature representations across both data sources.SDNet is validated on both a self-constructed dataset(Bird_BXS)an d a publicly available benchmark(Birds_25),demonstrating substantial improvements over conventional SSL approaches.Our results indicate that the synergistic integration of LLMs,diffusion models,and the Swin Transformer architecture contributes significantly to recognition accuracy,particularly for rare and morphologically similar species.These findings highlight the potential of SDNet for addressing fundamental limitations of existing SSL methods in avian recognition tasks and establishing a new paradigm for efficient self-supervised learning in large-scale ornithological vision applications. 展开更多
关键词 Biodiversity conservation Bird intelligent monitoring Diffusion models Large-scale language models Long-tailed learning Self-supervised learning
在线阅读 下载PDF
Secured-FL:Blockchain-Based Defense against Adversarial Attacks on Federated Learning Models
7
作者 Bello Musa Yakubu Nor Shahida Mohd Jamail +1 位作者 Rabia Latif Seemab Latif 《Computers, Materials & Continua》 2026年第3期734-757,共24页
Federated Learning(FL)enables joint training over distributed devices without data exchange but is highly vulnerable to attacks by adversaries in the form of model poisoning and malicious update injection.This work pr... Federated Learning(FL)enables joint training over distributed devices without data exchange but is highly vulnerable to attacks by adversaries in the form of model poisoning and malicious update injection.This work proposes Secured-FL,a blockchain-based defensive framework that combines smart contract-based authentication,clustering-driven outlier elimination,and dynamic threshold adjustment to defend against adversarial attacks.The framework was implemented on a private Ethereum network with a Proof-of-Authority consensus algorithm to ensure tamper-resistant and auditable model updates.Large-scale simulation on the Cyber Data dataset,under up to 50%malicious client settings,demonstrates Secured-FL achieves 6%-12%higher accuracy,9%-15%lower latency,and approximately 14%less computational expense compared to the PPSS benchmark framework.Additional tests,including confusion matrices,ROC and Precision-Recall curves,and ablation tests,confirm the interpretability and robustness of the defense.Tests for scalability also show consistent performance up to 500 clients,affirming appropriateness to reasonably large deployments.These results make Secured-FL a feasible,adversarially resilient FL paradigm with promising potential for application in smart cities,medicine,and other mission-critical IoT deployments. 展开更多
关键词 Federated learning(FL) blockchain FL based privacy model defense FL model security ethereum smart contract
在线阅读 下载PDF
Recent Advances and Prospects in Research of In Vitro 3D Functional Skin Tissue Models
8
作者 Li Tao Zhang Liqing 《China Detergent & Cosmetics》 2026年第1期75-88,共14页
With the increasing demand for understanding skin physiology and advancing regenerative medicine,in vitro three-dimensional(3D)functional skin tissue models have become vital tools in dermatological research.These mod... With the increasing demand for understanding skin physiology and advancing regenerative medicine,in vitro three-dimensional(3D)functional skin tissue models have become vital tools in dermatological research.These models effectively mimic the complex structure and functions of human skin.This review comprehensively discusses the latest advancements in construction techniques,material selection,and applications of 3D skin models.It highlights the advantages and challenges associated with cutting-edge technologies such as layer-by-layer cell coating,3D bioprinting,bio-spray technology,and photolithographic microfabrication in creating highly realistic skin models.Moreover,it examines the wide-ranging applications of 3D skin models,includingelucidation of skin disease mechanisms,investigation of skin barrier functions,studies on skin aging and repair,hair regeneration,efficacy screening of therapeutic agents,cosmetic safety assessment,and personalized medicine.Finally,this review anticipates future trends in developing 3D skin models with greater structural and functional complexity,enhanced multifunctionality,and improved clinical translation. 展开更多
关键词 3D skin models tissue engineering BIOPRINTING skin barrier disease modeling drug screening hair regeneration skin aging
暂未订购
Evaluating the Shanghai Typhoon Model against State-of-the-Art Machine-Learning Weather Prediction Models:A Case Study for Typhoon Danas(2025)
9
作者 Zeyi NIU Wei HUANG +5 位作者 Yuhua YANG Mengqi YANG Lin DENG Haibo WANG Hong LI Xu ZHANG 《Advances in Atmospheric Sciences》 2026年第4期744-750,共7页
This study traces the development of the Shanghai Typhoon Model(SHTM)from a traditional physics-based regional model toward a data-driven,machine-learning typhoon forecasting system.After upgrading its initial and bou... This study traces the development of the Shanghai Typhoon Model(SHTM)from a traditional physics-based regional model toward a data-driven,machine-learning typhoon forecasting system.After upgrading its initial and boundary conditions,SHTM now leverages large-scale constraints from machine-learning weather prediction(MLWP)models,resulting in an ML–physics hybrid framework.During Typhoon Danas(2025),the hybrid SHTM achieves substantially lower track errors than both the advanced ECMWF Integrated Forecasting System(IFS)and leading MLWP models such as PanGu and FuXi.Furthermore,the hybrid SHTM consistently maintains mean track errors below 200 km up to a forecast lead time of 108 hours,representing a significant advancement in forecast accuracy.In addition,this study highlights the technical roadmap for transitioning from a physics-based typhoon model to a fully data-driven ML typhoon forecast system.It also emphasizes that advances in the physical modeling framework provide a critical foundation for further improving the performance of future data-driven ML typhoon models. 展开更多
关键词 Shanghai Typhoon model(SHTM) machine-learning weather prediction machine learning-physics hybrid model
在线阅读 下载PDF
Hepatitis C Patient Education:Large Language Models Show Promise in Disseminating Guidelines
10
作者 Jinyan Chen Ruijie Zhao +10 位作者 Chiyu He Huigang Li Yajie You Zuyuan Lin Ze Xiang Jianyong Zhuo Wei Shen Zhihang Hu Shusen Zheng Xiao Xu Di Lu 《Journal of Clinical and Translational Hepatology》 2026年第1期116-119,共4页
This study evaluated the accuracy,completeness,and comprehensibility of responses from mainstream large language models(LLMs)to hepatitis C virus(HCV)-related questions,aiming to assess their performance in addressing... This study evaluated the accuracy,completeness,and comprehensibility of responses from mainstream large language models(LLMs)to hepatitis C virus(HCV)-related questions,aiming to assess their performance in addressing patient queries about disease and lifestyle behaviors.The models selected were ChatGPT-4o,Gemini 2.0 Pro,Claude 3.5 Sonnet,and DeepSeek V3,with 12 questions chosen by two HCV experts from the domains of prevention,diagnosis,and treatment. 展开更多
关键词 addressing patient queries disease lifestyle behaviorsthe large language models large language models llms GUIDELINES hepatitis C accuracy patient education COMPREHENSIBILITY
原文传递
Review of machine learning tight-binding models:Route to accurate and scalable electronic simulations
11
作者 Jijie Zou Zhanghao Zhouyin +1 位作者 Shishir Kumar Pandey Qiangqiang Gu 《Chinese Physics B》 2026年第1期2-12,共11页
The rapid advancement of machine learning based tight-binding Hamiltonian(MLTB)methods has opened new avenues for efficient and accurate electronic structure simulations,particularly in large-scale systems and long-ti... The rapid advancement of machine learning based tight-binding Hamiltonian(MLTB)methods has opened new avenues for efficient and accurate electronic structure simulations,particularly in large-scale systems and long-time scenarios.This review begins with a concise overview of traditional tight-binding(TB)models,including both(semi-)empirical and first-principles approaches,establishing the foundation for understanding MLTB developments.We then present a systematic classification of existing MLTB methodologies,grouped into two major categories:direct prediction of TB Hamiltonian elements and inference of empirical parameters.A comparative analysis with other ML-based electronic structure models is also provided,highlighting the advancement of MLTB approaches.Finally,we explore the emerging MLTB application ecosystem,highlighting how the integration of MLTB models with a diverse suite of post-processing tools from linear-scaling solvers to quantum transport frameworks and molecular dynamics interfaces is essential for tackling complex scientific problems across different domains.The continued advancement of this integrated paradigm promises to accelerate materials discovery and open new frontiers in the predictive simulation of complex quantum phenomena. 展开更多
关键词 machine learning tight-binding model electronic simulations
原文传递
Transformation of Verbal Descriptions of Process Flows into Business Process Modelling and Notation Models Using Multimodal Artificial Intelligence:Application in Justice
12
作者 Silvia Alayón Carlos Martín +3 位作者 Jesús Torres Manuel Bacallado Rosa Aguilar Guzmán Savirón 《Computer Modeling in Engineering & Sciences》 2026年第2期870-892,共23页
Business Process Modelling(BPM)is essential for analyzing,improving,and automating the flow of information within organizations,but traditional approaches based on manual interpretation are slow,error-prone,and requir... Business Process Modelling(BPM)is essential for analyzing,improving,and automating the flow of information within organizations,but traditional approaches based on manual interpretation are slow,error-prone,and require a high level of expertise.This article proposes an innovative alternative solution that overcomes these limitations by automatically generating comprehensive Business Process Modelling and Notation(BPMN)diagrams solely from verbal descriptions of the processes to be modeled,utilizing Large Language Models(LLMs)and multimodal Artificial Intelligence(AI).Experimental results,based on video recordings of process explanations provided by an expert from an organization(in this case,the Commercial Courts of a public justice administration),demonstrate that the proposed methodology successfully enables the automatic generation of complete and accurate BPMN diagrams,leading to significant improvements in the speed,accuracy,and accessibility of process modeling.This research makes a substantial contribution to the field of business process modeling,as its methodology is groundbreaking in its use of LLMs and multimodal AI capabilities to handle different types of source material(text and video),combining several tools to minimize the number of queries and reduce the complexity of the prompts required for the automatic generation of successful BPMN diagrams. 展开更多
关键词 Process modelling verbal description BPMN LLM multimodal AI
在线阅读 下载PDF
Semantic Causality Evaluation of Correlation Analysis Utilizing Large Language Models
13
作者 Adam Dudáš 《Computers, Materials & Continua》 2026年第5期2246-2269,共24页
It is known that correlation does not imply causality.Some relationships identified in the analysis of data are coincidental or unknown,and some are produced by real-world causality of the situation,which is problemat... It is known that correlation does not imply causality.Some relationships identified in the analysis of data are coincidental or unknown,and some are produced by real-world causality of the situation,which is problematic,since there is a need to differentiate between these two scenarios.Until recently,the proper−semantic−causality of the relationship could have been determined only by human experts from the area of expertise of the studied data.This has changed with the advance of large language models,which are often utilized as surrogates for such human experts,making the process automated and readily available to all data analysts.This motivates the main objective of this work,which is to introduce the design and implementation of a large language model-based semantic causality evaluator based on correlation analysis,together with its visual analysis model called Causal heatmap.After the implementation itself,the model is evaluated from the point of view of the quality of the visual model,from the point of view of the quality of causal evaluation based on large language models,and from the point of view of comparative analysis,while the results reached in the study highlight the usability of large language models in the task and the potential of the proposed approach in the analysis of unknown datasets.The results of the experimental evaluation demonstrate the usefulness of the Causal heatmap method,supported by the evident highlighting of interesting relationships,while suppressing irrelevant ones. 展开更多
关键词 CORRELATION CAUSALITY correlation analysis large language models VISUALIZATION
在线阅读 下载PDF
Recent advances in animal models for pathological scar research:A comprehensive review of experimental approaches and translational relevance
14
作者 Diana-Larisa Ancuța Mariana Văduva +1 位作者 Cristin Coman Iuliana Caraș 《Animal Models and Experimental Medicine》 2026年第1期59-71,共13页
Pathological scarring,manifested in the form of hypertrophic scars(HTS)and keloid scars(KS),represents a major clinical challenge due to its aesthetic and functional implications for patients.Understanding the molecul... Pathological scarring,manifested in the form of hypertrophic scars(HTS)and keloid scars(KS),represents a major clinical challenge due to its aesthetic and functional implications for patients.Understanding the molecular mechanisms involved in these types of scars and developing effective treatments requires the use of controlled ex-perimental models,especially animals,to overcome the limitations of clinical studies.The aim of this sistematic review is to critically analyze the animal models used in the last five years(2020-2025)for the study of pathological scars,highlighting their advantages,limitations and applicability in the development of new therapeutic strat-egies.Murine,rabbit and porcine models,as well as alternative models,offer varied perspectives on the formation and treatment of HTS and KS,with an emphasis on histological and molecular correlations with human pathology.By synthesizing recent data,the paper highlights the essential role of preclinical research in optimizing an-tifibrotic treatments and in advancing the translation of data into the clinical sphere.Overall,animal models remain essential for bridging mechanistic insights with clinical translation,supporting the development of more effective and personalized anti-scar therapies. 展开更多
关键词 animal model EXPERIMENT hypertrophic scar keloid scar TRANSLATION
暂未订购
A Survey on Medical Competence Evaluation Benchmarks for Large Language Models
15
作者 Qiting Wang Huiru Zou +3 位作者 Haobin Zhang Yongshun Huang Junzhang Tian Weibin Cheng 《Health Care Science》 2026年第1期4-18,共15页
Large language models(LLMs)show considerable potential to revolutionize healthcare through their performance across diverse clinical applications.Given the inherent constraints of LLMs and the critical nature of medic... Large language models(LLMs)show considerable potential to revolutionize healthcare through their performance across diverse clinical applications.Given the inherent constraints of LLMs and the critical nature of medical practice,a rigorous and systematic evaluation of their medical competence is imperative.This study presents a comprehensive review of the established methodologies and benchmarks for evaluating the medical competence of LLMs,encompassing a thorough analysis of current assessment practices across medical knowledge,clinical practice competence,and ethical-safety considerations.By integrating clinician competency assessment frameworks into LLMs evaluation,we propose a structured tri-dimensional framework that systematically organizes existing evaluation approaches according to medical theoretical knowledge,clinical practice ability,and ethical-safety considerations.Furthermore,this research provides critical insights into future developmental trajectories while establishing foundational frameworks and standardization protocols for the integration of LLMs into medical practice. 展开更多
关键词 BENCHMARK large language model medical competence ABSTRACT
在线阅读 下载PDF
Test for Varying-Coefficient Models with High-Dimensional Data
16
作者 YANG Lin GAO Yuzhao QU Lianqiang 《Journal of Systems Science & Complexity》 2026年第1期203-229,共27页
The authors consider the issue of hypothesis testing in varying-coefficient regression models with high-dimensional data.Utilizing kernel smoothing techniques,the authors propose a locally concerned U-statistic method... The authors consider the issue of hypothesis testing in varying-coefficient regression models with high-dimensional data.Utilizing kernel smoothing techniques,the authors propose a locally concerned U-statistic method to assess the overall significance of the coefficients.The authors establish that the proposed test is asymptotically normal under both the null hypothesis and local alternatives.Based on the locally concerned U-statistic,the authors further develop a globally concerned U-statistic to test whether the coefficient function is zero.A stochastic perturbation method is employed to approximate the distribution of the globally concerned test statistic.Monte Carlo simulations demonstrate the validity of the proposed test in finite samples. 展开更多
关键词 Hypothesis testing high-dimensional data kernel smoothing U-STATISTIC varying-coefficient models
原文传递
Decision-making performance of large language models vs.human physicians in challenging lung cancer cases:A real-world case-based study
17
作者 Ning Yang Kailai Li +19 位作者 Baiyang Liu Xiting Chen Aimin Jiang Chang Qi Wenyi Gan Lingxuan Zhu Weiming Mou Dongqiang Zeng Mingjia Xiao Guangdi Chu Shengkun Peng Hank ZHWong Lin Zhang Hengguo Zhang Xinpei Deng Quan Cheng Bufu Tang Anqi Lin Juan Zhou Peng Luo 《Intelligent Oncology》 2026年第1期15-24,共10页
Background:Despite the promise shown by large language models(LLMs)for standardized tasks,their multidimensional performance in real-world oncology decision-making remains unevaluated.This study aims to introduce a fr... Background:Despite the promise shown by large language models(LLMs)for standardized tasks,their multidimensional performance in real-world oncology decision-making remains unevaluated.This study aims to introduce a framework for evaluating LLMs and physician decisions in challenging lung cancer cases.Methods:We curated 50 challenging lung cancer cases(25 local and 25 published)classified as complex,rare,or refractory.Blinded three-dimensional,five-point Likert evaluations(1–5 for comprehensiveness,specificity,and readability)compared standalone LLMs(DeepSeek R1,Claude 3.5,Gemini 1.5,and GPT-4o),physicians by experience level(junior,intermediate,and senior),and AI-assisted juniors;intergroup differences and augmentation effects were analyzed statistically.Results:Of 50 challenging cases(18 complex,17 rare,and 15 refractory)rated by three experts,DeepSeek R1 achieved scores of 3.95±0.33,3.71±0.53,and 4.26±0.18 for comprehensiveness,specificity,and readability,respectively,positioning it between intermediate(3.68,3.68,3.75)and senior(4.50,4.64,4.53)physicians.GPT-4o and Claude 3.5 reached intermediate physician–level comprehensiveness(3.76±0.39,3.60±0.39)but junior-to-intermediate physician–level specificity(3.39±0.39,3.39±0.49).All LLMs scored higher on rare cases than intermediate physicians but fell below junior physicians in refractory-case specificity.AIassisted junior physicians showed marked gains in rare cases,with comprehensiveness rising from 2.32 to 4.29(84.8%),specificity from 2.24 to 4.26(90.8%),and readability from 2.76 to 4.59(66.0%),while specificity declined by 3.2%(3.17 to 3.07)in refractory cases.Error analysis showed complementary strengths,with physicians demonstrating reasoning stability and LLMs excelling in knowledge updating and risk management.Conclusions:LLMs performed variably in clinical decision-making tasks depending on case type,performing better in rare cases and worse in refractory cases requiring longitudinal reasoning.Complementary strengths between LLMs and physicians support case-and task-tailored human–AI collaboration. 展开更多
关键词 Large language models Clinical evaluation DECISION-MAKING Lung cancer
暂未订购
上一页 1 2 250 下一页 到第
使用帮助 返回顶部