期刊文献+
共找到935,205篇文章
< 1 2 250 >
每页显示 20 50 100
CIT-Rec:Enhancing Sequential Recommendation System with Large Language Models
1
作者 Ziyu Li Zhen Chen +2 位作者 Xuejing Fu Tong Mo Weiping Li 《Computers, Materials & Continua》 2026年第3期2328-2343,共16页
Recommendation systems are key to boosting user engagement,satisfaction,and retention,particularly on media platforms where personalized content is vital.Sequential recommendation systems learn from user-item interact... Recommendation systems are key to boosting user engagement,satisfaction,and retention,particularly on media platforms where personalized content is vital.Sequential recommendation systems learn from user-item interactions to predict future items of interest.However,many current methods rely on unique user and item IDs,limiting their ability to represent users and items effectively,especially in zero-shot learning scenarios where training data is scarce.With the rapid development of Large Language Models(LLMs),researchers are exploring their potential to enhance recommendation systems.However,there is a semantic gap between the linguistic semantics of LLMs and the collaborative semantics of recommendation systems,where items are typically indexed by IDs.Moreover,most research focuses on item representations,neglecting personalized user modeling.To address these issues,we propose a sequential recommendation framework using LLMs,called CIT-Rec,a model that integrates Collaborative semantics for user representation and Image and Text information for item representation to enhance Recommendations.Specifically,by aligning intuitive image information with text containing semantic features,we can more accurately represent items,improving item representation quality.We focus not only on item representations but also on user representations.To more precisely capture users’personalized preferences,we use traditional sequential recommendation models to train on users’historical interaction data,effectively capturing behavioral patterns.Finally,by combining LLMs and traditional sequential recommendation models,we allow the LLM to understand linguistic semantics while capturing collaborative semantics.Extensive evaluations on real-world datasets show that our model outperforms baseline methods,effectively combining user interaction history with item visual and textual modalities to provide personalized recommendations. 展开更多
关键词 Large language models vision language models sequential recommendation instruction tuning
在线阅读 下载PDF
SDNet:A self-supervised bird recognition method based on large language models and diffusion models for improving long-term bird monitoring
2
作者 Zhongde Zhang Nan Su +3 位作者 Chenxun Deng Yandong Zhao Weiping Liu Qiaoling Han 《Avian Research》 2026年第1期200-215,共16页
The collection and annotation of lar ge-scale bird datasets are resource-intensive and time-consuming processes that significantly limit the scalability and accuracy of biodiversity monitoring systems.While self-super... The collection and annotation of lar ge-scale bird datasets are resource-intensive and time-consuming processes that significantly limit the scalability and accuracy of biodiversity monitoring systems.While self-supervised learning(SSL)has emerged as a promising approach for leveraging unannotated data,current SSL methods face two critical challenges in bird species recognition:(1)long-tailed data distributions that result in poor performance on underrepresented species;and(2)domain shift issues caused by data augmentation strategies designed to mitigate class imbalance.Here we present SDNet,a novel SSL-based bird recognition framework that integrates diffusion models with large language models(LLMs)to overcome these limitations.SDNet employs LLMs to generate semantically rich textual descriptions for tail-class species by prompting the models with species taxonomy,morphological attributes,and habitat information,producing detailed natural language priors that capture fine-grained visual characteristics(e.g.,plumage patterns,body proportions,and distinctive markings).These textual descriptions are subsequently used by a conditional diffusion model to synthesize new bird image samples through cross-attention mechanisms that fuse textual embeddings with intermediate visual feature representations during the denoising process,ensuring generated images preserve species-specific morphological details while maintaining photorealistic quality.Additionally,we incorporate a Swin Transformer as the feature extraction backbone whose hierarchical window-based attention mechanism and shifted windowing scheme enable multi-scale local feature extraction that proves particularly effective at capturing finegrained discriminative patterns(such as beak shape and feather texture)while mitigating domain shift between synthetic and original images through consistent feature representations across both data sources.SDNet is validated on both a self-constructed dataset(Bird_BXS)an d a publicly available benchmark(Birds_25),demonstrating substantial improvements over conventional SSL approaches.Our results indicate that the synergistic integration of LLMs,diffusion models,and the Swin Transformer architecture contributes significantly to recognition accuracy,particularly for rare and morphologically similar species.These findings highlight the potential of SDNet for addressing fundamental limitations of existing SSL methods in avian recognition tasks and establishing a new paradigm for efficient self-supervised learning in large-scale ornithological vision applications. 展开更多
关键词 Biodiversity conservation Bird intelligent monitoring Diffusion models Large-scale language models Long-tailed learning Self-supervised learning
在线阅读 下载PDF
Recent Advances and Prospects in Research of In Vitro 3D Functional Skin Tissue Models
3
作者 Li Tao Zhang Liqing 《China Detergent & Cosmetics》 2026年第1期75-88,共14页
With the increasing demand for understanding skin physiology and advancing regenerative medicine,in vitro three-dimensional(3D)functional skin tissue models have become vital tools in dermatological research.These mod... With the increasing demand for understanding skin physiology and advancing regenerative medicine,in vitro three-dimensional(3D)functional skin tissue models have become vital tools in dermatological research.These models effectively mimic the complex structure and functions of human skin.This review comprehensively discusses the latest advancements in construction techniques,material selection,and applications of 3D skin models.It highlights the advantages and challenges associated with cutting-edge technologies such as layer-by-layer cell coating,3D bioprinting,bio-spray technology,and photolithographic microfabrication in creating highly realistic skin models.Moreover,it examines the wide-ranging applications of 3D skin models,includingelucidation of skin disease mechanisms,investigation of skin barrier functions,studies on skin aging and repair,hair regeneration,efficacy screening of therapeutic agents,cosmetic safety assessment,and personalized medicine.Finally,this review anticipates future trends in developing 3D skin models with greater structural and functional complexity,enhanced multifunctionality,and improved clinical translation. 展开更多
关键词 3D skin models tissue engineering BIOPRINTING skin barrier disease modeling drug screening hair regeneration skin aging
暂未订购
Semantic Causality Evaluation of Correlation Analysis Utilizing Large Language Models
4
作者 Adam Dudáš 《Computers, Materials & Continua》 2026年第5期2246-2269,共24页
It is known that correlation does not imply causality.Some relationships identified in the analysis of data are coincidental or unknown,and some are produced by real-world causality of the situation,which is problemat... It is known that correlation does not imply causality.Some relationships identified in the analysis of data are coincidental or unknown,and some are produced by real-world causality of the situation,which is problematic,since there is a need to differentiate between these two scenarios.Until recently,the proper−semantic−causality of the relationship could have been determined only by human experts from the area of expertise of the studied data.This has changed with the advance of large language models,which are often utilized as surrogates for such human experts,making the process automated and readily available to all data analysts.This motivates the main objective of this work,which is to introduce the design and implementation of a large language model-based semantic causality evaluator based on correlation analysis,together with its visual analysis model called Causal heatmap.After the implementation itself,the model is evaluated from the point of view of the quality of the visual model,from the point of view of the quality of causal evaluation based on large language models,and from the point of view of comparative analysis,while the results reached in the study highlight the usability of large language models in the task and the potential of the proposed approach in the analysis of unknown datasets.The results of the experimental evaluation demonstrate the usefulness of the Causal heatmap method,supported by the evident highlighting of interesting relationships,while suppressing irrelevant ones. 展开更多
关键词 CORRELATION CAUSALITY correlation analysis large language models VISUALIZATION
在线阅读 下载PDF
Decision-making performance of large language models vs.human physicians in challenging lung cancer cases:A real-world case-based study
5
作者 Ning Yang Kailai Li +19 位作者 Baiyang Liu Xiting Chen Aimin Jiang Chang Qi Wenyi Gan Lingxuan Zhu Weiming Mou Dongqiang Zeng Mingjia Xiao Guangdi Chu Shengkun Peng Hank ZHWong Lin Zhang Hengguo Zhang Xinpei Deng Quan Cheng Bufu Tang Anqi Lin Juan Zhou Peng Luo 《Intelligent Oncology》 2026年第1期15-24,共10页
Background:Despite the promise shown by large language models(LLMs)for standardized tasks,their multidimensional performance in real-world oncology decision-making remains unevaluated.This study aims to introduce a fr... Background:Despite the promise shown by large language models(LLMs)for standardized tasks,their multidimensional performance in real-world oncology decision-making remains unevaluated.This study aims to introduce a framework for evaluating LLMs and physician decisions in challenging lung cancer cases.Methods:We curated 50 challenging lung cancer cases(25 local and 25 published)classified as complex,rare,or refractory.Blinded three-dimensional,five-point Likert evaluations(1–5 for comprehensiveness,specificity,and readability)compared standalone LLMs(DeepSeek R1,Claude 3.5,Gemini 1.5,and GPT-4o),physicians by experience level(junior,intermediate,and senior),and AI-assisted juniors;intergroup differences and augmentation effects were analyzed statistically.Results:Of 50 challenging cases(18 complex,17 rare,and 15 refractory)rated by three experts,DeepSeek R1 achieved scores of 3.95±0.33,3.71±0.53,and 4.26±0.18 for comprehensiveness,specificity,and readability,respectively,positioning it between intermediate(3.68,3.68,3.75)and senior(4.50,4.64,4.53)physicians.GPT-4o and Claude 3.5 reached intermediate physician–level comprehensiveness(3.76±0.39,3.60±0.39)but junior-to-intermediate physician–level specificity(3.39±0.39,3.39±0.49).All LLMs scored higher on rare cases than intermediate physicians but fell below junior physicians in refractory-case specificity.AIassisted junior physicians showed marked gains in rare cases,with comprehensiveness rising from 2.32 to 4.29(84.8%),specificity from 2.24 to 4.26(90.8%),and readability from 2.76 to 4.59(66.0%),while specificity declined by 3.2%(3.17 to 3.07)in refractory cases.Error analysis showed complementary strengths,with physicians demonstrating reasoning stability and LLMs excelling in knowledge updating and risk management.Conclusions:LLMs performed variably in clinical decision-making tasks depending on case type,performing better in rare cases and worse in refractory cases requiring longitudinal reasoning.Complementary strengths between LLMs and physicians support case-and task-tailored human–AI collaboration. 展开更多
关键词 Large language models Clinical evaluation DECISION-MAKING Lung cancer
暂未订购
When Large Language Models and Machine Learning Meet Multi-Criteria Decision Making: Fully Integrated Approach for Social Media Moderation
6
作者 Noreen Fuentes Janeth Ugang +4 位作者 Narcisan Galamiton Suzette Bacus Samantha Shane Evangelista Fatima Maturan Lanndon Ocampo 《Computers, Materials & Continua》 2026年第1期2137-2162,共26页
This study demonstrates a novel integration of large language models,machine learning,and multicriteria decision-making to investigate self-moderation in small online communities,a topic under-explored compared to use... This study demonstrates a novel integration of large language models,machine learning,and multicriteria decision-making to investigate self-moderation in small online communities,a topic under-explored compared to user behavior and platform-driven moderation on social media.The proposed methodological framework(1)utilizes large language models for social media post analysis and categorization,(2)employs k-means clustering for content characterization,and(3)incorporates the TODIM(Tomada de Decisão Interativa Multicritério)method to determine moderation strategies based on expert judgments.In general,the fully integrated framework leverages the strengths of these intelligent systems in a more systematic evaluation of large-scale decision problems.When applied in social media moderation,this approach promotes nuanced and context-sensitive self-moderation by taking into account factors such as cultural background and geographic location.The application of this framework is demonstrated within Facebook groups.Eight distinct content clusters encompassing safety,harassment,diversity,and misinformation are identified.Analysis revealed a preference for content removal across all clusters,suggesting a cautious approach towards potentially harmful content.However,the framework also highlights the use of other moderation actions,like account suspension,depending on the content category.These findings contribute to the growing body of research on self-moderation and offer valuable insights for creating safer and more inclusive online spaces within smaller communities. 展开更多
关键词 Self-moderation user-generated content k-means clustering TODIM large language models
在线阅读 下载PDF
Assessing Large Language Models for Early Article Identification in Otolaryngology—Head and Neck Surgery Systematic Reviews
7
作者 Ajibola B.Bakare Young Lee +2 位作者 Jhuree Hong Claus-Peter Richter Jonathan P.Kuriakose 《Health Care Science》 2026年第1期19-28,共10页
Background:Assess ChatGPT and Bard's effectiveness in the initial identification of articles for Otolaryngology—Head and Neck Surgery systematic literature reviews.Methods:Three PRISMA-based systematic reviews(Ja... Background:Assess ChatGPT and Bard's effectiveness in the initial identification of articles for Otolaryngology—Head and Neck Surgery systematic literature reviews.Methods:Three PRISMA-based systematic reviews(Jabbour et al.2017,Wong et al.2018,and Wu et al.2021)were replicated using ChatGPTv3.5 and Bard.Outputs(author,title,publication year,and journal)were compared to the original references and cross-referenced with medical databases for authenticity and recall.Results:Several themes emerged when comparing Bard and ChatGPT across the three reviews.Bard generated more outputs and had greater recall in Wong et al.'s review,with a broader date range in Jabbour et al.'s review.In Wu et al.'s review,ChatGPT-2 had higher recall and identified more authentic outputs than Bard-2.Conclusion:Large language models(LLMs)failed to fully replicate peer-reviewed methodologies,producing outputs with inaccuracies but identifying relevant,especially recent,articles missed by the references.While human-led PRISMA-based reviews remain the gold standard,refining LLMs for literature reviews shows potential. 展开更多
关键词 artificial intelligence BARD ChatGPT large language models systematic review
暂未订购
Therapeutic Potential of Fingolimod and Dimethyl Fumarate in Preclinical Pancreatic Cancer Models
8
作者 Pauline Gousseau Laurie Genest +1 位作者 Guillaume Froget Tristan Rupp 《Oncology Research》 2026年第3期387-405,共19页
Objectives:The five-year survival rate for pancreatic cancer is notably low,posing a significant challenge to patient health.The primary treatments are radiotherapy and chemotherapy,sometimes combined with targeted th... Objectives:The five-year survival rate for pancreatic cancer is notably low,posing a significant challenge to patient health.The primary treatments are radiotherapy and chemotherapy,sometimes combined with targeted therapy;however,their clinical benefits are limited.Therefore,developing new models to evaluate the therapeutic potential of novel molecules is essential.Fingolimod and Dimethyl Fumarate(DMF),currently used to treat multiple sclerosis,have recently been shown to have anti-cancer effects in several preclinical tumor models.This study aims to evaluate the therapeutic potential of Fingolimod and DMF in pancreatic cancer by investigating their respective in vitro cytotoxicity and in vivo antitumor effects.Methods:In this study,we evaluated for the first time these two drugs in pancreatic preclinical models in vitro using 3D spheroid tumor models and in vivo,which are compared to two standard-of-care consisting of Gemcitabine and Erlotinib.Results:In vitro,both Fingolimod and DMF induced cytotoxicity in spheroids from two pancreatic cell lines.Additionally,Fingolimod and DMF displayed anticancer effects in two subcutaneous xenograft models using PANC-1 and CFPAC-1 cells.Conclusions:Although the responses observed with Fingolimod and DMF were similar to those of Gemcitabine and Erlotinib,these findings indicate a potential emerging interest in Fingolimod and DMF for the treatment of pancreatic cancer.However,further work is still necessary to fully characterize how these drugs affect tumor progression. 展开更多
关键词 Pancreatic cancer preclinical models tumor progression FINGOLIMOD dimethyl Fumarate
暂未订购
Animal models of benign airway stenosis:Advances in construction techniques,evaluation systems,and perspectives
9
作者 Wusheng Zhang Yilin Chen +4 位作者 Chengcheng Yang Yuchao Dong Haidong Huang Hui Shi Chong Bai 《Animal Models and Experimental Medicine》 2026年第2期280-297,共18页
The incidence of benign airway stenosis(BAS)is on the rise,and current treatment options are associated with a significant risk of restenosis.Therefore,there is an urgent need to explore new and effective prevention a... The incidence of benign airway stenosis(BAS)is on the rise,and current treatment options are associated with a significant risk of restenosis.Therefore,there is an urgent need to explore new and effective prevention and treatment methods.Animal models serve as essential tools for investigating disease mechanisms and assessing novel therapeutic strategies,and the scientific rigor of their construction and validation significantly impacts the reliability of research findings.This paper systematically reviews the research progress and evaluation systems of BAS animal models over the past decade,aiming to provide a robust foundation for the optimized construction of BAS models,intervention studies,and clinical translation.This effort is intended to facilitate the innovation and advancement in BAS prevention and treatment strategies. 展开更多
关键词 airway stenosis animal models benign airway stenosis evaluation systems
暂未订购
A comparison of hydrological loading deformations from GRACE mascon and load models with reprocessed IGS station positions
10
作者 Jianhe Feng Yanlin Li +1 位作者 Na Wei Zhao Li 《Geodesy and Geodynamics》 2026年第2期186-196,共11页
Global Navigation Satellite System(GNSS)observations are critical for establishing high-precision terrestrial reference frames(TRF),but the environmental loading effects,particularly hydrological loading deformation(H... Global Navigation Satellite System(GNSS)observations are critical for establishing high-precision terrestrial reference frames(TRF),but the environmental loading effects,particularly hydrological loading deformation(HYLD),remain unaccounted in existing TRF like ITRF2020,limiting their accuracy.This study evaluates the performance of multiple HYLD datasets derived from GRACE(mascon and spherical harmonic(SH)products)and four hydrological models(LSDM,ERA5,GLDAS2,and MERRA2)in explaining seasonal and non-seasonal GNSS displacements globally using IGS Repro3 and Re pro 2datasets.Among these six HYLD datasets,we demonstrate that the GRACE mascon solution achieves superior performance in explaining the seasonal and non-seasonal GNSS displacements,by quantifying the amplitude reduction ratio(AMPR)and root mean square reduction ratio(RMSR)induced by HYLD corrections,respectively.The mascon-derived HYLD achieves better correction,particularly with the vertical median AMPR of 35.1%and RMSR of 4%.In contrast,hydrological models and SH product have relatively lower performance in explaining GNSS displacements,with ERA5 achieving only 24.7%for the ve rtical AMPR.The HYLDs of coastal stations generally exhibit worse perfo rmance with lower AMPR and more negative RMSR distributions,likely reflecting the influence of ocean loading and their limitations in accurately isolating the land water signal within land boundaries;whereas the mascon result shows minimal differences between inland and coastal stations,benefitting from the reduced leakage of land water into the oceans.Furthermore,the transition from Repro2 to the improved reprocessing strategy in Re pro3 enhances the overall consistency between HYLDs and GNSS displacements,specifically with a 7%improvement in the vertical AMPR with MERRA2. 展开更多
关键词 GRACE mascon Hydrological models GNSS displacement Time series analysis
原文传递
Command-agent:Reconstructing warfare simulation and command decision-making using large language models
11
作者 Mengwei Zhang Minchi Kuang +3 位作者 Heng Shi Jihong Zhu Jingyu Zhu Xiao Jiang 《Defence Technology(防务技术)》 2026年第2期294-313,共20页
War rehearsals have become increasingly important in national security due to the growing complexity of international affairs.However,traditional rehearsal methods,such as military chess simulations,are inefficient an... War rehearsals have become increasingly important in national security due to the growing complexity of international affairs.However,traditional rehearsal methods,such as military chess simulations,are inefficient and inflexible,with particularly pronounced limitations in command and decision-making.The overwhelming volume of information and high decision complexity hinder the realization of autonomous and agile command and control.To address this challenge,an intelligent warfare simulation framework named Command-Agent is proposed,which deeply integrates large language models(LLMs)with digital twin battlefields.By constructing a highly realistic battlefield environment through real-time simulation and multi-source data fusion,the natural language interaction capabilities of LLMs are leveraged to lower the command threshold and to enable autonomous command through the Observe-Orient-Decide-Act(OODA)feedback loop.Within the Command-Agent framework,a multimodel collaborative architecture is further adopted to decouple the decision-generation and command-execution functions of LLMs.By combining specialized models such as Deep Seek-R1 and MCTool,the limitations of single-model capabilities are overcome.MCTool is a lightweight execution model fine-tuned for military Function Calling tasks.The framework also introduces a Vector Knowledge Base to mitigate hallucinations commonly exhibited by LLMs.Experimental results demonstrate that Command-Agent not only enables natural language-driven simulation and control but also deeply understands commander intent.Leveraging the multi-model collaborative architecture,during red-blue UAV confrontations involving 2 to 8 UAVs,the integrated score is improved by an average of 41.8%compared to the single-agent system(MCTool),accompanied by a 161.8%optimization in the battle loss ratio.Furthermore,when compared with multi-agent systems lacking the knowledge base,the inclusion of the Vector Knowledge Base further improves overall performance by 16.8%.In comparison with the general model(Qwen2.5-7B),the fine-tuned MCTool leads by 5%in execution efficiency.Therefore,the proposed Command-Agent introduces a novel perspective to the military command system and offers a feasible solution for intelligent battlefield decision-making. 展开更多
关键词 Digital twin battlefield Large language models Multi-agent system Military command
在线阅读 下载PDF
Task-Structured Curriculum Learning for Multi-Task Distillation:Enhancing Step-by-Step Knowledge Transfer in Language Models
12
作者 Ahmet Ezgi Aytug Onan 《Computers, Materials & Continua》 2026年第3期1647-1673,共27页
Knowledge distillation has become a standard technique for compressing large language models into efficient student models,but existing methods often struggle to balance prediction accuracy with explanation quality.Re... Knowledge distillation has become a standard technique for compressing large language models into efficient student models,but existing methods often struggle to balance prediction accuracy with explanation quality.Recent approaches such as Distilling Step-by-Step(DSbS)introduce explanation supervision,yet they apply it in a uniform manner that may not fully exploit the different learning dynamics of prediction and explanation.In this work,we propose a task-structured curriculum learning(TSCL)framework that structures training into three sequential phases:(i)prediction-only,to establish stable feature representations;(ii)joint prediction-explanation,to align task outputs with rationale generation;and(iii)explanation-only,to refine the quality of rationales.This design provides a simple but effective modification to DSbS,requiring no architectural changes and adding negligible training cost.We justify the phase scheduling with ablation studies and convergence analysis,showing that an initial prediction-heavy stage followed by a balanced joint phase improves both stability and explanation alignment.Extensive experiments on five datasets(e-SNLI,ANLI,CommonsenseQA,SVAMP,and MedNLI)demonstrate that TSCL consistently outperforms strong baselines,achieving gains of+1.7-2.6 points in accuracy and 0.8-1.2 in ROUGE-L,corresponding to relative error reductions of up to 21%.Beyond lexical metrics,human evaluation and ERASERstyle faithfulness diagnostics confirm that TSCL produces more faithful and informative explanations.Comparative training curves further reveal faster convergence and lower variance across seeds.Efficiency analysis shows less than 3%overhead in wall-clock training time and no additional inference cost,making the approach practical for realworld deployment.This study demonstrates that a simple task-structured curriculum can significantly improve the effectiveness of knowledge distillation.By separating and sequencing objectives,TSCL achieves a better balance between accuracy,stability,and explanation quality.The framework generalizes across domains,including medical NLI,and offers a principled recipe for future applications in multimodal reasoning and reinforcement learning. 展开更多
关键词 Knowledge distillation curriculum learning language models multi-task learning step-by-step learning
在线阅读 下载PDF
Classification of Job Offers into Job Positions Using O*NET and BERT Language Models
13
作者 Lino Gonzalez-Garcia Miguel-Angel Sicilia Elena García-Barriocanal 《Computers, Materials & Continua》 2026年第2期2133-2147,共15页
Classifying job offers into occupational categories is a fundamental task in human resource information systems,as it improves and streamlines indexing,search,and matching between openings and job seekers.Comprehensiv... Classifying job offers into occupational categories is a fundamental task in human resource information systems,as it improves and streamlines indexing,search,and matching between openings and job seekers.Comprehensive occupational databases such as O∗NET or ESCO provide detailed taxonomies of interrelated positions that can be leveraged to align the textual content of postings with occupational categories,thereby facilitating standardization,cross-system interoperability,and access to metadata for each occupation(e.g.,tasks,knowledge,skills,and abilities).In this work,we explore the effectiveness of fine-tuning existing language models(LMs)to classify job offers with occupational descriptors from O∗NET.This enables a more precise assessment of candidate suitability by identifying the specific knowledge and skills required for each position,and helps automate recruitment processes by mitigating human bias and subjectivity in candidate selection.We evaluate three representative BERT-like models:BERT,RoBERTa,and DeBERTa.BERT serves as the baseline encoder-only architecture;RoBERTa incorporates advances in pretraining objectives and data scale;and DeBERTa introduces architectural improvements through disentangled attention mechanisms.The best performance was achieved with the DeBERTa model,although the other models also produced strong results,and no statistically significant differences were observed acrossmodels.We also find that these models typically reach optimal performance after only a few training epochs,and that training with smaller,balanced datasets is effective.Consequently,comparable results can be obtained with models that require fewer computational resources and less training time,facilitating deployment and practical use. 展开更多
关键词 Occupational databases job offer classification language models O∗NET BERT RoBERTa DeBERTa
在线阅读 下载PDF
Prompt Injection Attacks on Large Language Models:A Survey of Attack Methods,Root Causes,and Defense Strategies
14
作者 Tongcheng Geng Zhiyuan Xu +1 位作者 Yubin Qu W.Eric Wong 《Computers, Materials & Continua》 2026年第4期134-185,共52页
Large language models(LLMs)have revolutionized AI applications across diverse domains.However,their widespread deployment has introduced critical security vulnerabilities,particularly prompt injection attacks that man... Large language models(LLMs)have revolutionized AI applications across diverse domains.However,their widespread deployment has introduced critical security vulnerabilities,particularly prompt injection attacks that manipulate model behavior through malicious instructions.Following Kitchenham’s guidelines,this systematic review synthesizes 128 peer-reviewed studies from 2022 to 2025 to provide a unified understanding of this rapidly evolving threat landscape.Our findings reveal a swift progression from simple direct injections to sophisticated multimodal attacks,achieving over 90%success rates against unprotected systems.In response,defense mechanisms show varying effectiveness:input preprocessing achieves 60%–80%detection rates and advanced architectural defenses demonstrate up to 95%protection against known patterns,though significant gaps persist against novel attack vectors.We identified 37 distinct defense approaches across three categories,but standardized evaluation frameworks remain limited.Our analysis attributes these vulnerabilities to fundamental LLM architectural limitations,such as the inability to distinguish instructions from data and attention mechanism vulnerabilities.This highlights critical research directions such as formal verification methods,standardized evaluation protocols,and architectural innovations for inherently secure LLM designs. 展开更多
关键词 Prompt injection attacks large language models defense mechanisms security evaluation
在线阅读 下载PDF
Foundation models:Insights and implications for gastrointestinal cancer
15
作者 Lei Shi Rui Huang +1 位作者 Li-Ling Zhao An-Jie Guo 《World Journal of Gastroenterology》 2025年第47期7-34,共28页
Gastrointestinal(GI)cancers represent a major global health concern due to their high incidence and mortality rates.Foundation models(FMs),also referred to as large models,represent a novel class of artificial intelli... Gastrointestinal(GI)cancers represent a major global health concern due to their high incidence and mortality rates.Foundation models(FMs),also referred to as large models,represent a novel class of artificial intelligence technologies that have demonstrated considerable potential in addressing these challenges.These models encompass large language models(LLMs),vision FMs(VFMs),and multimodal LLMs(MLLMs),all of which utilize transformer architectures and self-supervised pre-training on extensive unlabeled datasets to achieve robust cross-domain generalization.This review delineates the principal applications of these models:LLMs facilitate the structuring of clinical narratives,extraction of insights from medical records,and enhancement of physician-patient communication;VFMs are employed in the analysis of endoscopic,radiological,and pathological images for lesion detection and staging;MLLMs integrate heterogeneous data modalities,including imaging,textual information,and genomic data,to support diagnostic processes,treatment prediction,and prognostic evaluation.Despite these promising developments,several challenges remain,such as the need for data standardization,limited diversity within training datasets,substantial computational resource requirements,and ethical-legal concerns.In conclusion,FMs exhibit significant potential to advance research and clinical management of GI cancers.Future research efforts should prioritize the refinement of these models,promote international collaborations,and adopt interdisciplinary approaches.Such a comprehensive strategy is essential to fully harness the capabilities of FMs,driving substantial progress in the fight against GI malignancies. 展开更多
关键词 Foundation models Gastrointestinal cancers Large language models Vision foundation models Multimodal large language models
在线阅读 下载PDF
Machine learning models for predicting carbonation depth in fly ash concrete:performance and interpretability insights
16
作者 Arslan Qayyum Khan Syed Ghulam Muhammad +1 位作者 Ali Raza Amorn Pimanmas 《Journal of Road Engineering》 2026年第1期74-90,共17页
This study aims to develop an accurate and robust machine learning model to predict the carbonation depth of fly ash concrete,overcoming the limitations of traditional predictive methods.Five ensemble-based models,suc... This study aims to develop an accurate and robust machine learning model to predict the carbonation depth of fly ash concrete,overcoming the limitations of traditional predictive methods.Five ensemble-based models,such as adaptive boosting(AdaBoost),categorical boosting(CatBoost),gradient boosting regressor(GBR),hist gradient boosting regressor(HistGBR),and extreme gradient boosting(XGBoost),were developed and optimized using 729 high-quality dataset points incorporating seven input parameters,including cement,CO_(2),exposure time,water-binder ratio,fly ash,curing time,and compressive strength.Several performance evaluation metrics were used to compare the models.The GBR model emerged as the best-performing model,based on high coefficient of determination(R^(2))values and balanced error metrics across both validation and testing datasets.While all models performed exceptionally well on the training data,GBR demonstrated superior generalization capability,with R^(2) values of 0.9438 on the validation set and 0.9310 on the testing set.Furthermore,its low mean squared error(MSE),root mean square error(RMSE),mean absolute error(MAE),and median absolute error(MdAE)confirmed its robustness and accuracy.Moreover,shapley additive explanations(SHAP)analysis enhanced the interpretability of predictions,highlighting the curing time and exposure time as the most critical drivers of carbonation depth. 展开更多
关键词 Fly ash concrete Carbonation depth Machine learning Ensemble models SHAP analysis
在线阅读 下载PDF
Harnessing computational power for intelligent oncology in the age of large models: Status, challenges, and prospects
17
作者 Kexin Xu Yueran Xu Qing Shi 《Intelligent Oncology》 2026年第1期51-63,共13页
The integration of large-scale foundation models(e.g.,GPT series and AlphaFold)into oncology is fundamentally transforming both research methodologies and clinical practices,driven by unprecedented advancements in com... The integration of large-scale foundation models(e.g.,GPT series and AlphaFold)into oncology is fundamentally transforming both research methodologies and clinical practices,driven by unprecedented advancements in computational power.This review synthesizes recent progress in the application of large language models to core oncological tasks,including medical imaging analysis,genomic interpretation,and personalized treatment planning.Underpinned by advanced computational infrastructures,such as graphics processing unit/tensor processing unit clusters,heterogeneous computing,and cloud platforms,these models enable superior representation learning and generalization across multimodal data sources.This review examines how these infrastructures overcome key bottlenecks in intelligent oncology through scalable optimization strategies,including mixed-precision training,memory optimization,and heterogeneous computing.Alongside these technical advancements,the review explores pressing challenges,such as data heterogeneity,limited model interpretability,regulatory uncertainties,and the environmental impact of artificial intelligence(AI)systems.Special emphasis is placed on emerging solutions,encompassing green AI and edge computing,which offer promising approaches for low-resource deployment scenarios.Additionally,the review highlights the critical role of interdisciplinary collaboration among oncology,computer science,ethics,and policy to ensure that AI systems are not only powerful but also transparent,safe,and clinically relevant.Finally,the review outlines potential avenues for future research aimed at developing robust,scalable,and human-centered frameworks for intelligent oncology. 展开更多
关键词 Large language models Intelligent oncology Medical AI Computational infrastructure High-performance computing
在线阅读 下载PDF
Noisy data-driven identification for errors-in-variables MISO Hammerstein nonlinear models
18
作者 Jie Hou Haoran Wang +1 位作者 Penghua Li Hao Su 《Control Theory and Technology》 2026年第1期111-126,共16页
In this paper,we consider a multiple-input single-output(MISO)Hammerstein system whose inputs and output are disturbed by unknown Gaussian white measurement noises.The parameter estimation of such a system is a typica... In this paper,we consider a multiple-input single-output(MISO)Hammerstein system whose inputs and output are disturbed by unknown Gaussian white measurement noises.The parameter estimation of such a system is a typical errors-in-variables(EIV)nonlinear system identification problem.This paper proposes a bias-correction least squares(BCLS)identification methods to compute a consistent estimate of EIV MISO Hammerstein systems from noisy data.To obtain the unbiased parameter estimates of EIV MISO Hammerstein system,the analytical expression of estimated bias for the standard least squares(LS)algorithm is derived first,which is a function about the variances of noises.And then a recursive algorithm is proposed to estimate the unknown term of noises variances from noisy data.Finally,based on bias estimation scheme,the bias caused by the correlation between the input–output signals exciting the true system and the corresponding measurement noise,resulting in unbiased parameter estimates of the EIV MISO Hammerstein system.The performance of the proposed method is demonstrated through a simulation example and a chemical continuously stirred tank reactor(CSTR)system. 展开更多
关键词 Biased-corrected least squares ERRORS-IN-VARIABLES MISO Hammerstein models Parameter estimation System identification
原文传递
OPOR-Bench:Evaluating Large Language Models on Online Public Opinion Report Generation
19
作者 Jinzheng Yu Yang Xu +4 位作者 Haozhen Li Junqi Li Ligu Zhu Hao Shen Lei Shi 《Computers, Materials & Continua》 2026年第4期1403-1427,共25页
Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises.While large language models(LLMs)enable automated report generation,this specific domain lack... Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises.While large language models(LLMs)enable automated report generation,this specific domain lacks formal task definitions and corresponding benchmarks.To bridge this gap,we define the Automated Online Public Opinion Report Generation(OPOR-Gen)task and construct OPOR-Bench,an event-centric dataset with 463 crisis events across 108 countries(comprising 8.8 K news articles and 185 K tweets).To evaluate report quality,we propose OPOR-Eval,a novel agent-based framework that simulates human expert evaluation.Validation experiments show OPOR-Eval achieves a high Spearman’s correlation(ρ=0.70)with human judgments,though challenges in temporal reasoning persist.This work establishes an initial foundation for advancing automated public opinion reporting research. 展开更多
关键词 Online public opinion reports crisis management large language models agent-based evaluation
在线阅读 下载PDF
Predicting potential suitable areas of Orchidaceae plants with national key reserve from Heilongjiang province in MaxEnt models
20
作者 Weixue Zhong Xiaoxue Wei +6 位作者 Yujia Yu Xiaoqing Tang Ye Zhang Xinyu Huang Xiaohui Li Ying Liu Dewen Li 《Ecological Frontiers》 2026年第1期18-28,共11页
The study aimed at predicting potential suitable areas with national key reserve Orchidaceae plants in Heilongjiang province and conducive to plant protection.The distribution point data of six Orchidaceae plants and ... The study aimed at predicting potential suitable areas with national key reserve Orchidaceae plants in Heilongjiang province and conducive to plant protection.The distribution point data of six Orchidaceae plants and 19 bioclimatic variables were selected,and the environmental factors required for modeling were screened out by pearson correlation analysis and variance inflation factor(VIF)analysis.The potential suitable areas of Orchidaceae plants were predictat present and under different climate scenarios in 2090s by using geographic information system(GIS)and Maximum Entropy Model(MaxEnt).And then evaluated the prediction accuracy of the MaxEnt model using the AUC value,the TSS value and the Kappa value.The results showed that:1)The area under curve(AUC)values,true skill statistics(TSS)values and KAPPA values predicted by MaxEnt model were separately above 0.9,0.85 and 0.75.2)Under the climate scenario at present,the total suitable area of Orchidaceae plants was about 9.61×10^(6)km^(2),which was mainly distributed in Heilongjiang province.Among them,the high-suitable area of Cypripedium shanxiense S.C.Chen was the largest,the non-suitable area of Cypripedium guttatum Sw was the largest.3)Under different climate scenarios in 2090s,the total suitable area was slightly increasing(9.62×10^(6)km^(2)).Among them,Cypripedium shanxiense S.C.Chen and Gastrodiae Rhizoma both showed the trend of expansion to the southwest,China,and the suitable areas expanded significantly.Comprehensive factor analysis showed that temperature and precipitation were the main bioclimatic variables of suitable areas distribution,and the low emission scenario(SSP 2-4.5)will be more conducive to the survival of Orchidaceae plants. 展开更多
关键词 Orchidaceae plants Potential suitable areas Bioclimatic variables MaxEnt models National key reserve
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部