期刊文献+
共找到1,005,664篇文章
< 1 2 250 >
每页显示 20 50 100
Beyond Accuracy:Evaluating and Explaining the Capability Boundaries of Large Language Models in Syntax-Preserving Code Translation
1
作者 Yaxin Zhao Qi Han +1 位作者 Hui Shu Yan Guang 《Computers, Materials & Continua》 2026年第2期1371-1394,共24页
LargeLanguageModels(LLMs)are increasingly appliedinthe fieldof code translation.However,existing evaluation methodologies suffer from two major limitations:(1)the high overlap between test data and pretraining corpora... LargeLanguageModels(LLMs)are increasingly appliedinthe fieldof code translation.However,existing evaluation methodologies suffer from two major limitations:(1)the high overlap between test data and pretraining corpora,which introduces significant bias in performance evaluation;and(2)mainstream metrics focus primarily on surface-level accuracy,failing to uncover the underlying factors that constrain model capabilities.To address these issues,this paper presents TCode(Translation-Oriented Code Evaluation benchmark)—a complexity-controllable,contamination-free benchmark dataset for code translation—alongside a dedicated static feature sensitivity evaluation framework.The dataset is carefully designed to control complexity along multiple dimensions—including syntactic nesting and expression intricacy—enabling both broad coverage and fine-grained differentiation of sample difficulty.This design supports precise evaluation of model capabilities across a wide spectrum of translation challenges.The proposed evaluation framework introduces a correlation-driven analysis mechanism based on static program features,enabling predictive modeling of translation success from two perspectives:Code Form Complexity(e.g.,code length and character density)and Semantic Modeling Complexity(e.g.,syntactic depth,control-flow nesting,and type system complexity).Empirical evaluations across representative LLMs—including Qwen2.5-72B and Llama3.3-70B—demonstrate that even state-of-the-art models achieve over 80% compilation success on simple samples,but their accuracy drops sharply below 40% on complex cases.Further correlation analysis indicates that Semantic Modeling Complexity alone is correlated with up to 60% of the variance in translation success,with static program features exhibiting nonlinear threshold effects that highlight clear capability boundaries.This study departs fromthe traditional accuracy-centric evaluation paradigm and,for the first time,systematically characterizes the capabilities of large languagemodels in translation tasks through the lens of programstatic features.The findings provide actionable insights for model refinement and training strategy development. 展开更多
关键词 Large language models(LLMs) code translation compiler testing program analysis complexity-based evaluation
在线阅读 下载PDF
Integration of Large Language Models(LLMs)and Static Analysis for Improving the Efficacy of Security Vulnerability Detection in Source Code
2
作者 JoséArmando Santas Ciavatta Juan Ramón Bermejo Higuera +3 位作者 Javier Bermejo Higuera Juan Antonio Sicilia Montalvo Tomás Sureda Riera Jesús Pérez Melero 《Computers, Materials & Continua》 2026年第3期351-390,共40页
As artificial Intelligence(AI)continues to expand exponentially,particularly with the emergence of generative pre-trained transformers(GPT)based on a transformer’s architecture,which has revolutionized data processin... As artificial Intelligence(AI)continues to expand exponentially,particularly with the emergence of generative pre-trained transformers(GPT)based on a transformer’s architecture,which has revolutionized data processing and enabled significant improvements in various applications.This document seeks to investigate the security vulnerabilities detection in the source code using a range of large language models(LLM).Our primary objective is to evaluate the effectiveness of Static Application Security Testing(SAST)by applying various techniques such as prompt persona,structure outputs and zero-shot.To the selection of the LLMs(CodeLlama 7B,DeepSeek coder 7B,Gemini 1.5 Flash,Gemini 2.0 Flash,Mistral 7b Instruct,Phi 38b Mini 128K instruct,Qwen 2.5 coder,StartCoder 27B)with comparison and combination with Find Security Bugs.The evaluation method will involve using a selected dataset containing vulnerabilities,and the results to provide insights for different scenarios according to the software criticality(Business critical,non-critical,minimum effort,best effort)In detail,the main objectives of this study are to investigate if large language models outperform or exceed the capabilities of traditional static analysis tools,if the combining LLMs with Static Application Security Testing(SAST)tools lead to an improvement and the possibility that local machine learning models on a normal computer produce reliable results.Summarizing the most important conclusions of the research,it can be said that while it is true that the results have improved depending on the size of the LLM for business-critical software,the best results have been obtained by SAST analysis.This differs in“NonCritical,”“Best Effort,”and“Minimum Effort”scenarios,where the combination of LLM(Gemini)+SAST has obtained better results. 展开更多
关键词 AI+SAST secure code LLM benchmarking LLM vulnerability detection
在线阅读 下载PDF
OptoGPT code, model, and datasets made public
3
《Opto-Electronic Advances》 2025年第6期1-1,共1页
The article entitled with OptoGPT:A foundation model for inverse design in optical multilayer thin film structures1,with doi:10.29026/oea.2024.240062,published in No.7,Vol.7,2024 of Opto-Electronic Advances,has attrac... The article entitled with OptoGPT:A foundation model for inverse design in optical multilayer thin film structures1,with doi:10.29026/oea.2024.240062,published in No.7,Vol.7,2024 of Opto-Electronic Advances,has attracted attention from many researchers.As a result,the authors received many requests on the possibility sharing their code,model,and dataset in the mentioned work.To facilitate the needs of the research community,the authors decide to make the code,model,and datasets of OptoGPT public,enabling broader utilization and further development of enhanced models. 展开更多
关键词 foundation model inverse design optogpt model optical multilayer thin film structures code
在线阅读 下载PDF
An Online Judgement System Based on Code-generating Large Model 被引量:1
4
作者 Xudong Lu Zaixuan Wang +3 位作者 He Zhou Chen Yu Lizhen Cui Wei Guo 《计算机教育》 2025年第3期122-129,共8页
For computer science majors in higher education institutions,programming courses are one of the most important professional foundation courses.Proficiency in independent programming skills is of great help to the stud... For computer science majors in higher education institutions,programming courses are one of the most important professional foundation courses.Proficiency in independent programming skills is of great help to the study of subsequent courses and the personal development of students.In the teaching process of programming courses,online judgement systems are often used to improve students’programming level.Traditional online judgement systems lack guidance for students,and it is often difficult for inexperienced students to find and correct errors in their codes by themselves.We propose an online judgement system that integrates a large model of error correction to help students find errors and improve their programming skills. 展开更多
关键词 Online judgement system code-generating large model AI assistant
在线阅读 下载PDF
CoDefense:面向对抗性攻击的多粒度代码归一化防御方法
5
作者 田朝 邝仕琦 +2 位作者 闫明 王海弛 陈俊洁 《软件学报》 北大核心 2026年第3期1170-1196,共27页
近年来,以代码为输入的预训练模型在许多基于代码的关键任务中取得了显著的性能优势,但这类模型可能容易受到通过保留语义的代码转换实现的对抗性攻击,这种攻击会显著降低模型鲁棒性并可能进一步引发严重的安全问题.尽管已有对抗性训练... 近年来,以代码为输入的预训练模型在许多基于代码的关键任务中取得了显著的性能优势,但这类模型可能容易受到通过保留语义的代码转换实现的对抗性攻击,这种攻击会显著降低模型鲁棒性并可能进一步引发严重的安全问题.尽管已有对抗性训练方法通过生成对抗性样本作为增强数据来提升模型鲁棒性,但其有效性和效率在面对不同粒度和策略的未知对抗性攻击时仍显不足.为了克服这一局限性,提出一种基于代码归一化的预训练代码模型对抗性防御方法CoDefense.该方法的核心思想是作为代码模型的一个前置数据处理模块,通过多粒度代码归一化技术,对训练阶段的原始训练集和推理阶段的代码输入进行归一化预处理,以避免潜在对抗性样本对代码模型的影响.这种策略能够高效地防御不同粒度和策略的对抗性攻击.为验证CoDefense的有效性和效率,针对3种先进的对抗性攻击方法、3种流行的预训练代码模型以及3个基于代码的分类和生成任务,共设计了27个实验场景进行全面的实证研究.实验结果表明,CoDefense相较于最先进的对抗性训练方法,在防御对抗性攻击方面显著提升了有效性和效率.具体而言,CoDefense平均成功防御了95.33%的对抗性攻击.同时,在时间效率上,CoDefense相对于对抗性训练方法平均提升了85.86%. 展开更多
关键词 对抗性防御 预训练代码模型 深度学习
在线阅读 下载PDF
ASME Code Case 3029高温许用压应力计算方法的介绍及工程应用
6
作者 马忠明 《化工设备与管道》 北大核心 2026年第1期24-30,共7页
介绍了高温蠕变工况下运行的压力容器可能出现的失效模式,结合工程设计现状,指出了我国当前压力容器标准体系在确定高温蠕变工况许用压应力时存在的技术瓶颈,在此基础之上引出ASME Code Case 3029,对其适用范围、发展历程、产生背景及... 介绍了高温蠕变工况下运行的压力容器可能出现的失效模式,结合工程设计现状,指出了我国当前压力容器标准体系在确定高温蠕变工况许用压应力时存在的技术瓶颈,在此基础之上引出ASME Code Case 3029,对其适用范围、发展历程、产生背景及工程意义进行了简单的介绍,以某工程设计项目中的实际结构为例,介绍了该方法的使用过程及注意事项,并结合压力容器工程设计领域的实际需求,对我国标准体系下一步的制定或修订方向提出了展望。 展开更多
关键词 code Case 3029 蠕变屈曲 失稳 压力容器 许用应力
在线阅读 下载PDF
ChatGPT+VS Code在高中地理地图开发中的应用——以“国内人口迁移”为例
7
作者 王凌宇 白絮飞 《中国信息技术教育》 2026年第1期81-84,共4页
人工智能技术在中学地理教学中的应用是大势所趋。当前的研究主要聚焦于其作为学生的“助学者”和教师的“助教者”两大角色。然而,现有应用方式存在一定局限性:作为“助学者”,若学生使用不当可能引发依赖性,削弱其独立思考能力;作为... 人工智能技术在中学地理教学中的应用是大势所趋。当前的研究主要聚焦于其作为学生的“助学者”和教师的“助教者”两大角色。然而,现有应用方式存在一定局限性:作为“助学者”,若学生使用不当可能引发依赖性,削弱其独立思考能力;作为“助教者”,若教师生成教学设计的指令过于宽泛,结果易出现“张冠李戴”或“似是而非”等问题,需教师二次加工。相反,若教师能针对教学设计中的特定模块提供详细准确的指令,人工智能技术输出的结果将更具准确性和实用性,展现出更高研究价值。因此,本文从人工智能技术“助教者”身份出发,摒弃传统完整的教学过程设计,聚焦备课中的“地图开发”模块,采用由人工智能技术生成地图代码并通过第三方软件运行的方式,实现快速辅助教师生成所需地图的目标,提升备课效率与教学质量。 展开更多
关键词 ChatGPT VS code 人工智能技术 中学地理 地图开发
在线阅读 下载PDF
Lightweight Deep Learning Model and Novel Dataset for Restoring Damaged Barcodes and QR Codes in Logistics Applications
8
作者 Tarek Muallim Haluk Kucuk +1 位作者 Muhammet Bareket Metin Kahraman 《Computer Modeling in Engineering & Sciences》 2025年第6期3557-3581,共25页
This study introduces a lightweight deep learning model and a novel synthetic dataset designed to restore damaged one-dimensional(1D)barcodes and Quick Response(QR)codes,addressing critical challenges in logistics ope... This study introduces a lightweight deep learning model and a novel synthetic dataset designed to restore damaged one-dimensional(1D)barcodes and Quick Response(QR)codes,addressing critical challenges in logistics operations.The proposed solution leverages an efficient Pix2Pix-based framework,a type of conditional Generative Adversarial Network(GAN)optimized for image-to-image translation tasks,enabling the recovery of degraded barcodes and QR codes with minimal computational overhead.A core contribution of this work is the development of a synthetic dataset that simulates realistic damage scenarios frequently encountered in logistics environments,such as low contrast,misalignment,physical wear,and environmental interference.By training on this diverse and realistic dataset,the model demonstrates exceptional performance in restoring readability and decoding accuracy.The lightweight architecture,featuring a U-Net-based encoder-decoder with separable convolutions,ensures computational efficiency,making the approach suitable for real-time deployment on embedded and resource-constrained devices commonly used in logistics systems.Experimental results reveal significant improvements:QR code decoding ratios increased from 14%to 99%on training data and from 15%to 68%on validation data,while 1D barcode decoding ratios improved from 7%to 73%on training data and from 9%to 44%on validation data.By providing a robust,resource-efficient solution for restoring damaged barcodes and QR codes,this study offers practical advancements for enhancing the reliability of automated scanning systems in logistics operations,particularly under challenging conditions. 展开更多
关键词 BARcode quick response code RESTORATION applied deep learning
在线阅读 下载PDF
System Modeling and Deep Learning-Based Security Analysis of Uplink NOMA Relay Networks with IRS and Fountain Codes
9
作者 Phu Tran Tin Minh-Sang Van Nguyen +2 位作者 Quy-Anh Bui Agbotiname Lucky Imoize Byung-Seo Kim 《Computer Modeling in Engineering & Sciences》 2025年第8期2521-2543,共23页
Digital content such as games,extended reality(XR),and movies has been widely and easily distributed over wireless networks.As a result,unauthorized access,copyright infringement by third parties or eavesdroppers,and ... Digital content such as games,extended reality(XR),and movies has been widely and easily distributed over wireless networks.As a result,unauthorized access,copyright infringement by third parties or eavesdroppers,and cyberattacks over these networks have become pressing concerns.Therefore,protecting copyrighted content and preventing illegal distribution in wireless communications has garnered significant attention.The Intelligent Reflecting Surface(IRS)is regarded as a promising technology for future wireless and mobile networks due to its ability to reconfigure the radio propagation environment.This study investigates the security performance of an uplink Non-Orthogonal Multiple Access(NOMA)system integrated with an IRS and employing Fountain Codes(FCs).Specifically,two users send signals to the base station at separate distances.A relay receives the signal from the nearby user first and then relays it to the base station.The IRS receives the signal from the distant user and reflects it to the relay,which then sends the reflected signal to the base station.Furthermore,a malevolent eavesdropper intercepts both user and relay communications.We construct mathematical equations for Outage Probability(OP),throughput,diversity evaluation,and Interception Probability(IP),offering quantitative insights to assess system security and performance.Additionally,OP and IP are analyzed using a Deep Neural Network(DNN)model.A deeper comprehension of the security performance of the IRS-assisted NOMA systemin signal transmission is provided by Monte Carlo simulations,which are also carried out to confirm the theoretical conclusions. 展开更多
关键词 Copyright management deep neural network fountain codes intelligent reflecting surface non-orthogonal multiple access physical layer security UPLINK
在线阅读 下载PDF
VS Code软件技术在红绿彩文化网站构建中的应用研究
10
作者 李萍 杨冬梅 《办公自动化》 2026年第6期1-3,共3页
红绿彩文化作为中华民族传统文化的重要组成部分,有着悠久的历史和独特的艺术价值。其兴起打破以往以单色釉为主导的高温烧瓷的局面,在历史的长河中不断地推陈出新。然而,在现代社会中,随着文化的多元化和工业化的冲击,红绿彩文化发展... 红绿彩文化作为中华民族传统文化的重要组成部分,有着悠久的历史和独特的艺术价值。其兴起打破以往以单色釉为主导的高温烧瓷的局面,在历史的长河中不断地推陈出新。然而,在现代社会中,随着文化的多元化和工业化的冲击,红绿彩文化发展面临着诸多挑战。在此背景下,借助VS Code以及JavaScript相关技术,对红绿彩文化网站进行深化设计与研究,为构建交互型红绿彩文化网站提供借鉴,同时也将为传播红绿彩文化提供具体可行的方法。 展开更多
关键词 红绿彩文化 JAVASCRIPT VS code 网站
在线阅读 下载PDF
A decision framework for rural domestic sewage treatment models and process:Evidence from Inner Mongolia Autonomous Region,China 被引量:1
11
作者 Ying Yan Pengyu Li +5 位作者 Zixuan Wang Yubo Tan Tianlong Zheng Jianguo Liu Xiaoxia Yang Junxin Liu 《Journal of Environmental Sciences》 2026年第1期302-311,共10页
Rural domestic sewage treatment is critical for environmental protection.This study defines the spatial pattern of villages from the perspective of rural sewage treatment and develops an integrated decision-making sys... Rural domestic sewage treatment is critical for environmental protection.This study defines the spatial pattern of villages from the perspective of rural sewage treatment and develops an integrated decision-making system to propose a sewage treatment mode and scheme suitable for local conditions.By considering the village spatial layout and terrain factors,a decision tree model of residential density and terrain type was constructed with accuracies of 76.47%and 96.00%,respectively.Combined with binary classification probability unit regression,an appropriate sewage treatment mode for the village was determined with 87.00%accuracy.The Analytic Hierarchy Process(AHP),combined with the Technique for Order Preference(TOPSIS)by Similarity to an Ideal Solution model,formed the basis for optimal treatment process selection under different emission standards.Verification was conducted in 542 villages across three counties of the Inner Mongolia Autonomous Region,focusing on the standard effluent effect(0.3773),low investment cost(0.3196),and high standard effluent effect(0.5115)to determine the best treatment process for the same emission standard under different needs.The annual environmental and carbon emission benefits of sewage treatment in these villages were estimated.This model matches village density,geographic feature,and social development level,and provides scientific support and a theoretical basis for rural sewage treatment decision-making. 展开更多
关键词 Rural domestic sewage Sewage treatment model DECISION-MAKING Environmental-economic benefits Inner Mongolia
原文传递
An effective method for generating crystal structures based on the variational autoencoder and the diffusion model
12
作者 Chen Chen Jinzhou Zheng +3 位作者 Chaoqin Chu Qinkun Xiao Chaozheng He Xi Fu 《Chinese Chemical Letters》 2025年第4期461-466,共6页
Two dimensional(2D) materials based on boron and carbon have attracted wide attention due to their unique properties. BC compounds have rich active sites and diverse chemical coordination, showing great potential in o... Two dimensional(2D) materials based on boron and carbon have attracted wide attention due to their unique properties. BC compounds have rich active sites and diverse chemical coordination, showing great potential in optoelectronic applications. However, due to the limitation of calculation and experimental conditions, it is still a challenging task to predict new 2D BC monolayer materials. Specifically, we utilized Crystal Diffusion Variational Autoencoder(CDVAE) and pre-trained Materials Graph Neural Network with 3-Body Interactions(M3GNet) model to generate novel and stable BCP materials. Each crystal structure was treated as a high-dimensional vector, where the encoder extracted lattice information and element coordinates, mapping the high-dimensional data into a low-dimensional latent space. The decoder then reconstructed the latent representation back into the original data space. Additionally, our designed attribute predictor network combined the advantages of dilated convolutions and residual connections,effectively increasing the model's receptive field and learning capacity while maintaining relatively low parameter count and computational complexity. By progressively increasing the dilation rate, the model can capture features at different scales. We used the DFT data set of about 1600 BCP monolayer materials to train the diffusion model, and combined with the pre-trained M3GNet model to screen the best candidate structure. Finally, we used DFT calculations to confirm the stability of the candidate structure.The results show that the combination of generative deep learning model and attribute prediction model can help accelerate the discovery and research of new 2D materials, and provide effective methods for exploring the inverse design of new two-dimensional materials. 展开更多
关键词 Deep generative model BCP monolayer Inverse design CDVAE DFT
原文传递
Rateless Polar Codes with Unequal Error Protection Property
13
作者 Cui Chen Xiang Wei +1 位作者 Ma Siwei Guo Qing 《China Communications》 2026年第1期10-23,共14页
Mobile communications are reaching out to every aspect of our daily life,necessitating highefficiency data transmission and support for diverse data types and communication scenarios.Polar codes have emerged as a prom... Mobile communications are reaching out to every aspect of our daily life,necessitating highefficiency data transmission and support for diverse data types and communication scenarios.Polar codes have emerged as a promising solution due to their outstanding error-correction performance and low complexity.Unequal error protection(UEP)involves nonuniform error safeguarding for distinct data segments,achieving a fine balance between error resilience and resource allocation,which ultimately enhancing system performance and efficiency.In this paper,we propose a novel class of UEP rateless polar codes.The codes are designed based on matrix extension of polar codes,and elegant mapping and duplication operations are designed to achieve UEP property while preserving the overall performance of conventional polar codes.Superior UEP performance is attained without significant modifications to conventional polar codes,making it straightforward for compatibility with existing polar codes.A theoretical analysis is conducted on the block error rate and throughput efficiency performance.To the best of our knowledge,this work provides the first theoretical performance analysis of UEP rateless polar codes.Simulation results show that the proposed codes significantly outperform existing polar coding schemes in both block error rate and throughput efficiency. 展开更多
关键词 matrix extension polar codes rateless coding unequal error protection
在线阅读 下载PDF
Description and Evaluation of the Emission and Atmospheric Processes Integrated and Coupled Community(EPICC)Model Version 1.0 被引量:1
14
作者 EPICC Model Working Group 《Advances in Atmospheric Sciences》 2026年第4期671-694,I0001-I0006,共30页
We present a comprehensive description and benchmark evaluation of the global–regional chemical transport model called the Emission and Atmospheric Processes Integrated and Coupled Community(EPICC)model.The framework... We present a comprehensive description and benchmark evaluation of the global–regional chemical transport model called the Emission and Atmospheric Processes Integrated and Coupled Community(EPICC)model.The framework incorporates(1)grid configuration,(2)transport dynamics,(3)chemical mechanisms,(4)aerosol processes,(5)wet/dry deposition parameterizations,and(6)heterogeneous chemistry treatments associated with sulfate,nitrous acid(HONO)chemistry,and aerosol/cloud–photolysis interactions(APIs/CPIs).Openly shared with the atmospheric research community,the model facilitates integration of advanced physicochemical schemes to enhance simulation accuracy.Globally,the model demonstrates realistic representations of ozone(O_(3))and aerosol optical depth.The EPICC model generally demonstrates robust performance in simulating regional concentrations of O_(3) and PM_(2.5)(and its components)in China.It successfully captures vertical profiles of both global and regional O_(3).Notably,the model mitigates frequently reported sulfate underestimations in highly industrialized regions of China.The model accurately captures two regional severe pollution episodes observed in eastern China(January/June 2021).Sensitivity experiments highlight the critical roles of heterogeneous chemical mechanisms associated with sulfate,HONO chemistry,APIs,and CPIs in capturing PM_(2.5) and O_(3) concentrations in China.Improved sulfate mechanisms result in an increase of approximately 32.4%(2.8μg m^(−3))in simulated winter sulfate concentrations when observations exceed 10μg m^(−3).Enhanced HONO elevates winter O_(3) and PM_(2.5) by≤20 and≤10μg m^(−3),respectively.Overall,CPIs dominate over APIs in improving O_(3) and PM_(2.5) simulations across China.Locally,APIs mitigate PM_(2.5) and O_(3) discrepancies in the Sichuan Basin.Seasonal cloud–chemistry coupling explains the weaker impact of PM_(2.5) in summer. 展开更多
关键词 EPICC model PM_(2.5) O_(3) sulfate nitrous acid aerosol/cloud-photolysis interactions
在线阅读 下载PDF
Multi-Head Encoder Shared Model Integrating Intent and Emotion for Dialogue Summarization
15
作者 Xinlai Xing Junliang Chen +2 位作者 Xiaochuan Zhang Shuran Zhou Runqing Zhang 《Computers, Materials & Continua》 2025年第2期2275-2292,共18页
In task-oriented dialogue systems, intent, emotion, and actions are crucial elements of user activity. Analyzing the relationships among these elements to control and manage task-oriented dialogue systems is a challen... In task-oriented dialogue systems, intent, emotion, and actions are crucial elements of user activity. Analyzing the relationships among these elements to control and manage task-oriented dialogue systems is a challenging task. However, previous work has primarily focused on the independent recognition of user intent and emotion, making it difficult to simultaneously track both aspects in the dialogue tracking module and to effectively utilize user emotions in subsequent dialogue strategies. We propose a Multi-Head Encoder Shared Model (MESM) that dynamically integrates features from emotion and intent encoders through a feature fusioner. Addressing the scarcity of datasets containing both emotion and intent labels, we designed a multi-dataset learning approach enabling the model to generate dialogue summaries encompassing both user intent and emotion. Experiments conducted on the MultiWoZ and MELD datasets demonstrate that our model effectively captures user intent and emotion, achieving extremely competitive results in dialogue state tracking tasks. 展开更多
关键词 Dialogue summaries dialogue state tracking emotion recognition task-oriented dialogue system pre-trained language model
在线阅读 下载PDF
Agri-Eval:Multi-level Large Language Model Valuation Benchmark for Agriculture
16
作者 WANG Yaojun GE Mingliang +2 位作者 XU Guowei ZHANG Qiyu BIE Yuhui 《农业机械学报》 北大核心 2026年第1期290-299,共10页
Model evaluation using benchmark datasets is an important method to measure the capability of large language models(LLMs)in specific domains,and it is mainly used to assess the knowledge and reasoning abilities of LLM... Model evaluation using benchmark datasets is an important method to measure the capability of large language models(LLMs)in specific domains,and it is mainly used to assess the knowledge and reasoning abilities of LLMs.Therefore,in order to better assess the capability of LLMs in the agricultural domain,Agri-Eval was proposed as a benchmark for assessing the knowledge and reasoning ability of LLMs in agriculture.The assessment dataset used in Agri-Eval covered seven major disciplines in the agricultural domain:crop science,horticulture,plant protection,animal husbandry,forest science,aquaculture science,and grass science,and contained a total of 2283 questions.Among domestic general-purpose LLMs,DeepSeek R1 performed best with an accuracy rate of 75.49%.In the realm of international general-purpose LLMs,Gemini 2.0 pro exp 0205 standed out as the top performer,achieving an accuracy rate of 74.28%.As an LLMs in agriculture vertical,Shennong V2.0 outperformed all the LLMs in China,and the answer accuracy rate of agricultural knowledge exceeded that of all the existing general-purpose LLMs.The launch of Agri-Eval helped the LLM developers to comprehensively evaluate the model's capability in the field of agriculture through a variety of tasks and tests to promote the development of the LLMs in the field of agriculture. 展开更多
关键词 large language models assessment systems agricultural knowledge agricultural datasets
在线阅读 下载PDF
Ecological Dynamics of a Logistic Population Model with Impulsive Age-selective Harvesting
17
作者 DAI Xiangjun JIAO Jianjun 《应用数学》 北大核心 2026年第1期72-79,共8页
In this paper,we establish and study a single-species logistic model with impulsive age-selective harvesting.First,we prove the ultimate boundedness of the solutions of the system.Then,we obtain conditions for the asy... In this paper,we establish and study a single-species logistic model with impulsive age-selective harvesting.First,we prove the ultimate boundedness of the solutions of the system.Then,we obtain conditions for the asymptotic stability of the trivial solution and the positive periodic solution.Finally,numerical simulations are presented to validate our results.Our results show that age-selective harvesting is more conducive to sustainable population survival than non-age-selective harvesting. 展开更多
关键词 The logistic population model Selective harvesting Asymptotic stability EXTINCTION
在线阅读 下载PDF
Special Issue“Recent Developments in Dimension Reduction and Model Checking”——In Honor of Professor Lixing Zhu's Outstanding Contributions in Statistics
18
作者 ZHU Liping XU Wangli LI Yingxing 《Journal of Systems Science & Complexity》 2026年第1期1-2,共2页
The proliferation of high-dimensional data and the widespread use of complex models present central challenges in contemporary statistics and data science.Dimension reduction and model checking,as two foundational pil... The proliferation of high-dimensional data and the widespread use of complex models present central challenges in contemporary statistics and data science.Dimension reduction and model checking,as two foundational pillars supporting scientific inference and data-driven decisionmaking,have evolved through the collective wisdom of generations of statisticians.This special issue,titled"Recent Developments in Dimension Reduction and Model Checking for regressions",not only aims to showcase cutting-edge advances in the field but also carries a distinct sense of academic homage to honor the groundbreaking and enduring contributions of Professor Lixing Zhu,a leading scholar whose work has profoundly shaped both areas. 展开更多
关键词 scientific inference model checking model checkingas complex models dimension reduction high dimensional data
原文传递
Impact of flex power on inter-and intra-differential code bias variation
19
作者 Jiayu Li Yan Xiang +1 位作者 Chengeng Su Xiaolin Ji 《Geodesy and Geodynamics》 2026年第2期154-167,共14页
Differential Code Bias(DCB)is the time delays between two different GNSS signals,which is crucial for GNSS positioning.Previous studies have shown that it can be significantly affected by the flex power operations in ... Differential Code Bias(DCB)is the time delays between two different GNSS signals,which is crucial for GNSS positioning.Previous studies have shown that it can be significantly affected by the flex power operations in satellites.This study proposes a 15-min short-term DCB estimation method to analyze flex power's impact on DCB variations.The method jointly estimates satellite DCB,receiver DCB,and ionospheric parameters using over 300 MGEX stations.We examined three representative flex power events in 2024,achieving average internal RMS values of 0.042 ns and 0.0068 ns for inter-frequency and intra-frequency scenarios respectively.Results show that intra-frequency DCB exhibits clear shift biases synchronized with flex power state transitions while maintaining stability within 0.20 ns during nontransition periods.No definitive impact on inter-frequency DCB was observed at current estimation precision levels. 展开更多
关键词 Differential code bias Flex power GNSS
原文传递
Ecosystem service models are indeed being validated:A response to Pereira et al.(2025)
20
作者 James M.Bullock Danny A.P.Hooftman +1 位作者 John W.Redhead Simon Willcock 《Geography and Sustainability》 2026年第1期247-248,共2页
In their recent paper Pereira et al.(2025)claim that validation is overlooked in mapping and modelling of ecosystem services(ES).They state that“many studies lack critical evaluation of the results and no validation ... In their recent paper Pereira et al.(2025)claim that validation is overlooked in mapping and modelling of ecosystem services(ES).They state that“many studies lack critical evaluation of the results and no validation is provided”and that“the validation step is largely overlooked”.This assertion may have been true several years ago,for example,when Ochoa and Urbina-Cardona(2017)made a similar observation.However,there has been much work on ES model validation over the last decade. 展开更多
关键词 evaluation MAPPING modeling es model ecosystem services VALIDATION
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部