期刊文献+
共找到349篇文章
< 1 2 18 >
每页显示 20 50 100
Contribution of the MERISE-Type Conceptual Data Model to the Construction of Monitoring and Evaluation Indicators of the Effectiveness of Training in Relation to the Needs of the Labor Market in the Republic of Congo
1
作者 Roch Corneille Ngoubou Basile Guy Richard Bossoto Régis Babindamana 《Open Journal of Applied Sciences》 2024年第8期2187-2200,共14页
This study proposes the use of the MERISE conceptual data model to create indicators for monitoring and evaluating the effectiveness of vocational training in the Republic of Congo. The importance of MERISE for struct... This study proposes the use of the MERISE conceptual data model to create indicators for monitoring and evaluating the effectiveness of vocational training in the Republic of Congo. The importance of MERISE for structuring and analyzing data is underlined, as it enables the measurement of the adequacy between training and the needs of the labor market. The innovation of the study lies in the adaptation of the MERISE model to the local context, the development of innovative indicators, and the integration of a participatory approach including all relevant stakeholders. Contextual adaptation and local innovation: The study suggests adapting MERISE to the specific context of the Republic of Congo, considering the local particularities of the labor market. Development of innovative indicators and new measurement tools: It proposes creating indicators to assess skills matching and employer satisfaction, which are crucial for evaluating the effectiveness of vocational training. Participatory approach and inclusion of stakeholders: The study emphasizes actively involving training centers, employers, and recruitment agencies in the evaluation process. This participatory approach ensures that the perspectives of all stakeholders are considered, leading to more relevant and practical outcomes. Using the MERISE model allows for: • Rigorous data structuring, organization, and standardization: Clearly defining entities and relationships facilitates data organization and standardization, crucial for effective data analysis. • Facilitation of monitoring, analysis, and relevant indicators: Developing both quantitative and qualitative indicators helps measure the effectiveness of training in relation to the labor market, allowing for a comprehensive evaluation. • Improved communication and common language: By providing a common language for different stakeholders, MERISE enhances communication and collaboration, ensuring that all parties have a shared understanding. The study’s approach and contribution to existing research lie in: • Structured theoretical and practical framework and holistic approach: The study offers a structured framework for data collection and analysis, covering both quantitative and qualitative aspects, thus providing a comprehensive view of the training system. • Reproducible methodology and international comparison: The proposed methodology can be replicated in other contexts, facilitating international comparison and the adoption of best practices. • Extension of knowledge and new perspective: By integrating a participatory approach and developing indicators adapted to local needs, the study extends existing research and offers new perspectives on vocational training evaluation. 展开更多
关键词 MERISE Conceptual data model (MCD) Monitoring Indicators Evaluation of training Effectiveness training-Employment Adequacy Labor Market Information Systems Analysis Adjustment of training Programs EMPLOYABILITY Professional Skills
在线阅读 下载PDF
Efficient deep-learning-based surrogate model for reservoir production optimization using transfer learning and multi-fidelity data
2
作者 Jia-Wei Cui Wen-Yue Sun +2 位作者 Hoonyoung Jeong Jun-Rong Liu Wen-Xin Zhou 《Petroleum Science》 2025年第4期1736-1756,共21页
In the realm of subsurface flow simulations,deep-learning-based surrogate models have emerged as a promising alternative to traditional simulation methods,especially in addressing complex optimization problems.However... In the realm of subsurface flow simulations,deep-learning-based surrogate models have emerged as a promising alternative to traditional simulation methods,especially in addressing complex optimization problems.However,a significant challenge lies in the necessity of numerous high-fidelity training simulations to construct these deep-learning models,which limits their application to field-scale problems.To overcome this limitation,we introduce a training procedure that leverages transfer learning with multi-fidelity training data to construct surrogate models efficiently.The procedure begins with the pre-training of the surrogate model using a relatively larger amount of data that can be efficiently generated from upscaled coarse-scale models.Subsequently,the model parameters are finetuned with a much smaller set of high-fidelity simulation data.For the cases considered in this study,this method leads to about a 75%reduction in total computational cost,in comparison with the traditional training approach,without any sacrifice of prediction accuracy.In addition,a dedicated well-control embedding model is introduced to the traditional U-Net architecture to improve the surrogate model's prediction accuracy,which is shown to be particularly effective when dealing with large-scale reservoir models under time-varying well control parameters.Comprehensive results and analyses are presented for the prediction of well rates,pressure and saturation states of a 3D synthetic reservoir system.Finally,the proposed procedure is applied to a field-scale production optimization problem.The trained surrogate model is shown to provide excellent generalization capabilities during the optimization process,in which the final optimized net-present-value is much higher than those from the training data ranges. 展开更多
关键词 Subsurface flow simulation Surrogate model Transfer learning Multi-fidelity training data Production optimization
原文传递
Educational and research utility of the registrar clinical encounters in training(ReCEnT)project:an exploration of mechanisms using the context,input,process and product(CIPP)framework
3
作者 Michael Tran Susan Wearne +1 位作者 Andrew Davey Parker Magin 《Family Medicine and Community Health》 2025年第3期72-80,共9页
Background The Registrar Clinical Encounters in Training(ReCEnT)project is an Australian general practice vocational training programme with integrated and interdependent education and research functions.Trainees(regi... Background The Registrar Clinical Encounters in Training(ReCEnT)project is an Australian general practice vocational training programme with integrated and interdependent education and research functions.Trainees(registrars)contemporaneously document in-consultation clinical experience and actions.Objectives Using a realist lens,we elucidate the mechanisms underpinning project outcomes to answer questions around programme effectiveness,impacts,sustainability and the lessons and findings that are translatable to other primary care training programmes.Methods The context,input,process and product framework was used.As a means to understand the interactions between each of the interdependent components,it allows for inferences regarding causal mechanisms for specific outcomes.Results Context:ReCEnT occurs within an apprenticeship-like model of general practice vocational training entailing a central supervisor/apprentice relationship.ReCEnT has demystified the content and characteristics of registrar consultations.Input:multiple stakeholder involvement is both advantageous and a logistical challenge,with the programme’s success dependent on registrars,practices and training providers providing detailed and accurate data,with prompt subsequent processing.Process:contemporaneous consultation data collection in different stages of training constitutes a component of registrars’programmatic assessment.Product:individualised feedback provides educational benefit through reflection.Clinical and educational research questions can be addressed with resulting research translation feeding back into the programme model and government policy.Clinical behaviour change is also evaluated.Conclusion ReCEnT is unique,globally,in its scope and longevity(2010–present).Creation of meaningful,individualised feedback facilitates reflection and provides both immediate educational benefits and the substrate for further research,programme and policy design and targeted formal teaching and learning. 展开更多
关键词 training program data collection context input process product CIPP framework registrar clinical encounters feedback apprenticeship model realist lens elucidate mechanisms underpinning
原文传递
Trend Analysis of Large-Scale Twitter Data Based on Witnesses during a Hazardous Event: A Case Study on California Wildfire Evacuation
4
作者 Syed A. Morshed Khandakar Mamun Ahmed +1 位作者 Kamar Amine Kazi Ashraf Moinuddin 《World Journal of Engineering and Technology》 2021年第2期229-239,共11页
Social media data created a paradigm shift in assessing situational awareness during a natural disaster or emergencies such as wildfire, hurricane, tropical storm etc. Twitter as an emerging data source is an effectiv... Social media data created a paradigm shift in assessing situational awareness during a natural disaster or emergencies such as wildfire, hurricane, tropical storm etc. Twitter as an emerging data source is an effective and innovative digital platform to observe trend from social media users’ perspective who are direct or indirect witnesses of the calamitous event. This paper aims to collect and analyze twitter data related to the recent wildfire in California to perform a trend analysis by classifying firsthand and credible information from Twitter users. This work investigates tweets on the recent wildfire in California and classifies them based on witnesses into two types: 1) direct witnesses and 2) indirect witnesses. The collected and analyzed information can be useful for law enforcement agencies and humanitarian organizations for communication and verification of the situational awareness during wildfire hazards. Trend analysis is an aggregated approach that includes sentimental analysis and topic modeling performed through domain-expert manual annotation and machine learning. Trend analysis ultimately builds a fine-grained analysis to assess evacuation routes and provide valuable information to the firsthand emergency responders<span style="font-family:Verdana;">.</span> 展开更多
关键词 WILDFIRE EVACUATION TWITTER large-scale data Topic model Sentimental Analysis Trend Analysis
在线阅读 下载PDF
Inverse Estimation on Trigger Factors of Simultaneous Slope Failures with Purification of Training Data Sets
5
作者 Hirohito Kojima Ryo Sekine +1 位作者 Tomoya Yoshida Ryo Nozaki 《Journal of Earth Science and Engineering》 2013年第9期594-602,共9页
This paper presents an procedure for purifying training data sets (i.e., past occurrences of slope failures) for inverse estimation on unobserved trigger factors of "different types of simultaneous slope failures"... This paper presents an procedure for purifying training data sets (i.e., past occurrences of slope failures) for inverse estimation on unobserved trigger factors of "different types of simultaneous slope failures". Due to difficulties in pixel-by-pixel observations of trigger factors, as one of the measures, the authors had proposed an inverse analysis algorithm on trigger factors based on SEM (structural equation modeling). Through a measurement equation, the trigger factor is inversely estimated, and a TFI (trigger factor influence) map can be also produced. As a subsequence subject, a purification procedure of training data set should be constructed to improve the accuracy of TFI map which depends on the representativeness of given training data sets of different types of slope failures. The proposed procedure resamples the matched pixels between original groups of past slope failures (i.e., surface slope failures, deep-seated slope failures, landslides) and classified three groups by K-means clustering for all pixels corresponding to those slope failures. For all cases of three types of slope failures, the improvement of success rates with respect to resampled training data sets was confirmed. As a final outcome, the differences between TFI maps produced by using original and resampled training data sets, respectively, are delineated on a DIF map (difference map) which is useful for analyzing trigger factor influence in terms of "risky- and safe-side assessment" sub-areas with respect to "different types of simultaneous slope failures". 展开更多
关键词 Purification of training data simultaneous slope failures inverse analysis of unobserved trigger factor spatial data integration structural equation modeling.
在线阅读 下载PDF
Product Data Model for Performance-driven Design 被引量:2
6
作者 Guang-Zhong Hu Xin-Jian Xu +2 位作者 Shou-Ne Xiao Guang-Wu Yang Fan Pu 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2017年第5期1112-1122,共11页
When designing large-sized complex machinery products, the design focus is always on the overall per- formance; however, there exist no design theory and method based on performance driven. In view of the defi- ciency... When designing large-sized complex machinery products, the design focus is always on the overall per- formance; however, there exist no design theory and method based on performance driven. In view of the defi- ciency of the existing design theory, according to the performance features of complex mechanical products, the performance indices are introduced into the traditional design theory of "Requirement-Function-Structure" to construct a new five-domain design theory of "Client Requirement-Function-Performance-Structure-Design Parameter". To support design practice based on this new theory, a product data model is established by using per- formance indices and the mapping relationship between them and the other four domains. When the product data model is applied to high-speed train design and combining the existing research result and relevant standards, the corresponding data model and its structure involving five domains of high-speed trains are established, which can provide technical support for studying the relationships between typical performance indices and design parame- ters and the fast achievement of a high-speed train scheme design. The five domains provide a reference for the design specification and evaluation criteria of high speed train and a new idea for the train's parameter design. 展开更多
关键词 Complex product design Performance driven data model Mapping relationship High-speed train
在线阅读 下载PDF
铁路列车群运行多智能体感知模型与仿真
7
作者 骆晖 《铁道运输与经济》 北大核心 2026年第1期141-150,共10页
为探讨铁路高精度与智能化运行仿真,研究铁路工程数据驱动建模与列车群多智能体自主感知仿真理论与方法。首先以工程勘察设计数据驱动生成线路等矢量数据模型,构建轨道区段、信号机、道岔、列车等智能体模型;其次研究单列车自主感知控... 为探讨铁路高精度与智能化运行仿真,研究铁路工程数据驱动建模与列车群多智能体自主感知仿真理论与方法。首先以工程勘察设计数据驱动生成线路等矢量数据模型,构建轨道区段、信号机、道岔、列车等智能体模型;其次研究单列车自主感知控制模型的构建与运行;最后通过构建CTC智能体实现数据感知与处理分析、列车群运行状态的动态监控与调度,完成列车群自主仿真运行。仿真实验结果表明,在CTC智能体的智能监测和决策下,单列车及列车群模型可实现安全、高效地仿真运行。研究通过数据驱动建模,解决传统仿真系统模型精度不足、建模效率低下的问题,通过CTC智能体集中控制,实现列车群的协同仿真与自主决策,为构建自主化、智能化的铁路运输仿真系统提供了理论支撑和技术路径,为铁路线路及车站设计、能力评估提供高可信度仿真工具。 展开更多
关键词 数据驱动建模 铁路运行仿真 列车群多智能体 CTC智能体 自主感知控制
在线阅读 下载PDF
高精度机械实训装置的误差分析与补偿策略研究
8
作者 杜少华 《自动化应用》 2026年第1期107-109,共3页
针对现代机械实训装置日趋追求高精度与高动态响应的需求,基于误差理论和现代控制方法,对高精度机械实训装置中存在的各类误差进行了系统分析,构建了多层级误差模型,并提出了一种基于自适应模型与数据反馈融合的补偿策略。通过理论推导... 针对现代机械实训装置日趋追求高精度与高动态响应的需求,基于误差理论和现代控制方法,对高精度机械实训装置中存在的各类误差进行了系统分析,构建了多层级误差模型,并提出了一种基于自适应模型与数据反馈融合的补偿策略。通过理论推导、数学建模和实验验证,揭示了机理、环境及随机因素在误差形成中的作用机理,利用误差传播公式和补偿算法实现了误差的实时在线校正。实验结果表明,该策略能将装置的定位精度提高近90%,具有较好的应用前景,以期为高精度机电系统的误差控制提供一定理论与实践支持。 展开更多
关键词 机械实训装置 误差分析 补偿策略 自适应模型 数据反馈
在线阅读 下载PDF
基于YOLOv5的安全帽佩戴检测系统研究
9
作者 余琴 唐俊 叶杨 《现代信息科技》 2026年第3期159-164,共6页
随着城市化进程的加速与建筑行业的蓬勃发展,建筑工地的安全管理问题愈发突出。传统安全监管方式效率低下,难以适应复杂多变的施工环境。为此,文章提出一种基于YOLOv5算法的工地安全帽佩戴检测系统,旨在依托深度学习技术,自动识别工人... 随着城市化进程的加速与建筑行业的蓬勃发展,建筑工地的安全管理问题愈发突出。传统安全监管方式效率低下,难以适应复杂多变的施工环境。为此,文章提出一种基于YOLOv5算法的工地安全帽佩戴检测系统,旨在依托深度学习技术,自动识别工人的安全帽佩戴状态,进而提升施工现场安全管理的智能化水平。该系统采用Python语言开发,集成图片检测、实时视频流监测、历史录像检测等功能模块,具备友好的用户界面与便捷的操作体验。通过数据标注、模型训练与预测,实验结果表明,该系统在复杂施工环境中能准确识别安全帽佩戴情况,能有效提升施工现场的安全监管水平。 展开更多
关键词 YOLOv5 安全帽检测 数据标注 模型训练 模型预测
在线阅读 下载PDF
基于大数据分析与预训练模型的脑血管病教学反馈系统构建
10
作者 李旻 姚雪帆 宋珏娴 《中国毕业后医学教育》 2026年第2期136-139,共4页
随着信息技术和人工智能的快速发展,预训练模型和大数据分析技术逐渐被应用于教学反馈系统的优化。该文探讨如何构建一个基于大数据分析和预训练模型的脑血管病教学反馈系统,通过收集、分析和反馈教学过程中生成的大量数据,以及利用预... 随着信息技术和人工智能的快速发展,预训练模型和大数据分析技术逐渐被应用于教学反馈系统的优化。该文探讨如何构建一个基于大数据分析和预训练模型的脑血管病教学反馈系统,通过收集、分析和反馈教学过程中生成的大量数据,以及利用预训练模型的自然语言处理能力,提升教学质量和学员学习效果。通过对学习行为、理论考试成绩、技能操作成绩和临床实习表现等多维度数据的挖掘,该文提出了一种智能化、动态化结合预训练模型的反馈系统模型,并分析了该系统在教学中的应用价值。 展开更多
关键词 大数据分析 脑血管病 预训练模型 智能化教学
在线阅读 下载PDF
融合预训练语言模型的冠心病专病库建设及应用
11
作者 薛扬 侯旭敏 《软件导刊》 2026年第1期32-38,共7页
冠心病专病库的数据处理效率和准确性在临床研究与决策中发挥着至关重要的作用。因此,建设一个高效、准确的专病库是十分必要的,可支持临床研究者快速获取关键信息、优化治疗决策,从而提升患者的整体护理质量。基于Clinical-BERT+Bi-LST... 冠心病专病库的数据处理效率和准确性在临床研究与决策中发挥着至关重要的作用。因此,建设一个高效、准确的专病库是十分必要的,可支持临床研究者快速获取关键信息、优化治疗决策,从而提升患者的整体护理质量。基于Clinical-BERT+Bi-LSTM+CRF模型,结合数据平台与企业服务总线(ESB)对专病库数据处理进行优化。实验结果表明,数据抽取时间平均缩短了36倍(t=115.96,P<0.01),结构化数据的准确率提高了6.9%(χ2=222.41,P<0.01),说明这一优化能够有效提升冠心病专病库数据处理的效率和准确性,为冠心病的临床研究和决策提供了可靠的数据支持。 展开更多
关键词 冠心病 专病库建设 数据处理 预训练模型
在线阅读 下载PDF
基于数据选择和异构训练的大模型训练优化算法
12
作者 钟磊 陈志敏 +1 位作者 陈竹青 高翔 《计算机工程与设计》 北大核心 2026年第1期165-170,共6页
针对现有大模型训练优化算法只关注筛选训练数据或增大模型规模,仅从单一方面进行优化的问题,提出一种基于数据选择和异构训练的大模型训练优化算法。基于LDA模型对原始数据集和目标数据集进行建模,计算重要性得分并据此筛选出高质量的... 针对现有大模型训练优化算法只关注筛选训练数据或增大模型规模,仅从单一方面进行优化的问题,提出一种基于数据选择和异构训练的大模型训练优化算法。基于LDA模型对原始数据集和目标数据集进行建模,计算重要性得分并据此筛选出高质量的训练数据集;根据训练损失选择部分数据进行反向传播,并将模型训练中的参数更新阶段卸载到CPU中进行,对训练时间和模型规模进行优化,最终完成大模型的训练。实验结果表明,与DSIR和PyTorch相比,所提算法在牺牲少量EM值和F1值的情况下,将模型训练时间平均缩短了20.5%,模型大小增大将近两倍,验证了算法的有效性。 展开更多
关键词 大语言模型 大模型训练优化 深度学习 数据选择 主题模型 异构训练 选择性反向传播
在线阅读 下载PDF
模型分野:生成式人工智能数据训练版权规制的类型化进路
13
作者 李安阳 《四川师范大学学报(社会科学版)》 北大核心 2026年第2期52-62,200,201,共13页
当下关于生成式人工智能(GenAI)数据训练版权规制的研究,普遍建立在技术阶段区分的基础之上,缺乏基于模型类型的探讨。随着自定义模型在内容生成中的重要性日益提升,其与基础模型在数据来源与规模、训练目标与生成影响、配置方式与传播... 当下关于生成式人工智能(GenAI)数据训练版权规制的研究,普遍建立在技术阶段区分的基础之上,缺乏基于模型类型的探讨。随着自定义模型在内容生成中的重要性日益提升,其与基础模型在数据来源与规模、训练目标与生成影响、配置方式与传播范式等方面均存在较大差异,亟须采取有针对性的规制方案来平衡版权保护与技术创新之间的结构张力。GenAI数据训练版权规制类型化进路,可在模型类型区分的基础上,对人工智能数据训练行为采取差异化的规制路径,进一步细化人工智能版权规制的颗粒度,平衡版权保护与技术进步需求之间的张力。基础模型以宽松规制为主,明确其训练行为为合理使用,通过设定主体义务、设立“人工智能发展税”以降低对版权人权益的损害;自定义模型则以严格规制为主,不作合理使用的例外规定,仅在特定情形使用法定许可,通过系统设置相关主体的义务,降低对版权人利益直接侵害的可能性。 展开更多
关键词 生成式人工智能 数据训练 版权规制 基础模型 自定义模型
在线阅读 下载PDF
Quantifying the impact of dust retention on maize canopy spectral reflectance and vegetation indices in dust belt regions:A case study in southern Xinjiang,China
14
作者 MA Baodong GAO Shuxian +2 位作者 KANG Ting CHE Defu SHU Yang 《Journal of Arid Land》 2026年第1期101-130,共30页
Sand dust belts span approximately one-fifth of the global land surface.In these regions,dust tends to settle on vegetation surfaces,altering the observed reflectance and affecting remote sensing detections.To enhance... Sand dust belts span approximately one-fifth of the global land surface.In these regions,dust tends to settle on vegetation surfaces,altering the observed reflectance and affecting remote sensing detections.To enhance the accuracy of maize growth monitoring in dust-affected regions,this study aims to quantify the effect of sand dust retention on maize during the tasseling stage in the Kashgar Prefecture,Xinjiang Uygur Autonomous Region,China,by analyzing changes in canopy reflectance and vegetation indices.First,field sampling was conducted to measure the key canopy structure parameters and dust retention levels of maize,and laboratory spectral measurements were performed on leaf spectral properties under gradient dust retention.The measured data were then used to drive the LargE-Scale remote sensing data and image Simulation framework(LESS)model for simulating realistic maize canopy spectra across different dust levels,with validation against Sentinel-2 imagery.Second,on the basis of the simulated and satellite-derived spectra,the dust resistance of 36 common vegetation indices was systematically evaluated,and new robust dust-resistant indices were developed.The results showed that compared with dust-free maize,the canopy reflectance of dust-retained maize followed an increase–decrease–increase pattern,with critical turning points at 735 and 1325 nm.The maximum reflectance difference of–0.11755(change rate:29.002%)occurred within the 735–1325 nm range at 24 g/m^(2)dust retention,and the minimum reflectance difference of 0.04285(change rate:148.950%)was observed in the 350–735 nm range under the same dust retention level.Among the 36 vegetation indices,only the global environment monitoring index(GEMI)and the ratio of transformed chlorophyll absorption in reflectance index to optimized soil-adjusted vegetation index(TCARI/OSAVI)exhibited dust resistance,with GEMI being effective below 6 g/m^(2)and TCARI/OSAVI remaining stable across all levels(average ratio:0.970).The newly developed indices in this study,(RE3–RE2)/(NIR–RE2),(RE3–RE2)/(RE4–RE2),and(NIR–RE2)/(RE4–RE2),retained values within the predefined dust-resistant range over the full dust retention levels of 0–24 g/m^(2),thus showing a more stable dust resistance compared with the commonly used 36 vegetation indices.Specially,(RE3–RE2)/(RE4–RE2)performed the most robustly in Sentinel-2 imagery,that is,58.020%of pixels were within the dust-resistant range,and an average ratio of 0.937 was obtained for the original-spectra index.This study provides a scientific basis for crop monitoring and management in dust-affected regions. 展开更多
关键词 sand dust retention canopy spectral reflectance large-scale remote sensing data and image Simulation framework(LESS)model dust-resistant vegetation indices tasseling-stage maize Sentinel-2 imagery
在线阅读 下载PDF
农机化技术在农业种植业的推广效益研究
15
作者 任鹏 李红舒 《中国农机装备》 2026年第1期110-113,共4页
为了构建农机化技术推广的结构化路径,采用作业参数包建模、设备运行数据采集与服务链路协同分析的方法,对作业组织、参数设定与推广执行的关键机制进行系统解析,研究影响推广效益的技术与流程要素。推广体系需依托数据链式调度、机具... 为了构建农机化技术推广的结构化路径,采用作业参数包建模、设备运行数据采集与服务链路协同分析的方法,对作业组织、参数设定与推广执行的关键机制进行系统解析,研究影响推广效益的技术与流程要素。推广体系需依托数据链式调度、机具状态量化与场景化培训实现作业全流程的参数一致性与技术可复制性,为推进农业生产机械化提供可执行的技术框架。 展开更多
关键词 农机化推广 参数包建模 作业数据链 技术培训体系 效益分析
在线阅读 下载PDF
基于伪标签的二阶段时序半监督学习框架
16
作者 彭鸿鑫 骆淑云 罗志一 《电子科技》 2026年第2期9-18,共10页
针对部分场景下时序分类问题中标签数据稀缺问题,文中提出了一种基于伪标签的二阶段时序半监督学习框架。在第1阶段,利用对比学习进行训练,构建基分类模型,并对无标签数据进行类别标记。在第2阶段,借助合适的伪标签技术对模型进行再训练... 针对部分场景下时序分类问题中标签数据稀缺问题,文中提出了一种基于伪标签的二阶段时序半监督学习框架。在第1阶段,利用对比学习进行训练,构建基分类模型,并对无标签数据进行类别标记。在第2阶段,借助合适的伪标签技术对模型进行再训练,以充分利用标签数据和无标签数据之间的紧密关联来提升模型性能。在多个公开时序分类数据集进行实验来验证所提框架的有效性,并对不同第2阶段伪标签训练方法的适用条件进行深入探讨。实验结果表明,在标签数据比例仅为1%和5%的情况下,所提学习框架在两个基模型和多个数据集上的准确率平均提升了约5.1%和3.5%,充分证明了所提方法能够有效解决半监督时序分类问题。 展开更多
关键词 半监督分类 时序数据 学习框架 伪标签技术 二阶段训练 对比学习 预训练 模型微调
在线阅读 下载PDF
A Support Data-Based Core-Set Selection Method for Signal Recognition 被引量:1
17
作者 Yang Ying Zhu Lidong Cao Changjie 《China Communications》 SCIE CSCD 2024年第4期151-162,共12页
In recent years,deep learning-based signal recognition technology has gained attention and emerged as an important approach for safeguarding the electromagnetic environment.However,training deep learning-based classif... In recent years,deep learning-based signal recognition technology has gained attention and emerged as an important approach for safeguarding the electromagnetic environment.However,training deep learning-based classifiers on large signal datasets with redundant samples requires significant memory and high costs.This paper proposes a support databased core-set selection method(SD)for signal recognition,aiming to screen a representative subset that approximates the large signal dataset.Specifically,this subset can be identified by employing the labeled information during the early stages of model training,as some training samples are labeled as supporting data frequently.This support data is crucial for model training and can be found using a border sample selector.Simulation results demonstrate that the SD method minimizes the impact on model recognition performance while reducing the dataset size,and outperforms five other state-of-the-art core-set selection methods when the fraction of training sample kept is less than or equal to 0.3 on the RML2016.04C dataset or 0.5 on the RML22 dataset.The SD method is particularly helpful for signal recognition tasks with limited memory and computing resources. 展开更多
关键词 core-set selection deep learning model training signal recognition support data
在线阅读 下载PDF
Statistical delay distribution analysis on high-speed railway trains
18
作者 Yuxiang Yang Ping Huang +2 位作者 Qiyuan Peng Jie LI Chao Wen 《Journal of Modern Transportation》 2019年第3期188-197,共10页
The focus of this study is to explore the statis-tical distribution models of high-speed railway (HSR) train delays. Based on actual HSR operational data, the delay causes and their classification, delay frequency, nu... The focus of this study is to explore the statis-tical distribution models of high-speed railway (HSR) train delays. Based on actual HSR operational data, the delay causes and their classification, delay frequency, number of affected trains, and space–time delay distributions are discussed. Eleven types of delay events are classified, and a detailed analysis of delay distribution for each classifica-tion is presented. Models of delay probability delay prob-ability distribution for each cause are proposed. Different distribution functions, including the lognormal, exponen-tial, gamma, uniform, logistic, and normal distribution, were selected to estimate and model delay patterns. The most appropriate distribution, which can approximate the delay duration corresponding to each cause, is derived. Subsequently, the Kolmogorov–Smirnov (K–S) test was used to test the goodness of fit of different train delay distribution models and the associated parameter values. The test results show that the distribution of the test data is consistent with that of the selected models. The fitting distribution models show the execution effect of the timetable and help in finding out the potential conflicts in real-time train operations. 展开更多
关键词 High-speed RAILWAY train DELAY CAUSE Actual operation data DISTRIBUTION model
在线阅读 下载PDF
Security Vulnerability Analyses of Large Language Models (LLMs) through Extension of the Common Vulnerability Scoring System (CVSS) Framework
19
作者 Alicia Biju Vishnupriya Ramesh Vijay K. Madisetti 《Journal of Software Engineering and Applications》 2024年第5期340-358,共19页
Large Language Models (LLMs) have revolutionized Generative Artificial Intelligence (GenAI) tasks, becoming an integral part of various applications in society, including text generation, translation, summarization, a... Large Language Models (LLMs) have revolutionized Generative Artificial Intelligence (GenAI) tasks, becoming an integral part of various applications in society, including text generation, translation, summarization, and more. However, their widespread usage emphasizes the critical need to enhance their security posture to ensure the integrity and reliability of their outputs and minimize harmful effects. Prompt injections and training data poisoning attacks are two of the most prominent vulnerabilities in LLMs, which could potentially lead to unpredictable and undesirable behaviors, such as biased outputs, misinformation propagation, and even malicious content generation. The Common Vulnerability Scoring System (CVSS) framework provides a standardized approach to capturing the principal characteristics of vulnerabilities, facilitating a deeper understanding of their severity within the security and AI communities. By extending the current CVSS framework, we generate scores for these vulnerabilities such that organizations can prioritize mitigation efforts, allocate resources effectively, and implement targeted security measures to defend against potential risks. 展开更多
关键词 Common Vulnerability Scoring System (CVSS) Large Language models (LLMs) DALL-E Prompt Injections training data Poisoning CVSS Metrics
在线阅读 下载PDF
人工智能大模型训练数据的风险类型与法律规制 被引量:41
20
作者 黄锫 《政法论丛》 北大核心 2025年第1期23-37,共15页
训练数据对于人工智能大模型的开发具有不可或缺的重要作用。但是基于我国现行的法律制度和大模型的技术原理,会存在训练数据侵权风险、训练数据偏差风险和训练数据泄露风险等三种风险类型。人工智能大模型训练数据的侵权风险主要包括... 训练数据对于人工智能大模型的开发具有不可或缺的重要作用。但是基于我国现行的法律制度和大模型的技术原理,会存在训练数据侵权风险、训练数据偏差风险和训练数据泄露风险等三种风险类型。人工智能大模型训练数据的侵权风险主要包括大模型预训练时使用作品类数据可能会违反《著作权法》的规定、使用个人信息数据可能会违反《个人信息保护法》的规定等两种情形。人工智能大模型训练数据的偏差风险主要包括价值性偏差风险、时效性偏差风险和真实性偏差风险等三种情形。人工智能大模型训练数据的泄露风险主要包括面向开发者的数据泄露风险、面向攻击者的数据泄露风险等两种情形。可以通过调整现行立法来满足人工智能大模型开发者的训练数据需求,通过元规制的方式激励人工智能大模型开发者防范训练数据的偏差风险,以及通过加强法定义务督促人工智能大模型开发者防范训练数据的泄露风险。 展开更多
关键词 生成式人工智能 大模型 训练数据 法律规制
在线阅读 下载PDF
上一页 1 2 18 下一页 到第
使用帮助 返回顶部