Local log sampling method for process discovery


Abstract: To address the performance bottleneck of traditional process discovery algorithms on large-scale event logs, a log sampling method based on incremental trace information is proposed. The method quantifies the directly-follows relations between events and the feature information of traces, uses whether a trace carries new process behavior as the sampling criterion, and determines the minimum number of consecutively traversed samples from statistical theory. To further speed up preprocessing, a binary exponential skip algorithm is proposed to avoid scanning duplicate traces. Experiments on four real-life event logs show that the proposed sampling method quickly and effectively reduces the size of event logs while retaining the key control-flow and frequency information, and that it improves the running speed of process discovery algorithms.
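The sampling criterion described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that "new process behavior" means a directly-follows pair not yet observed, and it takes the stopping threshold as a plain parameter (the hypothetical `patience`), whereas the paper derives this value from statistical theory. The function names, the artificial start and end markers, and the toy log are all illustrative assumptions. The binary exponential skip step and the retention of frequency information are not covered.

```python
from typing import Iterable, List, Sequence, Set, Tuple

Trace = Sequence[str]
Pair = Tuple[str, str]

# Artificial boundary markers (an assumption of this sketch) so that the
# first and last activity of a trace also count as observable behavior.
START, END = "<start>", "<end>"


def directly_follows(trace: Trace) -> Set[Pair]:
    """Return all (a, b) pairs where b directly follows a in the padded trace."""
    padded = [START, *trace, END]
    return set(zip(padded, padded[1:]))


def sample_log(traces: Iterable[Trace], patience: int) -> List[Trace]:
    """Keep traces that add unseen directly-follows pairs.

    Scanning stops once `patience` consecutive traces add nothing new.
    `patience` is a hypothetical stand-in for the statistically derived
    minimum number of consecutive samples mentioned in the abstract.
    """
    seen: Set[Pair] = set()
    sample: List[Trace] = []
    misses = 0
    for trace in traces:
        pairs = directly_follows(trace)
        if pairs - seen:
            # The trace carries new behavior: keep it and reset the counter.
            seen |= pairs
            sample.append(trace)
            misses = 0
        else:
            misses += 1
            if misses >= patience:
                break
    return sample


# Toy event log (illustrative data only).
log = [
    ("a", "b", "c"),
    ("a", "b", "c"),
    ("a", "c", "b"),
    ("a", "b", "c"),
    ("a", "b", "c"),
    ("a", "b", "d"),
]
print(sample_log(log, patience=2))
```

With a small `patience` the scan stops early and may miss rare behavior that appears late in the log, such as the last trace of the toy log; a larger value trades scanning time for completeness.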

Authors: NI Ke; YU Dongjin; SUN Xiaoxiao; HU Hua (School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China; Hangzhou Normal University, Hangzhou 311121, China)

Affiliations: School of Computer Science, Hangzhou Dianzi University; Hangzhou Normal University

Source: Computer Integrated Manufacturing Systems (indexed in EI, CSCD, and the Peking University Core Journals list), 2022, No. 10, pp. 3166-3174 (9 pages)

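Funding: National Natural Science Foundation of China (61702144); Industrial Internet Innovation and Development Project of the Ministry of Industry and Information Technology (TC200802G, TC2008033); Key Research and Development Program of Zhejiang Province (2020C01165); Natural Science Foundation of Zhejiang Province (LQ20F020017).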

Keywords: process discovery; log sampling; event log; incremental information; process model

Classification: TP311.13 [Automation and Computer Technology: Computer Software and Theory]

