事件数据处理
2021-12-23
研究内容
- 事件数据质量控制:针对多源事件日志中存在的事件丢失、乱序、标签错误、结构错误问题,研究事件日志中每条轨迹在正确流程模型指导下的高效过滤和修复机制;针对动态、多变的流程执行环境引发的流程变更导致的事件轨迹与流程模型不符问题,研究基于正确事件日志对原始流程模型进行最小代价的自动修复技术。该方面研究还包括基于速度约束的流数据清理技术等。
- 事件数据集成与管理:针对来自各行各业、急剧增长的具有流程特性的事件序列数据和流程模型,研究多源、异构、海量事件日志(亦称流程实例日志)和流程模型的高效集成、统一存储、特征提取、相似性计算、差异性计算、分类聚类、多维索引和综合检索等关键处理技术,并结合大数据处理平台,研究上述技术的分布式并行加速算法,为事件数据的管理和分析再利用奠定坚实的应用基础。
-
流程数据分析与挖掘:开发把流程挖掘问题分解为多个分布到计算机集群的较小挖掘问题的并行化技术(T1);针对无法存储极长一段时间内全部事件的应用,开发无需存储所有事件就能够增量学习流程模型的即时流程挖掘技术(T2);开发能够系统地突出共性和差异的可比较流程挖掘技术,以便能够处理随着时间发生改变而且有很多变种的异质流程(T3)。
研究成果
- Jianmin Wang, Shaoxu Song, Xuemin Lin, Xiaochen Zhu, Jian Pei. Cleaning Structured Event Logs: A Graph Repair Approach. IEEE International Conference on Data Engineering, ICDE 2015
- Shaoxu Song, Aoqian Zhang, Jianmin Wang, Philip S. Yu. SCREEN: Stream Data Cleaning under Speed Constraints. ACM SIGMOD International Conference on Management of Data, SIGMOD 2015
- Xiaochen Zhu, Shaoxu Song, Xiang Lian, Jianmin Wang, Lei Zou. Matching Heterogeneous Event Data. ACM SIGMOD International Conference on Management of Data, SIGMOD 2014: 1211-1222
- Xiaochen Zhu, Shaoxu Song, Jianmin Wang, Philip S. Yu, Jiaguang Sun. Matching Heterogeneous Events with Patterns. IEEE International Conference on Data Engineering, ICDE 2014: 376-387
- Tao Jin, Jianmin Wang, Yun Yang, Lijie Wen, Keqin Li. Refactor Business Process Models with Maximized Parallelism. IEEE Transactions on Services Computing, 2014
- Tao Jin, Jianmin Wang, Lijie Wen, Gen Zou. Computing Refined Ordering Relations with Uncertainty for Acyclic Process Models. IEEE Transactions on Services Computing, 2014
- Jianmin Wang, Tao Jin, Raymond K. Wong, Lijie Wen. Querying business process model repositories - A survey of current approaches and issues. World Wide Web, 2014
- Hedong Yang, Lijie Wen, Jianmin Wang, Raymond K. Wong. CPL+: An improved approach for evaluating the local completeness of event logs. Information Processing Letters, 2014
- Jianmin Wang, Shaoxu Song, Xiaochen Zhu, Xuemin Lin. Efficient Recovery of Missing Events. Proceedings of the VLDB Endowment, PVLDB 6(10): 841-852 (2013)
- Tao Jin, Jianmin Wang, Marcello La Rosa, Arthur H. M. ter Hofstede, Lijie Wen. Efficient querying of large process model repositories. Computers in Industry, 2013
- Jianmin Wang, Raymond K. Wong, Jianwei Ding, Qinlong Guo, Lijie Wen. Efficient Selection of Process Mining Algorithms. IEEE Transactions on Services Computing, 2013
- Liang Song, Jianmin Wang, Lijie Wen, Hui Kong. Efficient Semantics-Based Compliance Checking Using LTL Formulae and Unfolding. Journal of Applied Mathematics, 2013
- Zhaoxia Wang, Jianmin Wang, Xiaochen Zhu, Lijie Wen. Verification of workflow nets with transition conditions. Journal of Zhejiang University - Science C, 2012
- Haiping Zha, Wil M. P. van der Aalst, Jianmin Wang, Lijie Wen, Jiaguang Sun. Verifying workflow processes: a transformation-based approach. Software and System Modeling, 2011
- Haiping Zha, Jianmin Wang, Lijie Wen, Chaokun Wang, Jiaguang Sun. A workflow net similarity measure based on transition adjacency relations. Computers in Industry, 2010
- Lijie Wen, Jianmin Wang, Wil M. P. van der Aalst, Biqing Huang, Jiaguang Sun. Mining process models with prime invisible tasks. Data & Knowledge Engineering, 2010
- Lijie Wen, Jianmin Wang, Wil M. P. van der Aalst, Biqing Huang, Jiaguang Sun. A novel approach for process mining based on event types. Journal of Intelligent Information Systems, 2009
- Lijie Wen, Wil M. P. van der Aalst, Jianmin Wang, Jiaguang Sun. Mining process models with non-free-choice constructs. Data Mining and Knowledge Discovery, 2007
- 殷明; 闻立杰; 王建民; 查海平; 刘英博; 董子禾. 一种器械设备的工作状态检测方法,2014/05/27. 清华大学,专利,申请号:201410225173.3
- 流程数据管理与分析挖掘软件 V1.0,清华大学,软件著作权,登记号:2014SR190808
- 流程模式及片段优化分析工具软件 V1.0,清华大学,软件著作权,登记号:2014SR190823
- 在线数据流程管理平台 V1.0,清华大学,软件著作权,登记号:2014SR190811
- 流程数据管理与分析挖掘软件 V1.0,清华大学,软件著作权,登记号:2014SR190808