An intelligent approach to data extraction and task identification for process mining

被引:19
|
作者
Li, Jiexun [1 ]
Wang, Harry Jiannan [2 ]
Bai, Xue [3 ]
机构
[1] Oregon State Univ, Coll Business, Dept Business Informat Syst, Corvallis, OR 97331 USA
[2] Univ Delaware, Dept Accounting & Management Informat Syst, Lerner Coll Business & Econ, Newark, DE 19716 USA
[3] Univ Connecticut, Sch Business, Dept Operat & Informat Management, Storrs, CT 06269 USA
关键词
Business process management; Computational experiments; Data extraction; Process mining; Task identification; Text mining;
D O I
10.1007/s10796-015-9564-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Business process mining has received increasing attention in recent years due to its ability to provide process insights by analyzing event logs generated by various enterprise information systems. A key challenge in business process mining projects is extracting process related data from massive event log databases, which requires rich domain knowledge and advanced database skills and could be very labor-intensive and overwhelming. In this paper, we propose an intelligent approach to data extraction and task identification by leveraging relevant process documents. In particular, we analyze those process documents using text mining techniques and use the results to identify the most relevant database tables for process mining. The novelty of our approach is to formalize data extraction and task identification as a problem of extracting attributes as process components, and relations among process components, using sequence kernel techniques. Our approach can reduce the effort and increase the accuracy of data extraction and task identification for process mining. A business expense imbursement case is used to illustrate our approach.
引用
收藏
页码:1195 / 1208
页数:14
相关论文
共 50 条
  • [21] Toward intelligent assistance for a data mining process: An ontology-based approach for cost-sensitive classification
    Bernstein, A
    Provost, F
    Hill, S
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (04) : 503 - 518
  • [22] Modelling Task Durations Towards Automated, Big Data, Process Mining
    Faddy, Malcolm
    Yang, Lingkai
    Mcclean, Sally
    Donnelly, Mark
    Khan, Kashaf
    Burke, Kevin
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2025, 41 (01)
  • [23] Clinical data mining - An approach for identification of Refractive errors
    Shekar, D. V. Chandra
    Srinivas, V. Sesha
    IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 551 - +
  • [24] Data Mining Approach to the Identification of At-Risk Students
    Ho, Li Chin
    Shim, Kyong Jin
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 5333 - 5335
  • [25] Effective educational process: A data-mining approach
    Ranjan, Jayanthi
    Malik, Kamna
    VINE, 2007, 37 (04): : 502 - 515
  • [26] Ranking Sentences for Keyphrase Extraction: A Relational Data Mining Approach
    Ceci, Michelangelo
    Loglisci, Corrado
    Macchia, Lucrezia
    10TH ITALIAN RESEARCH CONFERENCE ON DIGITAL LIBRARIES (IRCDL 2014), 2014, 38 : 52 - 59
  • [27] Data Mining Approach for Intelligent Customer Behavior Analysis for a Retail Store
    Abirami, M.
    Pattabiraman, V.
    PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON BIG DATA AND CLOUD COMPUTING CHALLENGES (ISBCC - 16'), 2016, 49 : 283 - 291
  • [28] A KNOWLEGDE MANAGEMENT APPROACH THROUGH INTELLIGENT AGENTS AS DATA MINING TECHNIQUES
    Irina, Tudor
    Liviu, Ionita
    BALKAN REGIONAL CONFERENCE ON ENGINEERING AND BUSINESS EDUCATION & ICEBE, VOLS I AND II, CONFERENCE PROCEEDINGS, 2009, : 420 - 423
  • [29] Intelligent data mining system
    Takahara, Y
    Liu, YM
    Hu, JH
    Shimazu, H
    Okada, M
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON E-BUSINESS (ICEB2002), 2002, : 274 - 280
  • [30] An Ontology based Approach to Intelligent Data Mining for Environmental Virtual Warehouses of Sensor Data
    Trifan, M.
    Ionescu, Bogdan
    Ionescu, Dan
    Prostean, O.
    Prostean, G.
    2008 IEEE INTERNATIONAL CONFERENCE ON VIRTUAL ENVIRONMENTS, HUMAN-COMPUTER INTERFACES AND MEASUREMENT SYSTEMS, 2008, : 125 - +