Adapting CRISP-DM for idea mining: a data mining process for generating ideas using a textual dataset

Cited by: 0
Authors
Ayele W.Y. [1]
Affiliations
[1] Stockholm University, Department of Computer and Systems Sciences, DSV Stockholm University
Keywords
CRISP-DM; CRISP-IM; Dynamic topic modeling; Idea evaluation; Idea generation; Idea mining evaluation
DOI
10.14569/IJACSA.2020.0110603
Abstract
Data mining project managers can benefit from using standard data mining process models. Standard process models, such as the Cross-Industry Standard Process for Data Mining (CRISP-DM), the de facto and most popular model, reduce cost and time. Standard models also facilitate knowledge transfer and the reuse of best practices, and they minimize knowledge requirements. At the same time, digital innovation is increasingly needed to unlock the potential of ever-growing textual data such as publications, patents, social media data, and documents of various forms. Furthermore, the introduction of cutting-edge machine learning tools and techniques enables the elicitation of ideas. The processing of unstructured textual data to generate new and useful ideas is referred to as idea mining. Existing literature on idea mining largely overlooks the use of standard data mining process models. Therefore, the purpose of this paper is to propose a reusable model for generating ideas: CRISP-DM for Idea Mining (CRISP-IM). The CRISP-IM is designed and developed following the design science approach. It facilitates idea generation through the use of Dynamic Topic Modeling (DTM), unsupervised machine learning, and subsequent statistical analysis on a dataset of scholarly articles. The adapted CRISP-IM can be used to guide the process of identifying trends in scholarly literature datasets, temporally organized patents, or any other textual dataset from any domain in order to elicit ideas. The ex-post evaluation of the CRISP-IM is left for future study. © Science and Information Organization.
Pages: 20 - 32
Page count: 12
Related papers
50 in total
  • [1] Adapting CRISP-DM for Idea Mining A Data Mining Process for Generating Ideas Using a Textual Dataset
    Ayele, Workneh Y.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (06) : 20 - 32
  • [2] Adapting the CRISP-DM Data Mining Process: A Case Study in the Financial Services Domain
    Plotnikova, Veronika
    Dumas, Marlon
    Milani, Fredrik
    RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS 2021), 2021, 415 : 55 - 71
  • [3] Specializing CRISP-DM for evidence mining
    Venter, Jacobus
    de Waal, Alta
    Willers, Cornelius
    ADVANCES IN DIGITAL FORENSICS III, 2007, 242 : 303 - +
  • [4] USING DATA MINING FOR BANK DIRECT MARKETING: AN APPLICATION OF THE CRISP-DM METHODOLOGY
    Moro, Sergio
    Laureano, Raul M. S.
    Cortez, Paulo
    EUROPEAN SIMULATION AND MODELLING CONFERENCE 2011, 2011, : 117 - +
  • [5] Using Data Mining for Prediction of Hospital Length of Stay: An Application of the CRISP-DM Methodology
    Caetano, Nuno
    Cortez, Paulo
    Laureano, Raul M. S.
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2014, 2015, 227 : 149 - 166
  • [6] Analysing warranty claims of automobiles - An application description following the CRISP-DM data mining process
    Hipp, J
    Lindner, G
    INTERNET APPLICATIONS, 1999, 1749 : 31 - 40
  • [7] Applying the CRISP-DM data mining process in the financial services industry: Elicitation of adaptation requirements
    Plotnikova, Veronika
    Dumas, Marlon
    Milani, Fredrik P.
    DATA & KNOWLEDGE ENGINEERING, 2022, 139
  • [8] Synthesizing CRISP-DM and Quality Management: A Data Mining Approach for Production Processes
    Schafer, Franziska
    Zeiselmair, Christian
    Becker, Jonas
    Otten, Heiner
    2018 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGY MANAGEMENT, OPERATIONS AND DECISIONS (ICTMOD), 2018, : 190 - 195
  • [9] DMME: Data mining methodology for engineering applications - a holistic extension to the CRISP-DM model
    Huber, Steffen
    Wiemer, Hajo
    Schneider, Dorothea
    Ihlenfeldt, Steffen
    12TH CIRP CONFERENCE ON INTELLIGENT COMPUTATION IN MANUFACTURING ENGINEERING, 2019, 79 : 403 - 408
  • [10] CRISP-DM Twenty Years Later: From Data Mining Processes to Data Science Trajectories
    Martinez-Plumed, Fernando
    Contreras-Ochando, Lidia
    Ferri, Cesar
    Hernandez-Orallo, Jose
    Kull, Meelis
    Lachiche, Nicolas
    Ramirez-Quintana, Maria Jose
    Flach, Peter
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (08) : 3048 - 3061