Adapting CRISP-DM for idea mining a data mining process for generating ideas using a textual dataset

被引:0
|
作者
Ayele W.Y. [1 ]
机构
[1] Stockholm University, Department of Computer and Systems Sciences, DSV Stockholm University
关键词
CRISP-DM; CRISP-IM; Dynamic topic modeling; Idea evaluation; Idea generation; Idea mining evaluation;
D O I
10.14569/IJACSA.2020.0110603
中图分类号
学科分类号
摘要
Data mining project managers can benefit from using standard data mining process models. The benefits of using standard process models for data mining, such as the de facto and the most popular, Cross-Industry-Standard-Process model for Data Mining (CRISP-DM) are reduced cost and time. Also, standard models facilitate knowledge transfer, reuse of best practices, and minimize knowledge requirements. On the other hand, to unlock the potential of ever-growing textual data such as publications, patents, social media data, and documents of various forms, digital innovation is increasingly needed. Furthermore, the introduction of cutting-edge machine learning tools and techniques enable the elicitation of ideas. The processing of unstructured textual data to generate new and useful ideas is referred to as idea mining. Existing literature about idea mining merely overlooks the utilization of standard data mining process models. Therefore, the purpose of this paper is to propose a reusable model to generate ideas, CRISP-DM, for Idea Mining (CRISP-IM). The design and development of the CRISP-IM are done following the design science approach. The CRISP-IM facilitates idea generation, through the use of Dynamic Topic Modeling (DTM), unsupervised machine learning, and subsequent statistical analysis on a dataset of scholarly articles. The adapted CRISP-IM can be used to guide the process of identifying trends using scholarly literature datasets or temporally organized patent or any other textual dataset of any domain to elicit ideas. The ex-post evaluation of the CRISP-IM is left for future study. © Science and Information Organization.
引用
收藏
页码:20 / 32
页数:12
相关论文
共 50 条
  • [41] CHICKEN EGG SEXING BY USING DATA MINING PROCESS
    Toksoz, Canan
    Albayrak, Mehmet
    Yasar, Huseyin
    FRESENIUS ENVIRONMENTAL BULLETIN, 2021, 30 (02): : 1373 - 1381
  • [42] Analysis of stock price return using textual data and numerical data through text mining
    Takahashi, Satoru
    Takahashi, Masakazu
    Takahashi, Hiroshi
    Tsuda, Kazuhiko
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2006, 4252 : 310 - 316
  • [43] Business environmental analysis for textual data using data mining and sentence-level classification
    Kim, Yoon-Sung
    Rim, Hae-Chang
    Lee, Do-Gil
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2019, 119 (01) : 69 - 88
  • [44] Analysis of KDD CUP 99 dataset using clustering based data mining
    College of Computer Engineering and Sciences, Salman bin Abdulaziz University, Saudi Arabia
    Int. J. Database Theory Appl., 2013, 5 (23-34):
  • [45] Process Data Analysis Using Visual Analytics and Process Mining Techniques
    Sitova, Irina
    Pecerska, Jelena
    2020 61ST INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT SCIENCE OF RIGA TECHNICAL UNIVERSITY (ITMS), 2020,
  • [46] A data mining based approach for process identification using historical data
    Oulhiq, Ridouane
    Benjelloun, Khalid
    Kali, Yassine
    Saad, Maarouf
    INTERNATIONAL JOURNAL OF MODELLING AND SIMULATION, 2022, 42 (02): : 335 - 349
  • [47] A research case study: Difficulties and recommendations when using a textual data mining tool
    Al-Hassan, Abeer A.
    Alshameri, Faleh
    Sibley, Edgar H.
    INFORMATION & MANAGEMENT, 2013, 50 (07) : 540 - 552
  • [48] Analyzing textual databases using data mining to enable fast product development processes
    Menon, R
    Tong, LH
    Sathiyakeerthi, S
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2005, 88 (02) : 171 - 180
  • [49] The effect of dataset size and the process of big data mining for investigating solar-thermal desalination by using machine learning
    Peng, Guilong
    Sun, Senshan
    Xu, Zhenwei
    Du, Juxin
    Qin, Yangjun
    Sharshir, Swellam W.
    Kandeal, A. W.
    Kabeel, A. E.
    Yang, Nuo
    INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2025, 236
  • [50] Building Process Understanding for Vaccine Manufacturing Using Data Mining
    Wiener, Matthew C.
    Obando, Louis
    O'Neill, Julia
    QUALITY ENGINEERING, 2010, 22 (03) : 157 - 168