The Use of Domain Knowledge Models for Effective Data Mining of Unstructured Customer Service Data in Engineering Applications

被引:9
|
作者
Munger, T. [1 ]
Desa, S. [1 ]
Wong, C. [2 ]
机构
[1] Univ Calif Santa Cruz, Baskin Sch Engn, Technol & Informat Management, Santa Cruz, CA 95064 USA
[2] Cisco Syst, Smart Serv Technol Grp, San Jose, CA 95134 USA
关键词
D O I
10.1109/BigDataService.2015.46
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Despite the fact that enterprises are routinely collecting massive amounts of data from customers, only a relatively small body of knowledge engineering (KE) work has addressed methods and application of KE to the design, development, and maintenance of engineering systems and products. A major challenge when applying KE to such applications is that the data is often unstructured and in the form of text exchanges between the customer and the enterprise. While the importance of modelling domain knowledge in order to produce meaningful results from mining unstructured data has been recognized, most approaches are based primarily on the linguistic structure of the text and keyword taxonomies. These approaches share the common issue that the knowledge extraction results are often not properly structured for solving the engineering problem of interest and, therefore, require manual post-processing before they can be applied. Our hypothesis is that the a priori modelling of the engineering problem of interest is crucial for both (1) efficient (rapid) collection, representation, and structuring of domain knowledge; and (2) the proper integration of domain knowledge with analytical KE methods in order facilitate the extraction of useful knowledge. In order to validate our hypothesis, we apply this approach to the important real-world engineering problem of monitoring the occurrence of product failure modes, and thereby product quality, using customer support cases. In order to translate the free-form text provided by the customer into engineering failure modes we use two methods from engineering design, the Function Analysis System Technique (FAST) and Failure Modes and Effects Analysis (FMEA), to provide the necessary domain knowledge model. This model then drives the collection, representation, and structuring of the failure modes for the product of interest. These failure modes are used as the class labels when applying data mining classification techniques (e.g., Support Vector Machine) to the support case data. The labelled support case data then can be aggregated by failure mode in order to compute a number of failure mode metrics that can be used to monitor product quality. We have demonstrated our approach to monitor the quality of a network security product at a large computer networking company using a data set of 100,000 customer support cases.
引用
收藏
页码:427 / 438
页数:12
相关论文
共 50 条
  • [31] Applications domain driven data mining methodology in bioinformatics
    Li, Yadan
    Bai, Qinghua
    Chen, Zhicheng
    BioTechnology: An Indian Journal, 2014, 10 (09) : 3772 - 3779
  • [32] Mining Vessel Tracking Data for Maritime Domain Applications
    Alessandrini, Alfredo
    Alvarez, Marlene
    Greidanus, Harm
    Gammieri, Vincenzo
    Arguedas, Virginia Fernandez
    Mazzarella, Fabio
    Santamaria, Carlos
    Stasolla, Mattia
    Tarchi, Dario
    Vespe, Michele
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 361 - 367
  • [33] Tensors for Data Mining and Data Fusion: Models, Applications, and Scalable Algorithms
    Papalexakis, Evangelos E.
    Faloutsos, Christos
    Sidiropoulos, Nicholas D.
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2017, 8 (02)
  • [34] Data mining applications in engineering design, manufacturing and logistics
    Feng, JCX
    Kusiak, A
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2006, 44 (14) : 2689 - 2694
  • [35] Comparison of applications for educational data mining in Engineering Education
    Buenano Fernandez, Diego
    Lujan-Mora, Sergio
    2017 IEEE WORLD ENGINEERING EDUCATION CONFERENCE (EDUNINE), 2017,
  • [36] A survey of knowledge discovery and data mining process models
    Kurgan, Lukasz A.
    Musilek, Petr
    KNOWLEDGE ENGINEERING REVIEW, 2006, 21 (01): : 1 - 24
  • [37] A Hybrid Feature Selection Method for Effective Data Classification in Data Mining Applications
    Sangaiya, Ilangovan
    Kumar, A. Vincent Antony
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2019, 11 (01) : 1 - 16
  • [38] A study on the applications of data mining techniques to enhance customer lifetime value
    Shen, Chia-Cheng
    Chuang, Huan-Ming
    WSEAS Transactions on Information Science and Applications, 2009, 6 (02): : 319 - 328
  • [39] An Empirical Assessment of Customer Lifetime Value Models within Data Mining
    Hiziroglu, Abdulkadir
    Sisci, Merve
    Cebeci, Halil Ibrahim
    Seymen, Omer Faruk
    BALTIC JOURNAL OF MODERN COMPUTING, 2018, 6 (04): : 434 - 448
  • [40] Data Mining with Histograms and Domain Knowledge - Case Studies and Considerations
    Rauch, Jan
    Simunek, Milan
    FUNDAMENTA INFORMATICAE, 2019, 166 (04) : 349 - 378