The Use of Domain Knowledge Models for Effective Data Mining of Unstructured Customer Service Data in Engineering Applications

被引：9

作者：

Munger, T. ^{[1
]}

Desa, S. ^{[1
]}

Wong, C. ^{[2
]}

机构：

[1] Univ Calif Santa Cruz, Baskin Sch Engn, Technol & Informat Management, Santa Cruz, CA 95064 USA

[2] Cisco Syst, Smart Serv Technol Grp, San Jose, CA 95134 USA

来源：

2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015) | 2015年

关键词：

D O I：

10.1109/BigDataService.2015.46

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Despite the fact that enterprises are routinely collecting massive amounts of data from customers, only a relatively small body of knowledge engineering (KE) work has addressed methods and application of KE to the design, development, and maintenance of engineering systems and products. A major challenge when applying KE to such applications is that the data is often unstructured and in the form of text exchanges between the customer and the enterprise. While the importance of modelling domain knowledge in order to produce meaningful results from mining unstructured data has been recognized, most approaches are based primarily on the linguistic structure of the text and keyword taxonomies. These approaches share the common issue that the knowledge extraction results are often not properly structured for solving the engineering problem of interest and, therefore, require manual post-processing before they can be applied. Our hypothesis is that the a priori modelling of the engineering problem of interest is crucial for both (1) efficient (rapid) collection, representation, and structuring of domain knowledge; and (2) the proper integration of domain knowledge with analytical KE methods in order facilitate the extraction of useful knowledge. In order to validate our hypothesis, we apply this approach to the important real-world engineering problem of monitoring the occurrence of product failure modes, and thereby product quality, using customer support cases. In order to translate the free-form text provided by the customer into engineering failure modes we use two methods from engineering design, the Function Analysis System Technique (FAST) and Failure Modes and Effects Analysis (FMEA), to provide the necessary domain knowledge model. This model then drives the collection, representation, and structuring of the failure modes for the product of interest. These failure modes are used as the class labels when applying data mining classification techniques (e.g., Support Vector Machine) to the support case data. The labelled support case data then can be aggregated by failure mode in order to compute a number of failure mode metrics that can be used to monitor product quality. We have demonstrated our approach to monitor the quality of a network security product at a large computer networking company using a data set of 100,000 customer support cases.

引用

页码：427 / 438

页数：12

共 50 条

[41] The role of domain knowledge in a large scale Data Mining project
Kopanas, I
Avouris, NM
Daskalaki, S
METHODS AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2002, 2308 : 288 - 299
[42] Using Declarative Specifications of Domain Knowledge for Descriptive Data Mining
Atzmueller, Martin
Seipel, Dietmar
APPLICATIONS OF DECLARATIVE PROGRAMMING AND KNOWLEDGE MANAGEMENT: 17TH INTERNATIONAL CONFERENCE, INAP 2007/21ST WORKSHOP ON LOGIC PROGRAMMING, WLP 2007, 2009, 5437 : 149 - 164
[43] Incorporating domain knowledge into attribute-oriented data mining
McClean, S
Scotney, B
Shapcott, M
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2000, 15 (06) : 535 - 547
[44] Combining expert knowledge and data mining in a medical diagnosis domain
Alonso, F
Caraça-Valente, JP
González, AL
Montes, C
EXPERT SYSTEMS WITH APPLICATIONS, 2002, 23 (04) : 367 - 375
[45] Decision Tree Algorithms: Integration of Domain Knowledge for Data Mining
Stravinskiene, Aukse
Gudas, Saulius
Dabrilaite, Aiste
BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2012, 2012, 127 : 13 - 24
[46] Mining Knowledge from Engineering Materials Database for Data Analysis
Doreswamy
Hemanth, K. S.
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2012), 2014, 236 : 1217 - 1223
[47] An Effective way of Mining Knowledge from Heterogeneous Data Sources
Molli, Venkateswara Rao
Veeramanickam, M. R. M.
2014 INTERNATIONAL CONFERENCE FOR CONVERGENCE OF TECHNOLOGY (I2CT), 2014,
[48] Distributed Spatial Data Mining in Geospatial Knowledge Service Grid
Lin Jiaxiang
Chen Chongcheng
Wang Qinmin
Wang Weibin
Wu Jianwei
SECOND INTERNATIONAL CONFERENCE ON ADVANCED GEOGRAPHIC INFORMATION SYSTEMS, APPLICATIONS, AND SERVICES: GEOPROCESSING 2010, PROCEEDINGS, 2010, : 80 - 87
[49] Creation of unstructured big data from customer service: The case of parcel shipping companies on Twitter
Bhattacharjya, Jyotirmoyee
Ellison, Adrian Bachman
Pang, Vincent
Gezdur, Arda
INTERNATIONAL JOURNAL OF LOGISTICS MANAGEMENT, 2018, 29 (02) : 723 - 738
[50] Design of soft computing models for data mining applications
Sumathi, S
Sivanandam, SN
Jagadeeswari
INDIAN JOURNAL OF ENGINEERING AND MATERIALS SCIENCES, 2000, 7 (03) : 107 - 121

← 1 2 3 4 5 →