MINING FOR RELEVANT TERMS FROM LOG FILES

被引:0
|
作者
Saneifar, Hassan [1 ,2 ]
Bonniol, Stephane [2 ]
Laurent, Anne [1 ]
Poncelet, Pascal [1 ]
Roche, Mathieu [1 ]
机构
[1] Univ Montpellier 2, CNRS, LIRMM, 161 Rue Ada, F-34392 Montpellier 5, France
[2] Sain IP Technol, F-34960 Montpellier, France
关键词
Natural language processing; Information retrieval; Terminology extraction; Terminology ranking; Log files;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Information extracted from log files of computing systems can be considered one of the important resources of information systems. In the case of Integrated Circuit design, log files generated by design tools are not exhaustively exploited. The logs of this domain are multi-source, multi-format, and have a heterogeneous and evolving structure. Moreover, they usually do not respect the grammar and the structures of natural language though they are written in English. According to features of such textual data, applying the classical methods of information extraction is not an easy task, more particularly for terminology extraction. We have previously introduced EXTERLOG approach to extract the terminology from such log files. In this paper, we introduce a new developed version of EXTERLOG guided by Web. We score the extracted terms by a Web and context based measure. We favor the more relevant terms of domain and emphasize the precision by filtering terms based on their scores. The experiments show that EXTERLOG is well-adapted terminology extraction approach from log files.
引用
收藏
页码:77 / +
页数:2
相关论文
共 50 条
  • [11] Terminology Extraction from Log Files
    Saneifar, Hassan
    Bonniol, Stephane
    Laurent, Anne
    Poncelet, Pascal
    Roche, Mathieu
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 769 - +
  • [12] Detecting Student at Risk of Failure: A Case Study of Conceptualizing Mining from Internet Access Log Files
    Trakunphutthirak, Ruangsak
    Cheung, Yen
    Lee, Vincent C. S.
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 365 - 371
  • [13] Merging Computer Log Files for Process Mining: An Artificial Immune System Technique
    Claes, Jan
    Poels, Geert
    BUSINESS PROCESS MANAGEMENT WORKSHOPS, PT I, 2012, 99 : 99 - 110
  • [14] Flow-based Identification of Botnet Traffic by Mining Multiple Log Files
    Masud, Mohammad M.
    Al-Khateeb, Tahseen
    Khan, Latifur
    Thuraisingham, Bhavani
    Hamlen, Kevin W.
    DFMA 2008: FIRST INTERNATIONAL CONFERENCE ON DISTRIBUTED FRAMEWORKS & APPLICATIONS, PROCEEDINGS, 2008, : 200 - 206
  • [15] Integrating Computer Log Files for Process Mining: A Genetic Algorithm Inspired Technique
    Claes, Jan
    Poels, Geert
    ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS, 2011, 83 : 282 - 293
  • [16] From Log Files to Assessment Metrics: Measuring Students' Science Inquiry Skills Using Educational Data Mining
    Gobert, Janice D.
    Sao Pedro, Michael
    Raziuddin, Juelaila
    Baker, Ryan S.
    JOURNAL OF THE LEARNING SCIENCES, 2013, 22 (04) : 521 - 563
  • [17] log files. stories from the internet of things
    Frank, Tina
    Puehrerfellner, Marianne
    von Rechbach, Barbara
    Lechner, David
    IOT'17: PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON THE INTERNET OF THINGS, 2017, : 209 - 210
  • [18] Viewing & organizing log files
    Grenetz, P
    DR DOBBS JOURNAL, 2006, 31 (02): : 61 - 64
  • [19] Educational Data Mining from Action LOG Files of Intelligent Remote Laboratory with Embedded Simulations in Physics Teaching I
    Das, Sayan
    Schauer, Franz
    Ozvoldova, Miroslava
    CHALLENGES OF THE DIGITAL TRANSFORMATION IN EDUCATION, ICL2018, VOL 2, 2019, 917 : 675 - 686
  • [20] Pseudonymizing Unix log files
    Flegel, U
    INFRASTRUCTURE SECURITY, PROCEEDINGS, 2002, 2437 : 162 - 179