Text-mining: Application development challenges

被引:0
|
作者
Varadarajan, S [1 ]
Kasravi, K [1 ]
Feldman, R [1 ]
机构
[1] Elect Data Syst Corp, Troy, MI 48098 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper reviews the best practices and challenges for project managers and developers involved in implementing text-mining applications. With focus on rule-based information extraction, and references to actual cases, the authors share their experiences from having developed several text-mining applications in diverse industries. First, project management issues are discussed, including a process for capturing business requirements and mapping them into features and linguistic patterns, development of linguistic rules, rule development standards, performance metrics, and an evaluation methodology. Linguistic representations such as sub-syntactic, syntactic, semantic, and application-specific rules are identified. Special emphasis is placed on post-information extraction processing, such as improving the relevance of the extracted information, summarization models, techniques for handling typographical errors, resolution of temporal information, anaphora resolution, and a discussion on shallow vs. full parsing. Lastly, the paper discusses various utilities to help with the development of a text-mining application, such as feature analysis, visualization, source document pre-processing, and rule authoring tools.
引用
收藏
页码:247 / 260
页数:14
相关论文
共 50 条
  • [21] Measuring flexibility: A text-mining approach
    Grajzel, Katalin
    Acar, Selcuk
    Dumas, Denis
    Organisciak, Peter
    Berthiaume, Kelly
    FRONTIERS IN PSYCHOLOGY, 2023, 13
  • [22] Text-mining offers clues to success
    Reardon, Sara
    NATURE, 2014, 509 (7501) : 410 - 410
  • [23] Status of text-mining techniques applied to biomedical text
    Erhardt, RAA
    Schneider, R
    Blaschke, C
    DRUG DISCOVERY TODAY, 2006, 11 (7-8) : 315 - 325
  • [24] Text-mining spat heats up
    Richard Van Noorden
    Nature, 2013, 495 : 295 - 295
  • [25] Analysis of patterns in meteorological research and development using a text-mining algorithm
    Park, Hongju
    Kim, Habin
    Park, Taeyoung
    Lee, Yung-Seop
    KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (05) : 935 - 947
  • [26] Text-mining the signals of climate change doubt
    Boussalis, Constantine
    Coan, Travis G.
    GLOBAL ENVIRONMENTAL CHANGE-HUMAN AND POLICY DIMENSIONS, 2016, 36 : 89 - 100
  • [27] Development of Thai Text-Mining Model for Classifying ICD-10 TM
    Jatunarapit, Pornrat
    Piromsopa, Krerk
    Charoeanlap, Chris
    2016 8TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI), 2016,
  • [28] A Text-Mining Approach to Explain Unwanted Behaviours
    Chen, Wei
    Aspinall, David
    Gordon, Andrew D.
    Sutton, Charles
    Muttik, Igor
    PROCEEDINGS OF THE 9TH EUROPEAN WORKSHOP ON SYSTEM SECURITY, (EUROSEC 2016), 2016, : 19 - 24
  • [29] A TEXT-MINING APPROACH FOR CLASSIFICATION OF GENOMIC FRAGMENTS
    Gadia, Vinay
    Rosen, Gail
    2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, PROCEEDINGS, 2008, : 107 - 108
  • [30] Text-mining Approach for Estimating Vulnerability Score
    Miyamoto, Daisuke
    Yamamoto, Yasuhiro
    Nakayama, Masaya
    2015 4TH INTERNATIONAL WORKSHOP ON BUILDING ANALYSIS DATASETS AND GATHERING EXPERIENCE RETURNS FOR SECURITY (BADGERS), 2015, : 67 - 73