Distributed data mining and its applications to intelligent textual information processing

被引:0
|
作者
Qiu, SB [1 ]
Qiu, M [1 ]
机构
[1] Univ New Mexico, Dept Elect & Comp Engn, Albuquerque, NM 87131 USA
关键词
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
Textual information processing is of fundamental importance, due to the massive amount of documents, especially online textual information that we need to process every day. In this paper, we stud), data mining techniques applied to intelligent textual information processing in distributed environments, including text classification. information extraction (IE) and topic detection and tracking (TDT). These intelligent processing techniques will improve the quality and efficiency of information resource management and utilization. Their statistical models and computational algorithms challenge the researches in data mining and distributed/parallel computing. When successfully applied, they will help enhance and benefit applications in IT, digital library, and information retrieval. Specifically, we study the distributed computing of the following algorithms: naive Bayes classifier combined with expectation-maximization (EM) for text classification, hidden Markov model for information extraction, and deterministic annealing with EM for topic detection and tracking. We also study the performances of the proposed algorithms and experiment on the improvements.
引用
收藏
页码:366 / 370
页数:5
相关论文
共 50 条
  • [41] Multilevel Natural Language Processing for Intelligent Information Retrieval and Text Mining
    I. V. Smirnov
    Scientific and Technical Information Processing, 2024, 51 (6) : 629 - 635
  • [42] Natural language interaction with the web of data by mining its textual side
    Cabrio, Elena
    Cojan, Julien
    Aprosio, Alessio Palmero
    Gandon, Fabien
    INTELLIGENZA ARTIFICIALE, 2012, 6 (02) : 121 - 133
  • [43] Intelligent statistical data mining with information complexity and genetic algorithms
    Bozdogan, H
    STATISTICAL DATA MINING AND KNOWLEDGE DISCOVERY, 2004, : 15 - 56
  • [44] Design of an Information Intelligent System based on Web Data Mining
    Zhang, Xinlin
    Yin, Xiangdong
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 88 - 91
  • [45] Data analysis and mining environment: a distributed intelligent agent technology application
    Sugumaran, V
    Bose, R
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 1999, 99 (1-2) : 71 - 80
  • [46] Data analysis and mining environment: A distributed intelligent agent technology application
    Sugumaran, Vijayan
    Bose, Ranjit
    Industrial Management and Data Systems, 1999, 99 (01): : 71 - 80
  • [47] Data Processing and Information Mining of Sample in Statistical Model
    Zhang Deran
    Xiang Jing
    RECENT ADVANCE IN STATISTICS APPLICATION AND RELATED AREAS, VOLS I AND II, 2009, : 1165 - 1169
  • [48] Distributed Publish/Subscribe Query Processing on the Spatio-Textual Data Stream
    Chen, Zhida
    Cong, Gao
    Zhang, Zhenjie
    Fu, Tom Z. J.
    Chen, Lisi
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 1095 - 1106
  • [49] An Approach to Intelligent Distributed Scanning and Analytical Processing of the Internet Inappropriate Information
    Branitskiy, Alexander
    Fedorchenko, Andrey
    Kotenko, Igor
    Saenko, Igor
    PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 1, 2019, : 146 - 151
  • [50] A Distributed Data Mining Framework Accelerated with Graphics Processing Units
    Nam-Luc Tran
    Dugauthier, Quentin
    Skhiri, Sabri
    2013 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CLOUDCOM-ASIA), 2013, : 366 - 372