Towards an intelligent text categorization for web resources: An implementation

被引:0
|
作者
Zadrozny, S [1 ]
Lawcewicz, K [1 ]
Kacprzyk, J [1 ]
机构
[1] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
关键词
automatic classification of documents; Internet; linguistic terms;
D O I
10.1016/B978-044451379-3/50012-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose the concept and implementation of a software system, TCAT (Text CATegorization) system, for an automatic recognition of a topic of an Internet document. In the training mode the user provides the system with a list of topics and sets of documents representing each topic (supervised learning). In the recognition mode the system automatically classifies previously unseen document to a topic category. A simple learning algorithm is devised and implemented. The results of the classification are presented to the user in the form of a set of linguistic terms. Some new measures of correctness of the classification are proposed. The implemented system processes documents in several popular Internet-related formats.
引用
收藏
页码:153 / 164
页数:12
相关论文
共 50 条
  • [31] Implementation of Intelligent Recommendation System for Learning Resources
    Li Hui
    Shi Jun
    Shu Zhang
    Hu Yun
    2017 12TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2017), 2017, : 139 - 144
  • [32] Automatic categorization of web text documents using fuzzy inference rule
    Dhar, Ankita
    Mukherjee, Himadri
    Dash, Niladri Sekhar
    Roy, Kaushik
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2020, 45 (01):
  • [33] Automatic categorization of web text documents using fuzzy inference rule
    Ankita Dhar
    Himadri Mukherjee
    Niladri Sekhar Dash
    Kaushik Roy
    Sādhanā, 2020, 45
  • [34] INTIMATE: A web-based movie recommender using text categorization
    Mak, H
    Koprinska, I
    Poon, J
    IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 602 - 605
  • [35] Annotating text segments using a web-based categorization approach
    Chiao, HC
    Pu, HT
    Chien, LF
    DIGITAL LIBRARIES: IMPLEMENTING STRATEGIES AND SHARING EXPERIENCES, PROCEEDINGS, 2005, 3815 : 323 - 331
  • [36] Web Services Enabled Text Categorization System: Service Infrastructure Designing
    Zhang, Xiaobin
    Mei, Jian
    Wang, Suge
    Zhang, Wu
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (02): : 73 - 77
  • [37] Alternative implementation techniques for web text visualization
    Alonso, O
    Baeza-Yates, R
    FIRST LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2003, : 202 - 204
  • [38] The design and implementation of intelligent transportation web services
    Wu, CH
    Su, DC
    Chang, J
    Wei, CC
    Lin, KJ
    Ho, JM
    IEEE INTERNATIONAL CONFERENCE ON E-COMMERCE, 2003, : 49 - 52
  • [39] Hadoop MapReduce Implementation of A Novel scheme for Term weighting in Text Categorization
    Dalavi, Manesh
    Cheke, Shailesh
    2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 994 - 999
  • [40] Design and Implementation of Wind Resources Web Platform
    Jiang, Yuan
    Liang, Likai
    Tong, Qiang
    Yuan, Ruitong
    Li, Ruilin
    2018 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE APPLICATIONS AND TECHNOLOGIES (AIAAT 2018), 2018, 435