Automatic Content Analysis of Legislative Documents by Text Mining Techniques

被引:2
|
作者
Lin, Fu-Ren [1 ]
Chou, Shih-Yao [1 ]
Liao, Dachi [2 ]
Hao, De [1 ]
机构
[1] Natl Tsing Hua Univ, Inst Serv Sci, Hsinchu 30013, Taiwan
[2] Natl Sun Yat Sen Univ, Inst Polit Sci, Kaohsiung 80424, Taiwan
来源
2015 48TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS) | 2015年
关键词
D O I
10.1109/HICSS.2015.263
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Parliamentary Library of Taiwan's Legislative Yuan website provides a fair and objective channel for the public to track daily activities of the Legislative Yuan and legislators' inquiries. However the quantity of generated documents is so large that the general public may not be able to keep track of the legislative performance of each legislator from these contents. To mitigate the gap of legislative document generation and the sense making by the general public, this study proposed a text mining mechanism to automatically classify legislative documents referring to each legislator, and then represent the proportion of their legislative performance on certain categories. This study first initiated a basic legislative categorical structure by domain experts. Then a two-stage clustering was applied to perform feature selection for legislative documents. The SVM method was applied to build a model to classify the new document to the appropriate category. In order to maintain the classification categories up to date, in this study, we also evaluate the difference between labeling contents by domain experts and the general public. Experimental results show the effectiveness of the proposed test mining mechanism, which automatically classifies legislative documents to reveal legislators' performance accordingly. With this result, people can monitor legislators and track their legislative activities using the information from the Parliamentary Library of Legislative Yuan to update their perception on legislative performance in various categories.
引用
收藏
页码:2199 / 2208
页数:10
相关论文
共 50 条
  • [11] Text Mining - Automated Content Analysis
    Storch, Monika
    PSYCHOTHERAPIE PSYCHOSOMATIK MEDIZINISCHE PSYCHOLOGIE, 2021, 71 (07) : 301 - 302
  • [12] Text mining techniques for patent analysis
    Tseng, Yuen-Hsien
    Lin, Chi-Jen
    Lin, Yu-I
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (05) : 1216 - 1247
  • [13] Content-based text mining technique for retrieval of CAD documents
    Yu, Wen-der
    Hsu, Jia-yang
    AUTOMATION IN CONSTRUCTION, 2013, 31 : 65 - 74
  • [14] Automated Text Mining for Requirements Analysis of Policy Documents
    Massey, Aaron K.
    Eisenstein, Jacob
    Anton, Annie, I
    Swire, Peter P.
    2013 21ST IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE), 2013, : 4 - 13
  • [15] Text Mining for Automatic Lexical Analysis of Layman Text of Biomedical Argument
    Defilippi, D.
    Pivetti, S.
    Giacomini, M.
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING, VOL 25, PT 12, 2009, 25 (12): : 281 - 284
  • [16] Automatic Building of an Ontology from a Corpus of Text Documents Using Data Mining Tools
    Toledo-Alvarado, J. I.
    Guzman-Arenas, A.
    Martinez-Luna, G. L.
    JOURNAL OF APPLIED RESEARCH AND TECHNOLOGY, 2012, 10 (03) : 398 - 404
  • [17] Techniques on Text Mining
    Sukanya, M.
    Biruntha, S.
    2012 IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2012, : 269 - 271
  • [18] Text mining in the classification of digital documents
    Contreras Barrera, Marcial
    BIBLIOS-REVISTA DE BIBLIOTECOLOGIA Y CIENCIAS DE LA INFORMACION, 2016, (64): : 33 - 43
  • [19] Ontological text mining of software documents
    Witte, Rene
    Li, Qiangqiang
    Zhang, Yonggang
    Rilling, Juergen
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4592 : 168 - +
  • [20] Automatic Text Categorization Marathi documents
    Patil, Javdeep Jalindar
    Bogiri, Nagaraju
    2015 INTERNATIONAL CONFERENCE ON ENERGY SYSTEMS AND APPLICATIONS, 2015, : 689 - 694