Automatic Content Analysis of Legislative Documents by Text Mining Techniques

Cited by: 2

Authors:

Lin, Fu-Ren [1]

Chou, Shih-Yao [1]

Liao, Dachi [2]

Hao, De [1]

Affiliations:

[1] Natl Tsing Hua Univ, Inst Serv Sci, Hsinchu 30013, Taiwan

[2] Natl Sun Yat Sen Univ, Inst Polit Sci, Kaohsiung 80424, Taiwan

Source:

2015 48TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS) | 2015

Keywords:

DOI:

10.1109/HICSS.2015.263

Chinese Library Classification number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

The Parliamentary Library of Taiwan's Legislative Yuan website provides a fair and objective channel for the public to track the daily activities of the Legislative Yuan and legislators' inquiries. However, the quantity of generated documents is so large that the general public may not be able to keep track of each legislator's legislative performance from these contents. To bridge the gap between legislative document generation and sense-making by the general public, this study proposes a text mining mechanism that automatically classifies the legislative documents referring to each legislator and then represents the proportion of their legislative performance in certain categories. Domain experts first defined a basic legislative category structure. A two-stage clustering was then applied to perform feature selection for legislative documents, and the SVM method was applied to build a model that classifies a new document into the appropriate category. To keep the classification categories up to date, the study also evaluates the difference between content labels assigned by domain experts and those assigned by the general public. Experimental results show the effectiveness of the proposed text mining mechanism, which automatically classifies legislative documents to reveal legislators' performance. With this result, people can monitor legislators and track their legislative activities using information from the Parliamentary Library of the Legislative Yuan, and update their perception of legislative performance in various categories.
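The final step described in the abstract, representing each legislator's activity as proportions across categories, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the classifier has already produced one predicted category per document, and the function name `category_proportions`, the legislator names, and the category labels are all hypothetical.

```python
from collections import Counter, defaultdict


def category_proportions(labeled_docs):
    """Map each legislator to {category: share of that legislator's documents}."""
    counts = defaultdict(Counter)
    for legislator, category in labeled_docs:
        counts[legislator][category] += 1

    proportions = {}
    for legislator, counter in counts.items():
        total = sum(counter.values())
        proportions[legislator] = {cat: n / total for cat, n in counter.items()}
    return proportions


# Hypothetical classifier output: (legislator, predicted category) pairs.
# Names and categories are illustrative only, not taken from the paper.
labeled_docs = [
    ("Legislator A", "education"),
    ("Legislator A", "education"),
    ("Legislator A", "finance"),
    ("Legislator A", "health"),
    ("Legislator B", "finance"),
    ("Legislator B", "finance"),
]

profile = category_proportions(labeled_docs)
for legislator, shares in profile.items():
    print(legislator, shares)
```

Normalizing by each legislator's own document count keeps profiles comparable between legislators who generate very different numbers of documents.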

Pages: 2199-2208

Page count: 10

