Topic Modeling as a Method of Educational Text Structuring

Cited by: 2
Authors
Sakhovskiy, Andrey [1 ]
Tutubalina, Elena [2 ]
Solovyev, Valery [1 ]
Solnyshkina, Marina [3 ]
Affiliations
[1] Kazan Fed Univ, Res Lab Intellectual Technol Text Management, Kazan, Russia
[2] Natl Res Univ Higher Sch Econ, Lab Models & Methods Computat Pragmat, Moscow, Russia
[3] Kazan Fed Univ, Dept Theory & Practice Language Teaching, Res Lab Intellectual Technol Text Management, Kazan, Russia
Funding
Russian Science Foundation
Keywords
Text structure; school textbooks; topic modeling
DOI
10.1109/DeSE51703.2020.9450232
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This article explores the problems of assigning documents to a limited number of topics and of automating the topic structuring of Russian educational texts. For this purpose, we compiled an original corpus of school textbooks on Social Science. We used the Latent Dirichlet Allocation model to select and compare topics in the textbooks of different grades. This approach allows the reconstruction of the matrix of topics for each textbook in the corpus. The research demonstrated the grade-ranked character of the topics in the text collection under study; in particular, topics are more cohesive in high school textbooks. The research also offers an innovative methodology for quantitatively describing topic dynamics in the textbook collection. It allows visualization and comparison of the strategies that different authors use to present educational topics. The results can benefit textbook writers as well as teachers and schoolchildren.
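The abstract names Latent Dirichlet Allocation and a per-textbook matrix of topics but gives no implementation detail. The sketch below is only an illustration of how a document-topic matrix can be estimated. It fits LDA by collapsed Gibbs sampling, which is one common inference scheme and is not necessarily the one the authors used. The function name `lda_gibbs`, the hyperparameters `alpha` and `beta`, and the toy English tokens are all assumptions made for this example; they do not come from the paper or its corpus.

```python
import random


def lda_gibbs(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA by collapsed Gibbs sampling (illustrative sketch).

    docs: list of token lists. Returns the document-topic matrix,
    one row of topic proportions per document.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)

    doc_topic = [[0] * n_topics for _ in docs]            # tokens of doc d in topic k
    topic_word = [[0] * n_words for _ in range(n_topics)]  # tokens of word v in topic k
    topic_total = [0] * n_topics                           # all tokens in topic k
    assignments = []

    # Start from a random topic for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Unnormalized conditional probability of each topic.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    # Smoothed topic proportions per document.
    theta = []
    for d, doc in enumerate(docs):
        denom = len(doc) + n_topics * alpha
        theta.append([(doc_topic[d][t] + alpha) / denom for t in range(n_topics)])
    return theta


# Toy stand-ins for tokenized textbook fragments (hypothetical data).
docs = [
    ["law", "state", "court", "law", "citizen"],
    ["state", "citizen", "law", "court", "court"],
    ["market", "price", "money", "market", "trade"],
    ["money", "trade", "price", "market", "price"],
]
theta = lda_gibbs(docs, n_topics=2)
for row in theta:
    print([round(p, 2) for p in row])
```

Each row of `theta` is one document's distribution over topics, so stacking the rows for the fragments of a single textbook gives a matrix of the kind the abstract describes. A real study would use lemmatized Russian text and a tested library implementation rather than this sampler.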
Pages: 399-405
Number of pages: 7