Topic Modeling as a Method of Educational Text Structuring

被引:2
|
作者
Sakhovskiy, Andrey [1 ]
Tutubalina, Elena [2 ]
Solovyev, Valery [1 ]
Solnyshkina, Marina [3 ]
机构
[1] Kazan Fed Univ, Res Lab Intellectual Technol Text Management, Kazan, Russia
[2] Natl Res Univ Higher Sch Econ, Lab Models & Methods Computat Pragmat, Moscow, Russia
[3] Kazan Fed Univ, Dept Theory & Practice Language Teaching, Res Lab Intellectual Technol Text Management, Kazan, Russia
基金
俄罗斯科学基金会;
关键词
Text structure; school textbooks; Topic Modeling;
D O I
10.1109/DeSE51703.2020.9450232
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article explores the problems of assigning documents to a limited number of topics and automating the process of topic structuring of Russian educational texts. For this purpose, we compiled an original corpus of school textbooks on Social Science. We utilized the Latent Dirichlet Allocation model for selection and comparative analysis of topics in the textbooks of different grades. This approach allows the reconstruction of the matrix of topics for each textbook in the.orpus. The research demonstrated a grade ranked character of the topics in the text collection under study, in particular, there is a higher cohesion of topics in high school. The research also offers an innovative methodology of quantitative describing topics dynamics in the textbook collection. It allows visualization and comparison of strategies for presenting educational topics by different authors. The results received can be beneficial for both textbook writers as well as teachers and schoolchildren.
引用
收藏
页码:399 / 405
页数:7
相关论文
共 50 条
  • [31] Incorporating word embeddings into topic modeling of short text
    Gao, Wang
    Peng, Min
    Wang, Hua
    Zhang, Yanchun
    Xie, Qianqian
    Tian, Gang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 61 (02) : 1123 - 1145
  • [32] Topic modeling for large-scale text data
    Li, Xi-ming
    Ouyang, Ji-hong
    Lu, You
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2015, 16 (06) : 457 - 465
  • [33] Topic modeling for large-scale text data
    Xi-ming Li
    Ji-hong Ouyang
    You Lu
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 457 - 465
  • [34] Feature selection for text data via topic modeling
    Jang, Woosol
    Kim, Ye Eun
    Son, Won
    KOREAN JOURNAL OF APPLIED STATISTICS, 2022, 35 (06) : 739 - 754
  • [35] Topic Modeling for Interpretable Text Classification From EHRs
    Rijcken, Emil
    Kaymak, Uzay
    Scheepers, Floortje
    Mosteiro, Pablo
    Zervanou, Kalliopi
    Spruit, Marco
    FRONTIERS IN BIG DATA, 2022, 5
  • [36] Comparative Text Analytics via Topic Modeling in Banking
    Chen, Yu
    Rabbani, Rhaad M.
    Gupta, Aparna
    Zaki, Mohammed J.
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 306 - 313
  • [37] A general framework to expand short text for topic modeling
    Bicalho, Paulo
    Pita, Marcelo
    Pedrosa, Gabriel
    Lacerda, Anisio
    Pappa, Gisele L.
    INFORMATION SCIENCES, 2017, 393 : 66 - 81
  • [38] Topic Modeling for Assessment of Text Complexity in Russian Textbooks
    Sakhovskiy, Andrey
    Solovyev, Valery
    Solnyshkina, Marina
    2020 IVANNIKOV ISPRAS OPEN CONFERENCE (ISPRAS 2020), 2020, : 102 - 108
  • [39] Topic Modeling and Text Analysis for Qualitative Policy Research
    Isoaho, Karoliina
    Gritsenko, Daria
    Makela, Eetu
    POLICY STUDIES JOURNAL, 2021, 49 (01) : 300 - 324
  • [40] Incorporating word embeddings into topic modeling of short text
    Wang Gao
    Min Peng
    Hua Wang
    Yanchun Zhang
    Qianqian Xie
    Gang Tian
    Knowledge and Information Systems, 2019, 61 : 1123 - 1145