LVTIA: A new method for keyphrase extraction from scientific video lectures

Cited by: 11
Authors
Hassani, Hamid [1]
Ershadi, Mohammad Javad [1]
Mohebi, Azadeh [1]
Affiliation
[1] Iranian Res Inst Informat Sci & Technol IranDoc, Informat Technol Res Dept, Tehran, Iran
Keywords
Multimedia indexing; Video lecture indexing; Text mining; Keyword extraction; Keyphrase extraction
DOI
10.1016/j.ipm.2021.102802
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Due to the growth of technology, the expansion of communication infrastructure, and the COVID-19 pandemic crisis, e-learning and virtual education are expanding. One of the best ways to access and organize this information is indexing with automatic intelligent methods. Indexing requires assigning keywords or keyphrases to each video to represent its content. The main focus of this research is to propose an approach for assigning appropriate keyphrases to scientific video lectures. For this purpose, a new algorithm called LVTIA (Lecture Video Text mining-based Indexing Algorithm) is proposed, in which the textual content of video frames is merged with the text extracted from the audio signal, and a new keyphrase extraction method is introduced. The proposed method considers new local and global features for each candidate phrase, along with a new feature reflecting the occurrence of each phrase in the audio signal or video frames. The method is applied to five distinct data sets in English and Persian. The results are evaluated using precision, recall, F1-measure, and MAP@K, and compared with several well-known keyphrase extraction algorithms. LVTIA achieves the best MAP@K for English videos, with values of 0.7912, 0.8069, and 0.8069 for k = 5, 10, and 15, respectively. LVTIA also achieves the best MAP@K for Persian videos, with values of 0.6367, 0.6866, and 0.6874 for k = 5, 10, and 15, respectively. According to the Friedman nonparametric statistical test, the performance of the other algorithms in precision, recall, and F1-measure is also statistically different from that of LVTIA.
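For readers unfamiliar with the evaluation metrics named in the abstract, the following is a minimal Python sketch of precision, recall, F1-measure, and MAP@K for ranked keyphrase lists. It is not the authors' code: the function names, the exact-string matching, and the normalization of average precision by min(K, number of gold keyphrases) are assumptions made for illustration, and the paper may define MAP@K differently.

```python
from typing import List, Sequence, Set, Tuple


def precision_recall_f1(predicted: Sequence[str], gold: Set[str], k: int) -> Tuple[float, float, float]:
    """Precision, recall and F1 of the top-k predicted phrases (exact match, an assumption)."""
    top_k = set(predicted[:k])
    true_positives = len(top_k & gold)
    precision = true_positives / len(top_k) if top_k else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def average_precision_at_k(predicted: Sequence[str], gold: Set[str], k: int) -> float:
    """AP@K for one document: mean of precision at each rank where a gold phrase appears.

    Normalized by min(k, |gold|), one common convention (assumed here).
    """
    if not gold or k <= 0:
        return 0.0
    hits = 0
    score = 0.0
    seen: Set[str] = set()
    for rank, phrase in enumerate(predicted[:k], start=1):
        # Count each gold phrase once, even if the ranking repeats it.
        if phrase in gold and phrase not in seen:
            hits += 1
            score += hits / rank
        seen.add(phrase)
    return score / min(k, len(gold))


def map_at_k(all_predicted: List[Sequence[str]], all_gold: List[Set[str]], k: int) -> float:
    """MAP@K: AP@K averaged over all documents (here, video lectures)."""
    pairs = list(zip(all_predicted, all_gold))
    if not pairs:
        return 0.0
    return sum(average_precision_at_k(p, g, k) for p, g in pairs) / len(pairs)


if __name__ == "__main__":
    # Hypothetical ranked output and gold keyphrases for a single lecture.
    ranked = ["video indexing", "text mining", "neural network", "keyphrase extraction"]
    gold = {"text mining", "keyphrase extraction", "video indexing"}
    for k in (3, 4):
        print(k, precision_recall_f1(ranked, gold, k), map_at_k([ranked], [gold], k))
```

Under this convention a system is rewarded for placing correct keyphrases early in the ranking, which is why MAP@K can stay flat between k = 10 and k = 15 when no further correct phrases appear at lower ranks.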
Pages: 21
Related papers
50 in total
  • [21] TopicLPRank: a keyphrase extraction method based on improved TopicRank
    Liao, Shengbin
    Yang, Zongkai
    Liao, Qingzhou
    Zheng, Zhangxiong
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (08): : 9073 - 9092
  • [22] Deep neural model with self-training for scientific keyphrase extraction
    Zhu, Xun
    Lyu, Chen
    Ji, Donghong
    Liao, Han
    Li, Fei
    PLOS ONE, 2020, 15 (05):
  • [23] TopicLPRank: a keyphrase extraction method based on improved TopicRank
    Shengbin Liao
    Zongkai Yang
    Qingzhou Liao
    Zhangxiong Zheng
    The Journal of Supercomputing, 2023, 79 : 9073 - 9092
  • [24] Automatic keyphrase extraction from Chinese books
    Chen, Yijiang
    Shi, Xiaodong
    Zhou, Changle
    Su, Chang
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 92 - +
  • [25] Automatic Keyphrase Extraction from Medical Documents
    Sarkar, Kamal
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 273 - 278
  • [26] A New Scheme for Scoring Phrases in Unsupervised Keyphrase Extraction
    Florescu, Corina
    Caragea, Cornelia
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 477 - 483
  • [27] Automated annotation of scientific texts for ML-based keyphrase extraction and validation
    Amusat, Oluwamayowa O.
    Hegde, Harshad
    Mungall, Christopher J.
    Giannakou, Anna
    Byers, Neil P.
    Gunter, Dan
    Fagnan, Kjiersten
    Ramakrishnan, Lavanya
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2024, 2024
  • [28] An N-Gram Based Method for Bengali Keyphrase Extraction
    Sarkar, Kamal
    INFORMATION SYSTEMS FOR INDIAN LANGUAGES, 2011, 139 : 36 - 41
  • [29] Automatic Keyphrase Extraction from Scientific Chinese Medical Abstracts Based on Character-Level Sequence Labeling
    Ding, Liangping
    Zhang, Zhixiong
    Liu, Huan
    Li, Jie
    Yu, Gaihong
    JOURNAL OF DATA AND INFORMATION SCIENCE, 2021, 6 (03) : 35 - 57
  • [30] Experiment Research on Feature Selection and Learning Method in Keyphrase Extraction
    Wang, Chen
    Li, Sujian
    Wang, Wei
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES: LANGUAGE TECHNOLOGY FOR THE KNOWLEDGE-BASED ECONOMY, 2009, 5459 : 305 - 312