Automatic Extraction of Explicit and Implicit Keywords to Build Document Descriptors

被引:0
|
作者
Ventura, Joao [1 ]
Silva, Joaquim [1 ]
机构
[1] Univ Nova Lisboa, CITI DI FCT, P-2829516 Caparica, Portugal
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keywords are single and multiword terms that describe the semantic content of documents. They are useful in many applications, such as document searching and indexing, or to be read by humans. Keywords can be explicit, by occurring in documents, or implicit, since, although not explicitly written in documents, they are semantically related to their contents. This paper presents a statistical approach to build document descriptors with explicit and implicit keywords automatically extracted from the documents. Our approach is language-independent and we show comparative results for three different European languages.
引用
收藏
页码:492 / 503
页数:12
相关论文
共 50 条
  • [1] Automatic keywords extraction of Chinese document using small world structure
    Zhu, MX
    Cai, Z
    Cai, QS
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 438 - 443
  • [2] Automatic extraction of keywords from abstracts
    HaCohen-Kerner, Y
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2003, 2773 : 843 - 849
  • [3] Automatic keywords extraction for Punjabi language
    Gupta, Vishal
    Lehal, Gurpreet Singh
    International Journal of Computer Science Issues, 2011, 8 (5 5-3): : 327 - 331
  • [4] Automatic extraction of keywords for the Portuguese language
    Lacerda Dias, Maria Abadia
    de Gomensoro Malheiros, Marcelo
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2006, 3960 : 204 - 207
  • [5] Document Classification Based on Metadata and Keywords Extraction
    Rezqa, Eman Y.
    Baraka, Rebhi S.
    2021 PALESTINIAN INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (PICICT 2021), 2021, : 18 - 24
  • [6] A novel machine extraction algorithm for implicit and explicit keywords based on dynamic web metadata of scientific scholars’ corpus
    Mosbah M.
    International Journal of Web Engineering and Technology, 2023, 18 (01) : 29 - 44
  • [7] Novel Approach for the Extraction of Keywords from Text Document
    Kulkarni, R. N.
    Koduri, Swetha
    2024 INTERNATIONAL CONFERENCE ON SOCIAL AND SUSTAINABLE INNOVATIONS IN TECHNOLOGY AND ENGINEERING, SASI-ITE 2024, 2024, : 266 - 271
  • [8] PAI: Automatic indexing for extracting asserted keywords from a document
    Matsumura, N
    Ohsawa, Y
    Ishizuka, M
    NEW GENERATION COMPUTING, 2003, 21 (01) : 37 - 47
  • [9] PAI: Automatic indexing for extracting asserted keywords from a document
    Naohiro Matsumura
    Yukio Ohsawa
    Mitsuru Ishizuka
    New Generation Computing, 2003, 21 : 37 - 47
  • [10] PAI: Automatic indexing for extracting asserted keywords from a document
    Matsumura, Naohiro
    Ohsawa, Yukio
    Ishizuka, Mitsuru
    New Gener Comput, 1600, 1 (37-47):