Text Reuse Detection by Keyword Extraction for Telegram Channels

被引:0
|
作者
Saki, Misam [1 ]
Faili, Heshaam [1 ]
Asadpour, Masoud [1 ]
机构
[1] Univ Tehran, Sch Elect & Comp Engn, Tehran, Iran
关键词
Text Reuse; Text Similarity; Text Clustering; Keyword Extraction; Telegram;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Text reuse detection is the task of finding similar texts, which has many applications e.g. in plagiarism detection and analysis of information diffusion. The general approach to this problem is by detection of texts similarities in combination with other features such as time-stamp which can be used to specify the precedence of publishers e.g. to find the first publisher. In this article a method for finding similar texts has been proposed using keyword extraction which operates in linear time like LSH1 method. In addition, it supports dynamic inputs and does not depend on text vector dimensions. Our evaluations show, it has better performance in clustering quality measures and also run time.
引用
收藏
页码:1481 / 1484
页数:4
相关论文
共 50 条
  • [31] Text Summarization with Automatic Keyword Extraction in Telugu e-Newspapers
    Naidu, Reddy
    Bharti, Santosh Kumar
    Babu, Korra Sathya
    Mohapatra, Ramesh Kumar
    SMART COMPUTING AND INFORMATICS, 2018, 77 : 555 - 564
  • [32] A Feature Extraction Method Using Base Phrase and keyword In Chinese Text
    Li, Xin-fu
    Zhao, Lei-lei
    Wu, Li-hong
    2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 680 - +
  • [33] Incorporating keyword extraction and attention for multi-label text classification
    Zhao, Hua
    Li, Xiaoqian
    Wang, Fengling
    Zeng, Qingtian
    Diao, Xiuli
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (02) : 2083 - 2093
  • [34] EFFICIENT KEYWORD EXTRACTION AND TEXT SUMMARIZATION FOR READING ARTICLES ON SMART PHONE
    Jeong, Hyoungil
    Ko, Youngjoong
    Seo, Jungyun
    COMPUTING AND INFORMATICS, 2015, 34 (04) : 779 - 794
  • [35] Keyword Combination Extraction in Text Categorization Based on Ant Colony Optimization
    Yu, Zi-jun
    Wu, Wei-gang
    Xiao, Jing
    Zhang, Jun
    Huang, Rui-Zhang
    Liu, Ou
    2009 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION, 2009, : 430 - +
  • [36] Performance Analysis of Keyword Extraction Algorithms Assessing Extractive Text Summarization
    Kumar, Akshi
    Sharma, Aditi
    Sharma, Sidhant
    Kashyap, Shashwat
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS AND ELECTRONICS (COMPTELIX), 2017, : 408 - 414
  • [37] Keyword Based Tweet Extraction and Detection of Related Topics
    Benny, Amrutha
    Philip, Mintu
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 364 - 371
  • [38] Overview of Fingerprinting Methods for Local Text Reuse Detection
    Lulu, Leena
    Belkhouche, Boumediene
    Harous, Saad
    PROCEEDINGS OF THE 2016 12TH INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY (IIT), 2016, : 191 - 196
  • [39] Automatic Keyword Extraction from Bengali Text using Improved RAKE Approach
    Haque, Mozammel
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [40] Uyghur-Kazakh-Kirghiz Text Keyword Extraction Based on Morpheme Segmentation
    Parhat, Sardar
    Sattar, Mutallip
    Hamdulla, Askar
    Kadir, Abdurahman
    INFORMATION, 2023, 14 (05)