Text Reuse Detection by Keyword Extraction for Telegram Channels

Authors
Saki, Misam [1]
Faili, Heshaam [1]
Asadpour, Masoud [1]
Affiliations
[1] Univ Tehran, Sch Elect & Comp Engn, Tehran, Iran
Keywords
Text Reuse; Text Similarity; Text Clustering; Keyword Extraction; Telegram;
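DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification codes
0808; 0809;
Abstract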
Text reuse detection is the task of finding similar texts, with applications such as plagiarism detection and the analysis of information diffusion. The general approach combines text-similarity detection with other features, such as timestamps, which can establish the precedence of publishers, e.g. to identify the first publisher. This article proposes a method for finding similar texts based on keyword extraction; like the LSH method, it runs in linear time. In addition, it supports dynamic inputs and does not depend on the dimensionality of text vectors. Our evaluations show that it performs better on clustering quality measures and in run time.
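
The idea of grouping similar texts by their keywords in a single pass can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify. It assumes keywords are simply the k most frequent non-stop-word tokens, and that two texts count as similar when their keyword sets match exactly. The names STOPWORDS, keyword_signature, and KeywordBuckets are hypothetical.

```python
import re
from collections import Counter, defaultdict

# Hypothetical, tiny stop-word list for illustration only.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "to", "is", "for"}


def keyword_signature(text, k=3):
    """Return the k most frequent non-stop-word tokens as a hashable key.

    Assumption: plain term frequency stands in for the keyword
    extraction step, which the abstract does not describe.
    """
    tokens = [t for t in re.findall(r"\w+", text.lower()) if t not in STOPWORDS]
    counts = Counter(tokens)
    # Sort by descending frequency, then alphabetically, so ties are deterministic.
    top = sorted(counts, key=lambda w: (-counts[w], w))[:k]
    return frozenset(top)


class KeywordBuckets:
    """Groups texts whose keyword signatures match exactly."""

    def __init__(self, k=3):
        self.k = k
        self.buckets = defaultdict(list)

    def add(self, text_id, text):
        # One signature computation and one hash lookup per text,
        # so new texts can be added at any time without reprocessing old ones.
        key = keyword_signature(text, self.k)
        self.buckets[key].append(text_id)
        return key

    def clusters(self):
        # Only buckets holding more than one text indicate possible reuse.
        return [ids for ids in self.buckets.values() if len(ids) > 1]


index = KeywordBuckets(k=3)
index.add("post-1", "Earthquake hits Tehran, earthquake damage reported in Tehran")
index.add("post-2", "Damage reported: earthquake in Tehran. Tehran earthquake!")
index.add("post-3", "Football match results announced")
print(index.clusters())  # [['post-1', 'post-2']]
```

Because each text is hashed into a bucket once, the work grows linearly with the number of texts, and no fixed-length text vector is ever built. Exact signature matching is a deliberate simplification; a practical system would likely tolerate partial keyword overlap.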
Pages: 1481-1484
Page count: 4