Plagiarism Detection Using Machine Learning-Based Paraphrase Recognizer

被引:7
|
作者
Chitra, A. [2 ]
Rajkumar, Anupriya [1 ]
机构
[1] Dr Mahalingam Coll Engn & Technol, CSE Dept, Pollachi, Tamil Nadu, India
[2] PSG Coll Technol, Comp Applicat, Coimbatore, Tamil Nadu, India
关键词
Paraphrase recognition; passage-level plagiarism detection; support vector machine;
D O I
10.1515/jisys-2014-0146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Plagiarism in free text has become a common occurrence due to the wide availability of voluminous information resources. Automatic plagiarism detection systems aim to identify plagiarized content present in large repositories. This task is rendered difficult by the use of sophisticated plagiarism techniques such as paraphrasing and summarization, which mask the occurrence of plagiarism. In this work, a monolingual plagiarism detection technique has been developed to tackle cases of paraphrased plagiarism. A support vector machine based paraphrase recognition system, which works by extracting lexical, syntactic, and semantic features from input text has been used. Both sentence-level and passage-level approaches have been investigated. The performance of the system has been evaluated on various corpora, and the passage level approach has registered promising results.
引用
收藏
页码:351 / 359
页数:9
相关论文
共 50 条
  • [1] Machine Learning Models for Paraphrase Identification and its Applications on Plagiarism Detection
    Hunt, Ethan
    Janamsetty, Ritvik
    Kinares, Chanana
    Koh, Chanel
    Sanchez, Alexis
    Zhan, Felix
    Ozdemir, Murat
    Waseem, Shabnam
    Yolcu, Osman
    Dahal, Binay
    Zhan, Justin
    Gewali, Laxmi
    Oh, Paul
    2019 10TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK 2019), 2019, : 97 - 104
  • [2] Machine Learning-Based Detection of Ransomware Using SDN
    Cusack, Greg
    Michel, Oliver
    Keller, Eric
    PROCEEDINGS OF THE 2018 ACM INTERNATIONAL WORKSHOP ON SECURITY IN SOFTWARE DEFINED NETWORKS & NETWORK FUNCTION VIRTUALIZATION (SDN-NFVSEC'18), 2018, : 1 - 6
  • [3] BRAIN EMOTIONAL LEARNING-BASED PATTERN RECOGNIZER
    Lotfi, Ehsan
    Akbarzadeh-T, M. -R.
    CYBERNETICS AND SYSTEMS, 2013, 44 (05) : 402 - 421
  • [4] Paraphrase type identification for plagiarism detection using contexts and word embeddings
    Alvi, Faisal
    Stevenson, Mark
    Clough, Paul
    INTERNATIONAL JOURNAL OF EDUCATIONAL TECHNOLOGY IN HIGHER EDUCATION, 2021, 18 (01)
  • [5] Paraphrase type identification for plagiarism detection using contexts and word embeddings
    Faisal Alvi
    Mark Stevenson
    Paul Clough
    International Journal of Educational Technology in Higher Education, 18
  • [6] Effective detection of variable celestial objects using machine learning-based
    Chihara, N.
    Takata, T.
    Fujiwara, Y.
    Noda, K.
    Toyoda, K.
    Higuchi, K.
    Onizuka, M.
    ASTRONOMY AND COMPUTING, 2023, 45
  • [7] A Machine Learning-Based Detection of Earthquake Precursors Using Ionospheric Data
    Akyol, A. A.
    Arikan, O.
    Arikan, F.
    RADIO SCIENCE, 2020, 55 (11)
  • [8] Machine learning-based detection of freezing events using infrared thermography
    Shammi, Sayma
    Sohel, Ferdous
    Diepeveen, Dean
    Zander, Sebastian
    Jones, Michael G. K.
    Bekuma, Amanuel
    Biddulph, Ben
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 198
  • [9] Machine learning-based wavelength detection system
    Kwon, Ik-Hyun
    Choi, Yong-Joon
    Ide, Tomoya
    Noda, Toshihiko
    Takahashi, Kazuhiro
    Sawada, Kazuaki
    JAPANESE JOURNAL OF APPLIED PHYSICS, 2025, 64 (01)
  • [10] Machine learning-based phishing attack detection
    Hossain S.
    Sarma D.
    Chakma R.J.
    International Journal of Advanced Computer Science and Applications, 2020, 11 (09): : 378 - 388