Plagiarism Detection Using Machine Learning-Based Paraphrase Recognizer

被引:7
|
作者
Chitra, A. [2 ]
Rajkumar, Anupriya [1 ]
机构
[1] Dr Mahalingam Coll Engn & Technol, CSE Dept, Pollachi, Tamil Nadu, India
[2] PSG Coll Technol, Comp Applicat, Coimbatore, Tamil Nadu, India
关键词
Paraphrase recognition; passage-level plagiarism detection; support vector machine;
D O I
10.1515/jisys-2014-0146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Plagiarism in free text has become a common occurrence due to the wide availability of voluminous information resources. Automatic plagiarism detection systems aim to identify plagiarized content present in large repositories. This task is rendered difficult by the use of sophisticated plagiarism techniques such as paraphrasing and summarization, which mask the occurrence of plagiarism. In this work, a monolingual plagiarism detection technique has been developed to tackle cases of paraphrased plagiarism. A support vector machine based paraphrase recognition system, which works by extracting lexical, syntactic, and semantic features from input text has been used. Both sentence-level and passage-level approaches have been investigated. The performance of the system has been evaluated on various corpora, and the passage level approach has registered promising results.
引用
收藏
页码:351 / 359
页数:9
相关论文
共 50 条
  • [41] Machine Learning-Based Method for Structural Damage Detection
    Irawan, Daniel
    Morozov, Evgeny V.
    Tahtali, Murat
    DATA SCIENCE IN ENGINEERING, VOL. 10, IMAC 2024, 2025, : 137 - 142
  • [42] Supervised Machine Learning-Based Detection of Concrete Efflorescence
    Fan, Ching-Lung
    Chung, Yu-Jen
    SYMMETRY-BASEL, 2022, 14 (11):
  • [43] Machine Learning-based Anomaly Detection for Particle Accelerators
    Marcato, Davide
    Arena, Giovanni
    Bortolato, Damiano
    Gelain, Fabio
    Martinelli, Valentina
    Munaron, Enrico
    Roetta, Marco
    Savarese, Giovanni
    Susto, Gian Antonio
    5TH IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (IEEE CCTA 2021), 2021, : 240 - 246
  • [44] Machine Learning-Based Attack Detection Method in Hadoop
    Li, Ningwei
    Gao, Hang
    Liu, Liang
    Peng, Jianfei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT III, 2020, 12454 : 184 - 196
  • [45] Machine Learning-Based Malicious Application Detection of Android
    Wei, Linfeng
    Luo, Weiqi
    Weng, Jian
    Zhong, Yanjun
    zhang, Xiaoqian
    Yan, Zheng
    IEEE ACCESS, 2017, 5 : 25591 - 25601
  • [46] Phishing Attacks Detection A Machine Learning-Based Approach
    Salahdine, Fatima
    El Mrabet, Zakaria
    Kaabouch, Naima
    2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 250 - 255
  • [47] Machine Learning-Based Multilevel Intrusion Detection Approach
    Ling, Jiasheng
    Zhang, Lei
    Liu, Chenyang
    Xia, Guoxin
    Zhang, Zhenxiong
    ELECTRONICS, 2025, 14 (02):
  • [48] Machine learning-based algorithmically generated domain detection?
    Wang, Zheng
    Guo, Yang
    Montgomery, Doug
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [49] Recent advancements in machine learning and deep learning-based breast cancer detection using mammograms
    Sahu, Adyasha
    Das, Pradeep Kumar
    Meher, Sukadev
    PHYSICA MEDICA-EUROPEAN JOURNAL OF MEDICAL PHYSICS, 2023, 114
  • [50] Machine Learning based Paraphrase Identification System using Lexical Syntactic Features
    Mahajan, Rutal S.
    Zaveri, Mukesh A.
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH, 2016, : 830 - 834