Maintaining Academic Integrity in Programming: Locality-Sensitive Hashing and Recommendations

被引:5
|
作者
Karnalim, Oscar [1 ]
机构
[1] Maranatha Christian Univ, Fac Informat Technol, Bandung 40164, Indonesia
来源
EDUCATION SCIENCES | 2023年 / 13卷 / 01期
关键词
programming; plagiarism; collusion; similarity detection; recommendations; higher education; CODE PLAGIARISM DETECTION;
D O I
10.3390/educsci13010054
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Not many efficient similarity detectors are employed in practice to maintain academic integrity. Perhaps it is because they lack intuitive reports for investigation, they only have a command line interface, and/or they are not publicly accessible. This paper presents SSTRANGE, an efficient similarity detector with locality-sensitive hashing (MinHash and Super-Bit). The tool features intuitive reports for investigation and a graphical user interface. Further, it is accessible on GitHub. SSTRANGE was evaluated on the SOCO dataset under two performance metrics: f-score and processing time. The evaluation shows that both MinHash and Super-Bit are more efficient than their predecessors (Cosine and Jaccard with 60% less processing time) and a common similarity measurement (running Karp-Rabin greedy string tiling with 99% less processing time). Further, the effectiveness trade-off is still reasonable (no more than 24%). Higher effectiveness can be obtained by tuning the number of clusters and stages. To encourage the use of automated similarity detectors, we provide ten recommendations for instructors interested in employing such detectors for the first time. These include consideration of assessment design, irregular patterns of similarity, multiple similarity measurements, and effectiveness-efficiency trade-off. The recommendations are based on our 2.5-year experience employing similarity detectors (SSTRANGE's predecessors) in 13 course offerings with various assessment designs.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Batch-Orthogonal Locality-Sensitive Hashing for Angular Similarity
    Ji, Jianqiu
    Yan, Shuicheng
    Li, Jianmin
    Gao, Guangyu
    Tian, Qi
    Zhang, Bo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (10) : 1963 - 1974
  • [42] An improved method of locality-sensitive hashing for scalable instance matching
    Aydar, Mehmet
    Ayvaz, Serkan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 58 (02) : 275 - 294
  • [43] Speeding up probabilistic roadmap planners with locality-sensitive hashing
    Rantanen, Mika T.
    Juhola, Martti
    ROBOTICA, 2015, 33 (07) : 1491 - 1506
  • [44] Cryptographically Secure Private Record Linkage Using Locality-Sensitive Hashing
    Wei, Ruidi
    Kerschbaum, Florian
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 17 (02): : 79 - 91
  • [45] Stratified Locality-Sensitive Hashing for Accelerated Physiological Time Series Retrieval
    Kim, Yongwook Bryce
    Hemberg, Erik
    O'Reilly, Una-May
    2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 2479 - 2483
  • [46] Fast agglomerative hierarchical clustering algorithm using Locality-Sensitive Hashing
    Koga, Hisashi
    Ishibashi, Tetsuo
    Watanabe, Toshinori
    KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 12 (01) : 25 - 53
  • [47] Locality-Sensitive Hashing for Information Retrieval System on Multiple GPGPU Devices
    Toan Nguyen Mau
    Inoguchi, Yasushi
    APPLIED SCIENCES-BASEL, 2020, 10 (07):
  • [48] Trust-Aware Hybrid Collaborative Recommendation with Locality-Sensitive Hashing
    Li, Dejuan
    Esquivel, James A.
    TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (04): : 1421 - 1434
  • [49] Fast Locality-Sensitive Hashing Frameworks for Approximate Near Neighbor Search
    Christiani, Tobias
    SIMILARITY SEARCH AND APPLICATIONS (SISAP 2019), 2019, 11807 : 3 - 17
  • [50] Locality-Sensitive Hashing Scheme Based on Heap Sort of Hash Bucket
    Fang, Bo
    Hua, Zhongyun
    Huang, Hejiao
    14TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2019), 2019, : 5 - 10