Efficiency of automatic text generators for online review content generation

被引:7
|
作者
Perez-Castro, A. [1 ]
Martinez-Torres, M. R. [1 ]
Toral, S. L. [2 ]
机构
[1] Univ Seville Spain, Fac Ciencias Econ & Empresariales, Ave Ramon & Cajal 1, Seville 41018, Spain
[2] Univ Seville Spain, ETS Ingn, Avda Camino Descubrimientos S-N, Seville 41092, Spain
关键词
Deceptive reviews generation; Word-based encoding; Context-based encoding; Pretrained models; Transfer learning; PRODUCT;
D O I
10.1016/j.techfore.2023.122380
中图分类号
F [经济];
学科分类号
02 ;
摘要
The evolution of Artificial Intelligence has led to the appearance of automatic text generators able to closely resemble human writing, endangering the development of e-commerce and the consumer confidence. Thus, it is critical to deeply understand how these text generators work to present the presence of deceptive reviews. This paper analyzes one of the most popular text generators, GPT2 (Generative Pre-trained Transformer 2), and studies its effectivity compared to human-generated reviews using previously published classifiers trained to distinguish between real and deceptive reviews. One parameter of the model is the so-called temperature, which determines how deterministic the model is. The temperature adjusts the probability distribution of the words in the model, so that a higher temperature translates into a higher degree of inventiveness in the generation of the texts. Findings reveal (i) that automatically-generated deceptive reviews worsen the accuracy of existing classifiers, this effect being accentuated by the degree of inventiveness; (ii) that their performance depends on the data used to train the generator; and (iii) that the sentiment polarity has no effect on the performance of detection classifiers.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] GReAT A Model for the Automatic Generation of Text Summaries
    Gomez Puyana, Claudia
    Pomares Quimbaya, Alexandra
    ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2013, : 280 - 288
  • [22] Automatic Debate Text Summarization in Online Debate Forum
    Chowanda, Alan Darmasaputra
    Sanyoto, Albert Richard
    Suhartono, Derwin
    Setiadi, Criscentia Jessica
    DISCOVERY AND INNOVATION OF COMPUTER SCIENCE TECHNOLOGY IN ARTIFICIAL INTELLIGENCE ERA, 2017, 116 : 11 - 19
  • [23] A Framework for the Automatic Extraction of Rules from Online Text
    Hassanpour, Saeed
    O'Connor, Martin J.
    Das, Amar K.
    RULE-BASED REASONING, PROGRAMMING, AND APPLICATIONS, 2011, 6826 : 266 - 280
  • [24] An Automatic Text Summarization: A Systematic Review
    Patel, Vishwa
    Tabrizi, Nasseh
    COMPUTACION Y SISTEMAS, 2022, 26 (03): : 1259 - 1267
  • [25] An evaluation of automatic text categorization in online discussion analysis
    Lui, Andrew Kwok-Fai
    Li, Siu Cheung
    Choy, Sheung On
    7TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, PROCEEDINGS, 2007, : 205 - +
  • [26] A Comprehensive Review on Automatic Text Summarization
    Akhmetov, Iskander
    Nurlybayeva, Sabina
    Ualiyeva, Irina
    Pak, Alexandr
    Gelbukh, Alexander
    COMPUTACION Y SISTEMAS, 2023, 27 (04): : 1203 - 1240
  • [27] Automatic Personalized Marathi Content Generation
    Vispute, Sushma Rahul
    Kanthekar, Siddheshwar
    Kadam, Abhijeet
    Kunte, Chaitanya
    Kadam, Prajakta
    2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 294 - 299
  • [28] A Review on Detection of Online Abusive Text
    Jain, Bhumika
    Bekal, Chaithra
    PavanKumar, S. P.
    INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES, ICICCT 2019, 2020, 89 : 781 - 787
  • [29] Automatic Text Summarization of Biomedical Text Data: A Systematic Review
    Chaves, Andrea
    Kesiku, Cyrille
    Garcia-Zapirain, Begonya
    INFORMATION, 2022, 13 (08)
  • [30] Automatic Classification of Project Documents on the Basis of Text Content
    Al Qady, Mohammed
    Kandil, Amr
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2015, 29 (03)