Human Experts' Perceptions of Auto-Generated Summarization Quality

被引:2
|
作者
Lotfigolian, Maryam [1 ]
Papanikolaou, Christos [1 ]
Taghizadeh, Samaneh [1 ]
Sandnes, Frode Eika [1 ]
机构
[1] Oslo Metropolitan Univ, Dept Comp Sci, N-0130 Oslo, Norway
关键词
Automatic summarization; User perception; Quality; Evaluation; Artificial intelligence; NLP; Language model; GPT-3; ChatGPT;
D O I
10.1145/3594806.3594828
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this study we addressed automatic summarizations generated using modern artificial intelligence techniques. Several mathematical methods for evaluating the performance of automatic summarization exist. Such methods are commonly used as they allowmany test cases to be assessed with little human effort as manual assessments are challenging and time consuming. One question is whether the output of such measures matches human perception of summarization quality. In this study we document a study involving the human evaluation of the automatic summarization of 22 academic texts. The unique aspect of this study is that our participants had strong familiarity with the texts as they had studied these texts in depth. The results are quite varied but do not give the impression of unanimous agreement that automatic summarizations are of high quality and are trusted.
引用
收藏
页码:95 / 98
页数:4
相关论文
共 50 条
  • [1] Evaluating the quality of the ontology-based auto-generated questions
    Demaidi M.N.
    Gaber M.M.
    Filer N.
    Smart Learning Environments, 4 (1)
  • [2] On the Naturalness of Auto-generated Code-Can We Identify Auto-Generated Code Automatically?-
    Doi, Masayuki
    Higo, Yoshiki
    Arima, Ryo
    Shimonaka, Kento
    Kusumoto, Shinji
    2018 IEEE/ACM 26TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2018), 2018, : 340 - 343
  • [3] Towards Auto-Generated Data Systems
    Cheung, Alvin
    Ahmad, Maaz Bin Safeer
    Haynes, Brandon
    Kittivorawong, Chanwut
    Laddad, Shadaj
    Liu, Xiaoxuan
    Wang, Chenglong
    Yan, Cong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (12): : 4116 - 4129
  • [4] Auto-generated Strokes for Motion Segmentation
    Tian, Zhiqiang
    Xue, Jianru
    Li, Ce
    Lan, Xuguang
    Zheng, Nanning
    2011 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2011, : 857 - 860
  • [5] An Evaluation Model for Auto-generated Cognitive Scripts
    ELMougi, Ahmed M.
    Omar, Yasser M. K.
    Hodhod, Rania
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (08) : 333 - 340
  • [6] Mining Auto-Generated Test Inputs for Test Oracle
    Xu, Weifeng
    Wang, Hanlin
    Ding, Tao
    PROCEEDINGS OF THE 2013 10TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, 2013, : 89 - 94
  • [7] Bitext Dependency Parsing With Auto-Generated Bilingual Treebank
    Chen, Wenliang
    Kazama, Jun'ichi
    Zhang, Min
    Tsuruoka, Yoshimasa
    Zhang, Yujie
    Wang, Yiou
    Torisawa, Kentaro
    Li, Haizhou
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (05): : 1461 - 1472
  • [8] Auto-Generated Summaries for Stochastic Radio Channel Models
    Bharti, Ayush
    Adeogun, Ramoni
    Pedersen, Troels
    2021 15TH EUROPEAN CONFERENCE ON ANTENNAS AND PROPAGATION (EUCAP), 2021,
  • [9] A database of battery materials auto-generated using ChemDataExtractor
    Huang, Shu
    Cole, Jacqueline M.
    SCIENTIFIC DATA, 2020, 7 (01)
  • [10] A Clustering Approach for Detecting Auto-generated Botnet Domains
    Pu, Yang
    Chen, Xiaojun
    Pu, Yiguo
    Shi, JinQiao
    APPLICATIONS AND TECHNIQUES IN INFORMATION SECURITY, ATIS 2015, 2015, 557 : 269 - 279