Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection

被引:17
|
作者
Jiang, Youxuan [1 ]
Kummerfeld, Jonathan K. [1 ]
Lasecki, Walter S. [1 ]
机构
[1] Univ Michigan, Comp Sci & Engn, Ann Arbor, MI 48109 USA
关键词
D O I
10.18653/v1/P17-2017
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Linguistically diverse datasets are critical for training and evaluating robust machine learning systems, but data collection is a costly process that often requires experts. Crowdsourcing the process of paraphrase generation is an effective means of expanding natural language datasets, but there has been limited analysis of the trade-offs that arise when designing tasks. In this paper, we present the first systematic study of the key factors in crowdsourcing paraphrase collection. We consider variations in instructions, incentives, data domains, and workflows. We manually analyzed paraphrases for correctness, grammaticality, and linguistic diversity. Our observations provide new insight into the trade-offs between accuracy and diversity in crowd responses that arise as a result of task design, providing guidance for future paraphrase generation procedures.
引用
收藏
页码:103 / 109
页数:7
相关论文
共 50 条
  • [21] TRADE-OFFS KEY IN TIRE DESIGN
    SZIGETHY, NM
    AUTOMOTIVE INDUSTRIES, 1982, 162 (12): : 15 - 18
  • [22] DESIGN TRADE-OFFS IN AVAILABILITY WARRANTIES
    MARSHALL, CW
    PROCEEDINGS ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 1981, (NSYM): : 95 - 100
  • [23] SYSTEM DESIGN MEANS TRADE-OFFS
    JURISON J
    Electronic Design, 1970, 18 (07): : 60 - 64
  • [24] DESIGN TRADE-OFFS IN THYRISTORS.
    Smith, C.J.
    New Electronics, 1977, 10 (04):
  • [25] Interactive Exploration of Design Trade-Offs
    Schulz, Adriana
    Wang, Harrison
    Grinspun, Eitan
    Solomon, Justin
    Matusik, Wojciech
    ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):
  • [26] Trade-offs in stimulus control in a temporal discrimination task
    Pinto, Carlos
    Machado, Armando
    LEARNING AND MOTIVATION, 2023, 84
  • [27] Understanding the Design Trade-offs among Current Multicore Systems for Numerical Computations
    Kang, Seunghwa
    Bader, David A.
    Vuduc, Richard
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 808 - 819
  • [28] Understanding The Trade-Offs In Multi-Level Cell ReRAM Memory Design
    Xu, Cong
    Niu, Dimin
    Muralimanohar, Naveen
    Jouppi, Norman P.
    Xie, Yuan
    2013 50TH ACM / EDAC / IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2013,
  • [29] Trade-offs
    Garland, Theodore, Jr.
    CURRENT BIOLOGY, 2014, 24 (02) : R60 - R61
  • [30] Trade-offs Between Query Difficulty and Sample Complexity in Crowdsourced Data Acquisition
    Chung, Hye Won
    Lee, Ji Oon
    Kim, Doyeon
    Hero, Alfred O.
    2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 639 - 646