Building a Question-Answering Corpus Using Social Media and News Articles

被引:5
|
作者
Cavalin, Paulo [1 ]
Figueiredo, Flavio [1 ]
de Bayser, Maira [1 ]
Moyano, Luis [1 ]
Candello, Heloisa [1 ]
Appel, Ana [1 ]
Souza, Renan [1 ]
机构
[1] IBM Res, Sao Paulo, Brazil
关键词
Question and Answer; Social media; Finance;
D O I
10.1007/978-3-319-41552-9_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Is it possible to develop a reliable QA-Corpus using social media data? What are the challenges faced when attempting such a task? In this paper, we discuss these questions and present our findings when developing a QA-Corpus on the topic of Brazilian finance. In order to populate our corpus, we relied on opinions from experts on Brazilian finance that are active on the Twitter application. From these experts, we extracted information from news websites that are used as answers in the corpus. Moreover, to effectively provide rankings of answers to questions, we employ novel word vector based similarity measures between short sentences (that accounts for both questions and Tweets). We validated our methods on a recently released dataset of similarity between short Portuguese sentences. Finally, we also discuss the effectiveness of our approach when used to rank answers to questions from real users.
引用
收藏
页码:353 / 358
页数:6
相关论文
共 50 条
  • [1] NPCEditor: A Tool for Building Question-Answering Characters
    Leuski, Anton
    Traum, David
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2463 - 2470
  • [2] A question-answering system using argumentation
    Moreale, E
    Vargas-Vera, M
    MICAI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2004, 2972 : 400 - 409
  • [3] Use of credibility heuristics in a social question-answering service
    Matthews, Paul
    INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 2015, 20 (01):
  • [4] Archetypes of influential users in social question-answering sites
    Chen, Miaomiao
    Chua, Alton Y. K.
    An, Lu
    INTERNET RESEARCH, 2025, 35 (01) : 419 - 447
  • [5] Project PIAF: Building a Native French Question-Answering Dataset
    Keraron, Rachel
    Lancrenon, Guillaume
    Bras, Mathilde
    Allary, Frederic
    Moyse, Gilles
    Scialom, Thomas
    Soriano-Morales, Edmundo-Pavel
    Staiano, Jacopo
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5481 - 5490
  • [6] Using and Extending WordNet to Support Question-Answering
    Clark, Peter
    Fellbaum, Christiane
    Hobbs, Jerry
    GWC 2008: FOURTH GLOBAL WORDNET CONFERENCE, PROCEEDINGS, 2007, : 111 - 119
  • [7] External Commonsense Knowledge as a Modality for Social Intelligence Question-Answering
    Natu, Sanika
    Sural, Shounak
    Sarkar, Sulagna
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3036 - 3042
  • [8] Automatic Medical Knowledge Acquisition Using Question-Answering
    Pasche, Emilie
    Teodoro, Douglas
    Gobeill, Julien
    Ruch, Patrick
    Lovis, Christian
    MEDICAL INFORMATICS IN A UNITED AND HEALTHY EUROPE, 2009, 150 : 569 - 573
  • [9] Evaluation of an Integrated Authoring Tool for Building Advanced Question-Answering Characters
    Gandhe, Sudeep
    Rushforth, Michael
    Aggarwal, Priti
    Traum, David
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1296 - +
  • [10] Building an Annotated Corpus for Text Summarization and Question Answering
    Varasai, Patcharee
    Pechsiri, Chaveevan
    Sukvari, Thana
    Satayamas, Vee
    Kawtrakul, Asanee
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 3427 - 3434