A Natural Language Processing Model for the Development of an Italian-Language Chatbot for Public Administration

Authors
Piizzi, Antonio [1]
Vavallo, Donatello [1]
Lazzo, Gaetano [1]
Dimola, Saverio [1]
Zazzera, Elvira [2]
Affiliations
[1] Tempo SRL, Bari, Italy
[2] Kad3 SRL, Fasano, Italy
Keywords
Natural Language Processing; chatbot; BERT; transformer; Italian language
DOI
10.14569/IJACSA.2024.0150906
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Natural Language Processing (NLP) models are used in chatbots to understand user input, interpret its meaning, and generate conversational responses that provide immediate and consistent assistance. This reduces problem-solving time and staff workload and increases user satisfaction. Chatbots fall into two groups: rule-based chatbots, which use decision trees and are programmed to answer specific questions, and self-learning chatbots, which can handle more complex conversations by learning continuously from data and user interactions. However, only a few chatbots have been developed specifically for the Italian language. The development of Italian-language chatbots for Public Administration (PA) presents unique challenges, particularly in creating models that can accurately understand and respond to user queries based on complex, context-specific documents. This paper proposes a novel NLP model tailored to the Italian language, designed to support the development of an advanced Question Answering (QA) chatbot for PA. The core of the proposed model is based on the BERT (Bidirectional Encoder Representations from Transformers) architecture, enhanced with an encoder/decoder module and a highway network module to improve the filtering and processing of input text. The principal aim of this research is to address the gap in Italian-language NLP models by providing a robust solution capable of handling the intricacies of the Italian language within the context of PA. The model is trained and evaluated on the Italian version of the Stanford Question Answering Dataset (SQuAD-IT). Experimental results show that the proposed model outperforms existing models such as BiDAF in terms of F1-score and Exact Match (EM), indicating a better ability to provide precise and accurate answers.
The comparative analysis shows the proposed model achieving an F1-score of 59.41% and an EM of 46.24%, compared to 49.35% and 38.43%, respectively, for BiDAF: a gain of 10.06 percentage points in F1-score and 7.81 in EM. The findings suggest that the proposed model offers substantial benefits in terms of accuracy and efficiency for PA applications.
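For readers unfamiliar with the two metrics reported in the abstract, the following is a minimal sketch of how Exact Match and token-level F1 are commonly computed for SQuAD-style extractive QA. It is not the authors' evaluation code. The function names and the Italian example strings are hypothetical, and the normalization is a simplifying assumption (lowercasing and replacing punctuation with spaces, with no article stripping), so scores may differ slightly from those of an official evaluation script.

```python
import string
from collections import Counter


def normalize(text: str) -> list[str]:
    """Lowercase, replace punctuation with spaces, and split into tokens.

    Assumption: no article removal is applied; an apostrophe becomes a space,
    so an elided article such as "l'" is kept as its own token "l".
    """
    text = text.lower()
    text = "".join(" " if ch in string.punctuation else ch for ch in text)
    return text.split()


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized token sequences are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred = normalize(prediction)
    ref = normalize(reference)
    if not pred or not ref:
        # If either side is empty, score 1.0 only when both are empty.
        return float(pred == ref)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    predicted = "presso l'ufficio anagrafe"
    gold = "l'ufficio anagrafe del comune"
    print(exact_match(predicted, gold))
    print(round(f1_score(predicted, gold), 4))
```

Dataset-level scores such as those quoted above are typically obtained by averaging these per-question values over the evaluation set and expressing the result as a percentage.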
Pages: 54-58
Number of pages: 5