Deep-transfer learning inspired natural language processing system for software requirements classification

被引:0
|
作者
Saqib, Mohd [1 ,5 ]
Mustaqeem, Mohd [2 ]
Jawed, Md Saquib [2 ,6 ]
Abdulaziz, Alsolami [3 ]
Khan, Anish [4 ]
Khan, Jeeshan [2 ]
机构
[1] Indian Inst Technol ISM, Dept Math & Comp, Dhanbad, JH, India
[2] Aligarh Muslim Univ, Dept Comp Sci, Aligarh, Uttar Pradesh, India
[3] King Abdulaziz Univ, Dept Special Educ, Jeddah 21589, Saudi Arabia
[4] King Abdulaziz Univ, Ctr Excellence Adv Mat Res, Jeddah 21589, Saudi Arabia
[5] McGill Univ, Sch Informat Studies, Quebec City, PQ, Canada
[6] Bennett Univ, Sch Comp Sci Engn & Technol, Noida, India
关键词
Functional requirements; Software engineering; Natural language processing; Transfer learning; Recurrent neural network; NONFUNCTIONAL REQUIREMENTS;
D O I
10.1007/s10115-024-02248-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the software engineering domain, the distinction between functional (FRs) and non-functional requirements (NFRs) is paramount, as it directly influences the design and development of software systems. However, several challenges, such as dealing with limited training data, domain-specific datasets, and high computational costs, have driven the need for innovative solutions, particularly those related to classifying functional and non-functional software requirements. The limited availability of labeled data for training deep learning models and their high computational costs have hindered progress. This study proposes a novel hierarchical transfer learning (HTL) approach to address the challenges of limited training data and high computational costs associated with deep learning models. The HTL model leverages transfer learning techniques, incorporating pre-trained models such as global vectors for word representation (GloVe) for text vectorization and a bidirectional long short-term memory (BiLSTM) architecture. By harnessing knowledge from large text corpora and capturing both high-level semantic relationships and detailed syntactic patterns, the HTL model demonstrates enhanced classification performance. We have evaluated the model's performance using precision, recall, F1-score, and the area under the receiver operating characteristic curve. For FRs classification, we have observed a 26% improvement in precision, a 9% improvement in recall, and an 18% in F1-score for small datasets. Similarly, for NFRs, classification achieves a 20% improvement in precision, a 38.8% improvement in recall, and a 31.8% improvement in F1-score. For large datasets, we have observed a 25% improvement in precision, a 7% improvement in recall, and a 15% improvement in F1-score for FRs classification. For NFRs classification, it achieves a 24% improvement in precision, a 39.8% improvement in recall, and a 41.8% improvement in F1-score. Our study presents a pioneering HTL approach for FRs and NFRs classification, demonstrating superior performance compared to traditional methods. Furthermore, we identify areas for future research, including improving model interpretability, handling data biases, and fine-tuning hyperparameters, which will further enhance the capabilities and applicability of the HTL model.
引用
收藏
页码:839 / 861
页数:23
相关论文
共 50 条
  • [1] Software requirements as an application domain for natural language processing
    Diamantopoulos, Themistoklis
    Roth, Michael
    Symeonidis, Andreas
    Klein, Ewan
    LANGUAGE RESOURCES AND EVALUATION, 2017, 51 (02) : 495 - 524
  • [2] Software requirements as an application domain for natural language processing
    Themistoklis Diamantopoulos
    Michael Roth
    Andreas Symeonidis
    Ewan Klein
    Language Resources and Evaluation, 2017, 51 : 495 - 524
  • [3] Deep learning of the natural language processing
    Allauzen, Alexandre
    Schuetze, Hinrich
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2018, 59 (02): : 7 - 14
  • [4] Deep Learning in Natural Language Processing
    Feng, Haoda
    Shi, Feng
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (03) : 373 - 375
  • [5] Application of deep-transfer learning in automatic glaucoma detection
    Zhao L.
    Xu X.
    Li J.
    Zhao Q.
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2023, 44 (04): : 673 - 678
  • [6] Deep learning for schizophrenia classification based on natural language processing -A pilot study
    Lin, Pei-Yun
    Chen, Ying-Hsuan
    Chang, Yuh-Jer
    Chen, Jia-Wei
    Ho, Tsung-Tse
    Shih, Tai-Chuan
    Ko, Chih-Hung
    Lai, Ying-Hui
    SCHIZOPHRENIA RESEARCH, 2024, 270 : 323 - 324
  • [7] TEXT CLASSIFICATION AND CLUSTER ANALYSIS BASED ON DEEP LEARNING AND NATURAL LANGUAGE PROCESSING
    HUANG H.U.A.
    Scalable Computing, 2024, 25 (03): : 1826 - 1832
  • [8] TEXT CLASSIFICATION AND CLUSTER ANALYSIS BASED ON DEEP LEARNING AND NATURAL LANGUAGE PROCESSING
    Huang, Hua
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (03): : 1826 - 1832
  • [9] Natural Language Processing with Optimal Deep Learning Based Fake News Classification
    Althubiti, Sara A.
    Alenezi, Fayadh
    Mansour, Romany F.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 3529 - 3544
  • [10] Deep Learning with Natural Language Processing Enabled Sentimental Analysis on Sarcasm Classification
    Sait, Abdul Rahaman Wahab
    Ishak, Mohamad Khairi
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (03): : 2553 - 2567