Deep-transfer learning inspired natural language processing system for software requirements classification

被引：0

作者：

Saqib, Mohd ^{[1
,5
]}

Mustaqeem, Mohd ^{[2
]}

Jawed, Md Saquib ^{[2
,6
]}

Abdulaziz, Alsolami ^{[3
]}

Khan, Anish ^{[4
]}

Khan, Jeeshan ^{[2
]}

机构：

[1] Indian Inst Technol ISM, Dept Math & Comp, Dhanbad, JH, India

[2] Aligarh Muslim Univ, Dept Comp Sci, Aligarh, Uttar Pradesh, India

[3] King Abdulaziz Univ, Dept Special Educ, Jeddah 21589, Saudi Arabia

[4] King Abdulaziz Univ, Ctr Excellence Adv Mat Res, Jeddah 21589, Saudi Arabia

[5] McGill Univ, Sch Informat Studies, Quebec City, PQ, Canada

[6] Bennett Univ, Sch Comp Sci Engn & Technol, Noida, India

来源：

KNOWLEDGE AND INFORMATION SYSTEMS | 2024年

关键词：

Functional requirements; Software engineering; Natural language processing; Transfer learning; Recurrent neural network; NONFUNCTIONAL REQUIREMENTS;

D O I：

10.1007/s10115-024-02248-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the software engineering domain, the distinction between functional (FRs) and non-functional requirements (NFRs) is paramount, as it directly influences the design and development of software systems. However, several challenges, such as dealing with limited training data, domain-specific datasets, and high computational costs, have driven the need for innovative solutions, particularly those related to classifying functional and non-functional software requirements. The limited availability of labeled data for training deep learning models and their high computational costs have hindered progress. This study proposes a novel hierarchical transfer learning (HTL) approach to address the challenges of limited training data and high computational costs associated with deep learning models. The HTL model leverages transfer learning techniques, incorporating pre-trained models such as global vectors for word representation (GloVe) for text vectorization and a bidirectional long short-term memory (BiLSTM) architecture. By harnessing knowledge from large text corpora and capturing both high-level semantic relationships and detailed syntactic patterns, the HTL model demonstrates enhanced classification performance. We have evaluated the model's performance using precision, recall, F1-score, and the area under the receiver operating characteristic curve. For FRs classification, we have observed a 26% improvement in precision, a 9% improvement in recall, and an 18% in F1-score for small datasets. Similarly, for NFRs, classification achieves a 20% improvement in precision, a 38.8% improvement in recall, and a 31.8% improvement in F1-score. For large datasets, we have observed a 25% improvement in precision, a 7% improvement in recall, and a 15% improvement in F1-score for FRs classification. For NFRs classification, it achieves a 24% improvement in precision, a 39.8% improvement in recall, and a 41.8% improvement in F1-score. Our study presents a pioneering HTL approach for FRs and NFRs classification, demonstrating superior performance compared to traditional methods. Furthermore, we identify areas for future research, including improving model interpretability, handling data biases, and fine-tuning hyperparameters, which will further enhance the capabilities and applicability of the HTL model.

引用

页码：839 / 861

页数：23

共 50 条

[1] Software requirements as an application domain for natural language processing
Diamantopoulos, Themistoklis
Roth, Michael
Symeonidis, Andreas
Klein, Ewan
LANGUAGE RESOURCES AND EVALUATION, 2017, 51 (02) : 495 - 524
[2] Software requirements as an application domain for natural language processing
Themistoklis Diamantopoulos
Michael Roth
Andreas Symeonidis
Ewan Klein
Language Resources and Evaluation, 2017, 51 : 495 - 524
[3] Deep learning of the natural language processing
Allauzen, Alexandre
Schuetze, Hinrich
TRAITEMENT AUTOMATIQUE DES LANGUES, 2018, 59 (02): : 7 - 14
[4] Deep Learning in Natural Language Processing
Feng, Haoda
Shi, Feng
NATURAL LANGUAGE ENGINEERING, 2021, 27 (03) : 373 - 375
[5] Application of deep-transfer learning in automatic glaucoma detection
Zhao L.
Xu X.
Li J.
Zhao Q.
Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2023, 44 (04): : 673 - 678
[6] Deep learning for schizophrenia classification based on natural language processing -A pilot study
Lin, Pei-Yun
Chen, Ying-Hsuan
Chang, Yuh-Jer
Chen, Jia-Wei
Ho, Tsung-Tse
Shih, Tai-Chuan
Ko, Chih-Hung
Lai, Ying-Hui
SCHIZOPHRENIA RESEARCH, 2024, 270 : 323 - 324
[7] TEXT CLASSIFICATION AND CLUSTER ANALYSIS BASED ON DEEP LEARNING AND NATURAL LANGUAGE PROCESSING
HUANG H.U.A.
Scalable Computing, 2024, 25 (03): : 1826 - 1832
[8] TEXT CLASSIFICATION AND CLUSTER ANALYSIS BASED ON DEEP LEARNING AND NATURAL LANGUAGE PROCESSING
Huang, Hua
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (03): : 1826 - 1832
[9] Natural Language Processing with Optimal Deep Learning Based Fake News Classification
Althubiti, Sara A.
Alenezi, Fayadh
Mansour, Romany F.
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 3529 - 3544
[10] Deep Learning with Natural Language Processing Enabled Sentimental Analysis on Sarcasm Classification
Sait, Abdul Rahaman Wahab
Ishak, Mohamad Khairi
COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (03): : 2553 - 2567

← 1 2 3 4 5 →