TopicBERT: A Topic-Enhanced Neural Language Model Fine-Tuned for Sentiment Classification

被引：23

作者：

Zhou, Yuxiang ^{[1
]}

Liao, Lejian ^{[1
]}

Gao, Yang ^{[1
]}

Wang, Rui ^{[2
]}

Huang, Heyan ^{[1
]}

机构：

[1] Beijing Inst Technol, Fac Comp Sci, Beijing 100081, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Fac Comp Sci, Nanjing 210023, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023年 / 34卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Task analysis; Bit error rate; Semantics; Predictive models; Training; Context modeling; Social networking (online); Bidirectional encoder representations from transformers (BERT); pretrained neural language model; sentiment classification; topic-enhanced neural network;

D O I：

10.1109/TNNLS.2021.3094987

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Sentiment classification is a form of data analytics where people's feelings and attitudes toward a topic are mined from data. This tantalizing power to ``predict the zeitgeist'' means that sentiment classification has long attracted interest, but with mixed results. However, the recent development of the BERT framework and its pretrained neural language models is seeing new-found success for sentiment classification. BERT models are trained to capture word-level information via mask language modeling and sentence-level contexts via next sentence prediction tasks. Out of the box, they are adequate models for some natural language processing tasks. However, most models are further fine-tuned with domain-specific information to increase accuracy and usefulness. Motivated by the idea that a further fine-tuning step would improve the performance for downstream sentiment classification tasks, we developed TopicBERT--a BERT model fine-tuned to recognize topics at the corpus level in addition to the word and sentence levels. TopicBERT comprises two variants: TopicBERT-ATP (aspect topic prediction), which captures topic information via an auxiliary training task, and TopicBERT-TA, where topic representation is directly injected into a topic augmentation layer for sentiment classification. With TopicBERT-ATP, the topics are predetermined by an LDA mechanism and collapsed Gibbs sampling. With TopicBERT-TA, the topics can change dynamically during the training. Experimental results show that both approaches deliver the state-of-the-art performance in two different domains with SemEval 2014 Task 4. However, in a test of methods, direct augmentation outperforms further training. Comprehensive analyses in the form of ablation, parameter, and complexity studies accompany the results.

引用

页码：380 / 393

页数：14

共 50 条

[31] MoralBERT: A Fine-Tuned Language Model for Capturing Moral Values in Social Discussions
Preniqi, Vjosa
Ghinassi, Iacopo
Ive, Julia
Saitis, Charalampos
Kalimeri, Kyriaki
PROCEEDINGS OF THE 2024 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR SOCIAL GOOD, GOODIT 2024, 2024, : 433 - 442
[32] A Fine-Tuned MobileNetV3 Model for Real and Fake Image Classification
Singh, Gurpreet
Guleria, Kalpna
Sharma, Shagun
2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 1590 - 1594
[33] Detection and classification of breast cancer in mammographic images with fine-tuned convolutional neural networks
Luong, Huong Hoang
Nguyen, Hai Thanh
Thai-Nghe, Nguyen
JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2024,
[34] Blending Ensemble of Fine-Tuned Convolutional Neural Networks Applied to Mammary Image Classification
Zhang, Jingyi
Pan, Shuwan
Hong, Huichao
Kong, Lingke
JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (06) : 1160 - 1166
[35] BERT's sentiment score for portfolio optimization: a fine-tuned view in Black and Litterman model
Colasanto, Francesco
Grilli, Luca
Santoro, Domenico
Villani, Giovanni
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (20): : 17507 - 17521
[36] BERT’s sentiment score for portfolio optimization: a fine-tuned view in Black and Litterman model
Francesco Colasanto
Luca Grilli
Domenico Santoro
Giovanni Villani
Neural Computing and Applications, 2022, 34 : 17507 - 17521
[37] Understanding language-elicited EEG data by predicting it from a fine-tuned language model
Schwartz, Dan
Mitchell, Tom
2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 43 - 57
[38] BERT for Sentiment Analysis: Pre-trained and Fine-Tuned Alternatives
Souza, Frederico Dias
de Oliveira e Souza Filho, Joao Baptista
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 209 - 218
[39] FinBERT-FOMC: Fine-Tuned FinBERT Model with Sentiment Focus Method for Enhancing Sentiment Analysis of FOMC Minutes
Chen, Ziwei
Goessi, Sandro
Kim, Wonseong
Bermeitinger, Bernhard
Handschuh, Siegfried
PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, ICAIF 2023, 2023, : 357 - 364
[40] A minimally fine-tuned supersymmetric standard model
Chacko, Z
Nomura, Y
Tucker-Smith, D
NUCLEAR PHYSICS B, 2005, 725 (1-2) : 207 - 250

← 1 2 3 4 5 →