TF-IDF;
N-Gram;
Text classification;
Feature weighting;
Information retrieval;
SENTIMENT;
REVIEWS;
DOI:
Not available
Chinese Library Classification number:
TP18 [Artificial intelligence theory];
Discipline classification codes:
081104 ;
0812 ;
0835 ;
1405 ;
Abstract:
Text classification is the process of assigning text to relevant categories, and its algorithms are at the core of many Natural Language Processing (NLP) applications. Term Frequency-Inverse Document Frequency (TF-IDF) and NLP are the most widely used information retrieval methods in text classification. We investigated and analyzed the feature weighting method for text classification on unstructured data. The proposed model considers two features, N-Grams and TF-IDF, on the IMDB movie reviews and Amazon Alexa reviews datasets for sentiment analysis. To validate the method, we used state-of-the-art classifiers: Support Vector Machine (SVM), Logistic Regression, Multinomial Naive Bayes (Multinomial NB), Random Forest, Decision Tree, and k-nearest neighbors (KNN). Of the two feature extraction approaches, TF-IDF features gave a significant improvement over N-Gram features. TF-IDF achieved the maximum accuracy (93.81%), precision (94.20%), recall (93.81%), and F1-score (91.99%) with the Random Forest classifier.
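The two feature types named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not state which TF-IDF variant was used, so the code assumes one common form (term count divided by document length, multiplied by the natural log of N / document frequency). The function names `word_ngrams` and `tf_idf` and the three toy reviews are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def word_ngrams(tokens, n):
    """Return the contiguous word n-grams of a token list, joined by spaces."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tf_idf(docs):
    """Compute TF-IDF weights for a list of tokenized documents.

    Assumed variant (not taken from the paper):
      tf  = term count / number of terms in the document
      idf = ln(number of documents / number of documents containing the term)
    Returns one {term: weight} dict per document.
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc))  # count each term once per document

    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical toy reviews, not drawn from the IMDB or Amazon Alexa datasets.
reviews = [
    "great movie great cast",
    "boring movie",
    "great sound quality",
]
tokenized = [review.split() for review in reviews]

# TF-IDF over single words, and TF-IDF over word bigrams (N-Gram with n = 2).
unigram_weights = tf_idf(tokenized)
bigram_weights = tf_idf([word_ngrams(tokens, 2) for tokens in tokenized])
```

In this sketch a term that appears in fewer documents (such as "cast") receives a higher weight than one shared across documents (such as "movie"), which is the weighting behaviour the abstract relies on. The resulting dictionaries would still need to be converted to fixed-length vectors before being passed to a classifier.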
Canas-Bajo, Jose
Affiliation:
Aalto Univ, Menestys Grp, Espoo, Finland
Canas-Bajo, Teresa
Affiliation:
Univ Calif Berkeley, Vis Sci Program, Berkeley, CA 94720 USA
Berki, Eleni
Affiliation:
Univ Tampere, FIN-33101 Tampere, Finland
Univ Jyvaskyla, Software Qual & Formal Modeling, Jyvaskyla, Finland
Valtanen, Juri-Petri
Affiliation:
Univ Tampere, Fac Educ, Tampere, Finland
Saariluoma, Pertti
Affiliation:
Univ Jyvaskyla, Cognit Sci, Jyvaskyla, Finland
Univ Oxford, Oxford, England
Univ Cambridge, Cambridge, England
Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
IIASA, Laxenburg, Austria