Author Identification based on Word Distribution in Word Space

被引：0

作者：

Ganesh, Barathi H. B. ^{[1
]}

Reshma, U. ^{[1
]}

Kumar, Anand M. ^{[1
]}

机构：

[1] Amrita Vishwa Vidyapeetham, Ctr Excellence Computat Engn & Networking, Coimbatore 641112, Tamil Nadu, India

来源：

2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2015年

关键词：

Author attribution; Random forest tree; Logistic Regression; Support Vector Machine; PAN Author Identification 2014;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Author attribution has grown into an area that is more challenging from the past decade. It has become an inevitable task in many sectors like forensic analysis, law, journalism and many more as it helps to detect the author in every documentation. Here unigram/bigram features along with latent semantic features from word space were taken and the similarity of a particular document was tested using Random forest tree, Logistic Regression and Support Vector Machine in order to create a global model. Dataset from PAN Author Identification shared task 2014 is taken for processing. It has been observed that the proposed model shows state-of-art accuracy of 80% which is significantly greater when compared to the Author Identification PAN results of the year 2014.

引用

页码：1519 / 1523

页数：5

共 50 条

[21] Distribution of semantic neighbours based on word features
McAuley, Tara L.
Lansue, Brette
Buchanan, Lori
INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2023, 58 : 750 - 750
[22] Knowledge of word length does not constrain word identification
Inhoff, AW
Eiter, BM
PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG, 2003, 67 (01): : 1 - 9
[23] Word identification in noise
Pisoni, DB
LANGUAGE AND COGNITIVE PROCESSES, 1996, 11 (06): : 681 - 687
[24] MECHANISMS OF WORD IDENTIFICATION
MEWHORT, DJK
BEAL, AL
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1977, 3 (04) : 629 - 640
[25] The temporal distribution of information in audiovisual spoken-word identification
Alexandra Jesse
Dominic W. Massaro
Attention, Perception, & Psychophysics, 2010, 72 : 209 - 225
[26] DISTRIBUTION OF WORD FREQUENCIES
GOOD, IJ
NATURE, 1957, 179 (4559) : 595 - 595
[27] DISTRIBUTION - A LAST WORD
GEACH, PT
PHILOSOPHICAL REVIEW, 1960, 69 (03): : 396 - 398
[28] The temporal distribution of information in audiovisual spoken-word identification
Jesse, Alexandra
Massaro, Dominic W.
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2010, 72 (01) : 209 - 225
[29] EXPLICATION OF VALUE WORD VIEW IN THE COMMUNICATIVE MODEL "AUTHOR - ADDRESSE" IN TEXT SPACE OF MAGAZINES
Kostyashina, Ekaterina A.
VESTNIK TOMSKOGO GOSUDARSTVENNOGO UNIVERSITETA-KULTUROLOGIYA I ISKUSSTVOVEDENIE-TOMSK STATE UNIVERSITY JOURNAL OF CULTURAL STUDIES AND ART HISTORY, 2012, 8 (04): : 23 - +
[30] Introduction to the special issue: Morphology in word identification and word spelling
Verhoeven, Ludo
Carlisle, Joanne F.
READING AND WRITING, 2006, 19 (07) : 643 - 650

← 1 2 3 4 5 →