Real-time, scalable, content-based Twitter users recommendation

Cited by: 8
Authors
Subercaze, Julien [1]
Gravier, Christophe [1]
Laforest, Frederique [1]
Affiliations
[1] Univ Jean Monnet, CNRS, UMR 5516, Lab Hubert Curien, 25 Rue Docteur Remy Annino, F-42000 St Etienne, France
Keywords
Twitter recommendation; binary footprint; large scale approach; information retrieval; real-time recommendation;
DOI
10.3233/WEB-160329
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Real-time recommendation of Twitter users based on the content of their profiles is a very challenging task. Traditional IR methods such as TF-IDF fail to handle large datasets efficiently. In this paper we present a scalable approach that allows real-time recommendation of users based on their tweets. Our model builds a graph of terms, driven by the fact that users sharing similar interests will share similar terms. We show how this model can be encoded as a compact binary footprint that allows very fast comparison and ranking, taking full advantage of modern CPU architectures. We validate our approach through an empirical evaluation against Apache Lucene's implementation of TF-IDF. We show that our approach is on average two hundred times faster than a standard optimized implementation of TF-IDF, with a precision of 58%.
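The abstract does not say how the term graph is turned into a binary footprint, so the following is only a rough sketch of the general idea of comparing users through fixed-width bit patterns. It assumes a 64-bit, SimHash-style footprint built directly from a bag of terms, which is a stand-in and not the authors' graph-based encoding. The names `term_hash`, `footprint`, `hamming` and `rank` are hypothetical. It illustrates why such footprints compare quickly: the distance between two users reduces to one XOR and one bit count.

```python
import hashlib

BITS = 64  # assumed footprint width; the abstract does not give the real size


def term_hash(term: str) -> int:
    """Map a term to a stable 64-bit integer (hypothetical helper)."""
    digest = hashlib.blake2b(term.encode("utf-8"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def footprint(terms) -> int:
    """Build a SimHash-style footprint from an iterable of terms.

    Assumption: this replaces the paper's term-graph encoding, which the
    abstract does not describe. Each bit is set when more terms vote 1
    than 0 at that position.
    """
    votes = [0] * BITS
    for term in terms:
        h = term_hash(term)
        for bit in range(BITS):
            votes[bit] += 1 if (h >> bit) & 1 else -1
    fp = 0
    for bit, vote in enumerate(votes):
        if vote > 0:
            fp |= 1 << bit
    return fp


def hamming(a: int, b: int) -> int:
    """Number of differing bits: one XOR followed by a population count."""
    return bin(a ^ b).count("1")


def rank(query_fp: int, candidates: dict) -> list:
    """Return candidate names ordered from closest to farthest footprint."""
    return sorted(candidates, key=lambda name: hamming(query_fp, candidates[name]))
```

For instance, `rank(footprint(my_terms), {"alice": footprint(alice_terms), "bob": footprint(bob_terms)})` would list the users whose footprints are nearest first. On Python 3.10 and later, `(a ^ b).bit_count()` can replace the `bin(...).count("1")` call.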
Pages: 17-29
Number of pages: 13
Related papers
  • [1] Real-time, Scalable, Content-based Twitter Users Recommendation
    Subercaze, Julien
    Gravier, Christophe
    Laforest, Frederique
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1367 - 1367
  • [2] A Multi-view Content-Based User Recommendation Scheme for Following Users in Twitter
    Chechev, Milen
    Georgiev, Petko
    SOCIAL INFORMATICS, SOCINFO 2012, 2012, 7710 : 434 - 447
  • [3] Content-based Classification of Political Inclinations of Twitter Users
    Di Giovanni, Marco
    Brambilla, Marco
    Ceri, Stefano
    Daniel, Florian
    Ramponi, Giorgia
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4321 - 4327
  • [4] Content-based retrieval of similar videos in real-time
    Quellec, Gwenole
    Lamard, Mathieu
    Cazuguel, Guy
    Droueche, Zakarya
    Cochener, Beatrice
    Roux, Christian
    TRAITEMENT DU SIGNAL, 2012, 29 (1-2) : 83 - 100
  • [5] Real-time content-based processing of multicast video
    Zhou, WS
    Vellaikal, A
    Shen, Y
    Kuo, JCC
    CONFERENCE RECORD OF THE THIRTY-SECOND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 882 - 886
  • [6] Scalable and Real-time Sentiment Analysis of Twitter Data
    Karanasou, Maria
    Ampla, Anneta
    Doulkeridis, Christos
    Halkidi, Maria
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 944 - 951
  • [7] Terms of a Feather: Content-Based News Recommendation and Discovery Using Twitter
    Phelan, Owen
    McCarthy, Kevin
    Bennett, Mike
    Smyth, Barry
    ADVANCES IN INFORMATION RETRIEVAL, 2011, 6611 : 448 - 459
  • [8] An efficient architecture for real-time content-based arithmetic coding
    Gong, D
    He, Y
    ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL III: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 515 - 518
  • [9] Real-Time Adaptive Content-Based Synchronization of Multimedia Streams
    Elhajj, Imad H.
    Dargham, Nadine Bou
    Xi, Ning
    Jia, Yunyi
    ADVANCES IN MULTIMEDIA, 2011, 2011
  • [10] Real-time content-based adaptive streaming of sports videos
    Chang, SF
    Zhong, D
    Kumar, R
    IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES, PROCEEDINGS, 2001, : 139 - 146