Real-time, scalable, content-based Twitter users recommendation

Cited by: 8
Authors
Subercaze, Julien [1]
Gravier, Christophe [1]
Laforest, Frederique [1]
Affiliations
[1] Univ Jean Monnet, CNRS, UMR 5516, Lab Hubert Curien, 25 Rue Docteur Remy Annino, F-42000 St Etienne, France
Keywords
Twitter recommendation; binary footprint; large scale approach; information retrieval; real-time recommendation
DOI
10.3233/WEB-160329
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Real-time recommendation of Twitter users based on the content of their profiles is a very challenging task. Traditional IR methods such as TF-IDF fail to handle large datasets efficiently. In this paper we present a scalable approach that allows real-time recommendation of users based on their tweets. Our model builds a graph of terms, driven by the fact that users sharing similar interests will share similar terms. We show how this model can be encoded as a compact binary footprint that allows very fast comparison and ranking, taking full advantage of modern CPU architectures. We validate our approach through an empirical evaluation against Apache Lucene's implementation of TF-IDF. We show that our approach is on average two hundred times faster than a standard optimized implementation of TF-IDF, with a precision of 58%.
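The abstract does not say how footprints are built or compared. As a rough illustration only, the sketch below assumes each user is already reduced to a fixed-width integer footprint and that similarity is measured by Hamming distance (XOR followed by a bit count), which is one common way binary signatures exploit CPU bit operations. The function names, the 8-bit footprints, and the user names are all hypothetical and are not taken from the paper.

```python
from __future__ import annotations


def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions where two footprints differ (XOR, then popcount)."""
    return bin(a ^ b).count("1")


def rank_by_footprint(
    query: int, candidates: dict[str, int], top_k: int = 3
) -> list[tuple[str, int]]:
    """Return the top_k candidates closest to the query footprint.

    Assumption: a smaller Hamming distance means more similar users.
    Ties are broken alphabetically so the output is deterministic.
    """
    scored = [(name, hamming_distance(query, fp)) for name, fp in candidates.items()]
    scored.sort(key=lambda pair: (pair[1], pair[0]))
    return scored[:top_k]


# Hypothetical 8-bit footprints; a real system would use far wider ones.
footprints = {
    "alice": 0b10110010,
    "bob": 0b10110011,
    "carol": 0b01001101,
}

ranking = rank_by_footprint(0b10110110, footprints)
print(ranking)
```

Each comparison here is a single XOR plus a bit count, so ranking cost grows linearly with the number of candidates and involves no floating-point term weighting, unlike a TF-IDF cosine score.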
Pages: 17-29
Page count: 13
Related papers
50 in total
  • [21] Tag-LDA for scalable real-time tag recommendation
    Si, Xiance
    Sun, Maosong
    Journal of Information and Computational Science, 2009, 6 (02): 1009 - 1016
  • [22] Content-Based News Recommendation
    Kompan, Michal
    Bielikova, Maria
    E-COMMERCE AND WEB TECHNOLOGIES, 2010, 61 : 61 - 72
  • [23] IARank: Ranking Users on Twitter in Near Real-time, Based on their Information Amplification Potential
    Cappelletti, Rafael
    Sastry, Nishanth
    PROCEEDINGS OF THE 2012 ASE INTERNATIONAL CONFERENCE ON SOCIAL INFORMATICS (SOCIALINFORMATICS 2012), 2012: 70 - 77
  • [24] Real-Time, Content-Based Communication Load Reduction in the Internet of Multimedia Things
    Tanseer, Iffrah
    Kanwal, Nadia
    Asghar, Mamoona Naveed
    Iqbal, Ayesha
    Tanseer, Faryal
    Fleury, Martin
    APPLIED SCIENCES-BASEL, 2020, 10 (03)
  • [25] Real-time multimedia tagging and content-based retrieval for CCTV surveillance systems
    Perrott, AJ
    Lindsay, AT
    Parkes, AP
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS III, 2002, 4862 : 40 - 49
  • [26] Interest-based real-time content recommendation in online social communities
    Li, Dongsheng
    Lv, Qin
    Xie, Xing
    Shang, Li
    Xia, Huanhuan
    Lu, Tun
    Gu, Ning
    KNOWLEDGE-BASED SYSTEMS, 2012, 28 : 1 - 12
  • [27] Real-Time Twitter Recommendation: Online Motif Detection in Large Dynamic Graphs
    Gupta, Pankaj
    Satuluri, Venu
    Grewal, Ajeet
    Gurumurthy, Siva
    Zhabiuk, Volodymyr
    Li, Quannan
    Lin, Jimmy
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (13): 1379 - 1380
  • [28] Content-based image filtering for recommendation
    Jung, Kyung-Yong
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 312 - 321
  • [29] Real-time camera motion classification for content-based indexing and retrieval using templates
    Lee, S
    Hayes, MH
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002: 3664 - 3667
  • [30] A review of real-time segmentation of uncompressed video sequences for content-based search and retrieval
    Lefèvre, S
    Holler, J
    Vincent, N
    REAL-TIME IMAGING, 2003, 9 (01) : 73 - 98