Real-time, scalable, content-based Twitter users recommendation

Cited by: 8
Authors
Subercaze, Julien [1]
Gravier, Christophe [1]
Laforest, Frederique [1]
Affiliation
[1] Univ Jean Monnet, CNRS, UMR 5516, Lab Hubert Curien, 25 Rue Docteur Remy Annino, F-42000 St Etienne, France
Keywords
Twitter recommendation; binary footprint; large scale approach; information retrieval; real-time recommendation;
DOI
10.3233/WEB-160329
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Real-time recommendation of Twitter users based on the content of their profiles is a very challenging task. Traditional IR methods such as TF-IDF fail to handle large datasets efficiently. In this paper we present a scalable approach that allows real-time recommendation of users based on their tweets. Our model builds a graph of terms, driven by the fact that users sharing similar interests will share similar terms. We show how this model can be encoded as a compact binary footprint that allows very fast comparison and ranking, taking full advantage of modern CPU architectures. We validate our approach through an empirical evaluation against Apache Lucene's implementation of TF-IDF. We show that our approach is on average two hundred times faster than a standard optimized implementation of TF-IDF, with a precision of 58%.
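The abstract does not say how the binary footprint is built or compared, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a hypothetical encoding in which each term sets one bit of a 64-bit integer, and it compares footprints by Hamming distance (XOR followed by a population count), an operation that modern CPUs execute in very few instructions. The names `FOOTPRINT_BITS`, `footprint`, `hamming`, and `rank` are illustrative and do not come from the paper.

```python
import hashlib

# Assumed footprint width; the abstract does not state one.
FOOTPRINT_BITS = 64


def footprint(terms):
    """Set one bit per term, chosen by a stable hash (hypothetical encoding)."""
    bits = 0
    for term in terms:
        digest = hashlib.md5(term.encode("utf-8")).digest()
        position = int.from_bytes(digest[:8], "big") % FOOTPRINT_BITS
        bits |= 1 << position
    return bits


def hamming(a, b):
    """Count the differing bits: XOR the footprints, then count the 1 bits."""
    return bin(a ^ b).count("1")


def rank(query, candidates):
    """Return candidate names ordered by increasing Hamming distance to query.

    `candidates` maps a user name to that user's integer footprint.
    """
    return sorted(candidates, key=lambda name: hamming(query, candidates[name]))
```

With footprints precomputed per user, ranking reduces to one XOR and one bit count per candidate, which is the kind of comparison that a term-vector method such as TF-IDF cannot match in cost. A real system would need an encoding that preserves the structure of the term graph, which this sketch does not attempt.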
Pages: 17-29
Number of pages: 13