Positive and Unlabeled Learning for User Behavior Analysis Based on Mobile Internet Traffic Data

被引:11
|
作者
Yu, Ke [1 ]
Liu, Yue [1 ]
Qing, Linbo [2 ]
Wang, Binbin [1 ]
Cheng, Yongqiang [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[2] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610054, Sichuan, Peoples R China
[3] Univ Hull, Sch Engn & Comp Sci, Kingston Upon Hull HU6 7RX, N Humberside, England
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Mobile user behavior; PU learning; App usage prediction; mobile video traffic identification; bipartite network; USAGE PREDICTION;
D O I
10.1109/ACCESS.2018.2852008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of wireless communication and mobile Internet, mobile phone becomes ubiquitous and functions as a versatile and a smart system, on which people frequently interact with various mobile applications (Apps). Understanding human behaviors using mobile phone is significant for mobile system developers, for human-centered system optimization and better service provisioning. In this paper, we focus on mobile user behavior analysis and prediction based on mobile Internet traffic data. Real traffic flow data is collected from the public network of Internet service providers, by high-performance network traffic monitors. We construct a User-App bipartite network to represent the traffic interaction pattern between users and App servers. After mining the explicit and implicit features from the User-App bipartite network, we propose two positive and unlabeled (PU) learning methods, including Spy-based PU learning and K-means-based PU learning, for App usage prediction and mobile video traffic identification. We first use the traffic flow data of QQ, a very famous messaging and social media application possessing high market share in China, as the experimental data set for App usage prediction task. Then, we use the traffic flow data from six popular Apps, including video intensive Apps (Youku, Baofeng, LeTV, and Tudou) and other Apps (Meituan and Apple), as the experimental data set for mobile video traffic identification task. Experimental results show that our proposed PU learning methods perform well in both tasks.
引用
收藏
页码:37568 / 37580
页数:13
相关论文
共 50 条
  • [21] A Study of User Downloading Behavior in Mobile Internet Using Clickstream Data
    Liu, Yanbin
    Yuan, Ping
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 255 - 257
  • [22] Learning from positive and unlabeled data: a survey
    Jessa Bekker
    Jesse Davis
    Machine Learning, 2020, 109 : 719 - 760
  • [23] Learning from positive and unlabeled data: a survey
    Bekker, Jessa
    Davis, Jesse
    MACHINE LEARNING, 2020, 109 (04) : 719 - 760
  • [24] State of the art on road traffic sensing and learning based on mobile user network log data
    Huang, Jin
    Xiao, Ming
    NEUROCOMPUTING, 2018, 278 : 110 - 118
  • [25] Tri-training based learning from positive and unlabeled data
    Zhang, Bangzuo
    Zuo, Wanli
    2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 640 - 644
  • [26] An Active Learning Based on Uncertainty and Density Method for Positive and Unlabeled Data
    Luo, Jun
    Zhou, Wenan
    Du, Yu
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT I, 2018, 11334 : 229 - 241
  • [27] Analysis of Location Data Leakage in the Internet Traffic of Android-based Mobile Devices
    Sivan, Nir
    Bitton, Ron
    Shabtai, Asaf
    PROCEEDINGS OF THE 22ND INTERNATIONAL SYMPOSIUM ON RESEARCH IN ATTACKS, INTRUSIONS AND DEFENSES, 2019, : 243 - 260
  • [28] Learning from Positive and Unlabeled Data with Arbitrary Positive Shift
    Hammoudeh, Zayd
    Lowd, Daniel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [29] Scalping Anomaly Detection Based on Mobile Internet Traffic Data
    Wu, Chuting
    Yu, Ke
    Wu, Xiaofei
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND COMMUNICATION ENGINEERING (ICTCE 2018), 2018, : 237 - 244
  • [30] Toward understanding the mobile Internet user behavior: A methodology for user clustering with aging analysis
    Yamakami, T
    PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT'2003, PROCEEDINGS, 2003, : 85 - 89