Neural Factorization Machines for Sparse Predictive Analytics

Cited by: 997
Authors
He, Xiangnan [1]
Chua, Tat-Seng [1]
Affiliations
[1] National University of Singapore, School of Computing, Singapore 117417, Singapore
Funding
National Research Foundation, Singapore
Keywords
Factorization Machines; Neural Networks; Deep Learning; Sparse Data; Regression; Recommendation
DOI
10.1145/3077136.3080777
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Many predictive tasks of web applications need to model categorical variables, such as user IDs and demographics like genders and occupations. To apply standard machine learning techniques, these categorical predictors are always converted to a set of binary features via one-hot encoding, making the resultant feature vector highly sparse. To learn from such sparse data effectively, it is crucial to account for the interactions between features. Factorization Machines (FMs) are a popular solution for efficiently using the second-order feature interactions. However, FM models feature interactions in a linear way, which can be insufficient for capturing the non-linear and complex inherent structure of real-world data. While deep neural networks have recently been applied to learn non-linear feature interactions in industry, such as the Wide&Deep by Google and DeepCross by Microsoft, the deep structure meanwhile makes them difficult to train. In this paper, we propose a novel model Neural Factorization Machine (NFM) for prediction under sparse settings. NFM seamlessly combines the linearity of FM in modelling second-order feature interactions and the non-linearity of neural network in modelling higher-order feature interactions. Conceptually, NFM is more expressive than FM since FM can be seen as a special case of NFM without hidden layers. Empirical results on two regression tasks show that with one hidden layer only, NFM significantly outperforms FM with a 7.3% relative improvement. Compared to the recent deep learning methods Wide&Deep and DeepCross, our NFM uses a shallower structure but offers better performance, being much easier to train and tune in practice.
Pages: 355-364
Page count: 10
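
The abstract states that FM is the special case of NFM without hidden layers, but it gives no formulas. The sketch below illustrates that claim under stated assumptions: second-order interactions are pooled into one vector (the paper calls this Bi-Interaction pooling), hidden layers use ReLU, and the output weights are a vector h. All function names, parameter values, and the toy feature layout are hypothetical and are not taken from the record above.

```python
from itertools import combinations


def one_hot(index, size):
    """Return a one-hot list of length `size` with 1.0 at position `index`."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def bi_interaction(x, V):
    """Pool all pairwise products (x_i v_i) * (x_j v_j), i < j, element-wise.

    Uses the identity 0.5 * ((sum z_i)^2 - sum z_i^2), so the cost is
    linear in the number of non-zero features.
    """
    k = len(V[0])
    total = [0.0] * k
    total_sq = [0.0] * k
    for xi, vi in zip(x, V):
        if xi == 0.0:
            continue  # sparse input: zero features contribute nothing
        for f in range(k):
            z = xi * vi[f]
            total[f] += z
            total_sq[f] += z * z
    return [0.5 * (total[f] ** 2 - total_sq[f]) for f in range(k)]


def fm_pairwise(x, V):
    """Naive FM second-order term: sum over i < j of <v_i, v_j> x_i x_j."""
    return sum(
        sum(a * b for a, b in zip(V[i], V[j])) * x[i] * x[j]
        for i, j in combinations(range(len(x)), 2)
    )


def nfm_predict(x, w0, w, V, hidden, h):
    """Assumed NFM form: bias + linear term + h . MLP(bi_interaction(x)).

    `hidden` is a list of (W, b) pairs, each applied with a ReLU.
    """
    z = bi_interaction(x, V)
    for W, b in hidden:
        z = [
            max(0.0, sum(wr * zi for wr, zi in zip(row, z)) + bias)
            for row, bias in zip(W, b)
        ]
    linear = w0 + sum(wi * xi for wi, xi in zip(w, x))
    return linear + sum(hi * zi for hi, zi in zip(h, z))


# Toy input: user ID (3 values), gender (2 values), occupation (3 values).
x = one_hot(1, 3) + one_hot(0, 2) + one_hot(2, 3)
V = [[0.1 * (i + 1), 0.2 - 0.05 * i] for i in range(len(x))]  # embeddings, k = 2
w = [0.01 * i for i in range(len(x))]
w0 = 0.5

# With no hidden layers and h = (1, 1), the prediction equals the FM score.
nfm_as_fm = nfm_predict(x, w0, w, V, hidden=[], h=[1.0, 1.0])
fm_score = w0 + sum(wi * xi for wi, xi in zip(w, x)) + fm_pairwise(x, V)
```

Passing a non-empty `hidden` list, for example one `(W, b)` pair mapping the two pooled dimensions to a few ReLU units, gives the one-hidden-layer setting that the abstract evaluates; `h` must then match the width of the last layer.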