Scalable Supervised Dimensionality Reduction Using Clustering

被引:0
|
作者
Raeder, Troy [1 ]
Perlich, Claudia [1 ]
Dalessandro, Brian [1 ]
Stitelman, Ori [1 ]
Provost, Foster [2 ,3 ]
机构
[1] m6d Res, 37 E 18th St, New York, NY 10003 USA
[2] NYU, New York, NY 10012 USA
[3] m6d Res, New York, NY 10012 USA
关键词
supervised dimensionality reduction; clustering; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The automated targeting of online display ads at scale requires the simultaneous evaluation of a single prospect against many independent models. When deciding which ad to show to a user, one must calculate likelihood-to-convert scores for that user across all potential advertisers in the system. For modern machine-learning-based targeting, as conducted by Media6Degrees (m6d), this can mean scoring against thousands of models in a large, sparse feature space. Dimensionality reduction within this space is useful, as it decreases scoring time and model storage requirements. To meet this need, we develop a novel algorithm for scalable supervised dimensionality reduction across hundreds of simultaneous classification tasks. The algorithm performs hierarchical clustering in the space of model parameters from historical models in order to collapse related features into a single dimension. This allows us to implicitly incorporate feature and label data across all tasks without operating directly in a massive space. We present experimental results showing that for this task our algorithm outperforms other popular dimensionality-reduction algorithms across a wide variety of ad campaigns, as well as production results that showcase its performance in practice.
引用
收藏
页码:1213 / 1221
页数:9
相关论文
共 50 条
  • [21] Semantic coding by supervised dimensionality reduction
    Kokiopoulou, Effrosyni
    Frossard, Pascal
    IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (05) : 806 - 818
  • [22] Semi-Supervised Dimensionality Reduction
    Zhang, Daoqiang
    Zhou, Zhi-Hua
    Chen, Songcan
    PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 629 - +
  • [23] Supervised dimensionality reduction for big data
    Joshua T. Vogelstein
    Eric W. Bridgeford
    Minh Tang
    Da Zheng
    Christopher Douville
    Randal Burns
    Mauro Maggioni
    Nature Communications, 12
  • [24] Semi-Supervised Dimensionality Reduction
    Wang, Yongmao
    Wang, Yukun
    THIRD INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2010), 2010, : 506 - 509
  • [25] DIMENSIONALITY REDUCTION BY SUPERVISED LOCALITY ANALYSIS
    Zhang, Lei
    Peng, Peipei
    Xiang, Xuezhi
    Zhen, Xiantong
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1488 - 1492
  • [26] Kernel dimensionality reduction for supervised learning
    Fukumizu, K
    Bach, FR
    Jordan, MI
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 81 - 88
  • [27] Clustering Documents using the Document to Vector Model for Dimensionality Reduction
    Radu, Robert-George
    Radulescu, Iulia-Maria
    Truica, Ciprian-Octavian
    Apostol, Elena-Simona
    Mocanu, Mariana
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR), 2020, : 57 - 62
  • [28] Semi-supervised dimensionality reduction using pairwise equivalence constraints
    Cevikalp, Hakan
    Verbeek, Jakob
    Jurie, Frederic
    Klaser, Alexander
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2008, : 489 - 496
  • [29] Supervised Dimensionality Reduction of Proportional Data Using Exponential Family Distributions
    Masoudimansour, Walid
    Bouguila, Nizar
    ELECTRONICS, 2023, 12 (15)
  • [30] Dimensionality reduction and clustering on statistical manifolds
    Lee, Sang-Mook
    Abbott, A. Lynn
    Ararnan, Philip A.
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3125 - +