Learning Large Scale Ordinal Ranking Model via Divide-and-Conquer Technique

Authors
Tang, Lu [1]
Chaudhuri, Sougata [2]
Bagherjeiran, Abraham [2]
Zhou, Ling [1]
Affiliations
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] A9 Com Inc, Palo Alto, CA USA
Keywords
Binary Classification; Ordinal Ranking; Big Data;
DOI
10.1145/3184558.3191658
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Structured prediction, where outcomes have a precedence order, lies at the heart of machine learning for information retrieval, movie recommendation, product review prediction, and digital advertising. Ordinal ranking, in particular, assumes that the structured response has a linear ranked order. Because these models are so widely applicable, substantial research has been devoted to understanding them and to developing efficient training techniques. One popular and widely cited technique for training ordinal ranking models is to exploit the linear precedence order and systematically reduce the problem to binary classification. This allows the use of readily available, powerful binary classifiers, but it requires expanding the original training data to K - 1 times its original size, with K being the number of ordinal classes. Because problems with a large number of ordered classes are prevalent, the reduction leads to datasets that are too large to train on a single machine. While approximation methods like stochastic gradient descent are typically applied here, we investigate exact optimization solutions that can scale. In this paper, we present a divide-and-conquer (DC) algorithm, which partitions large-scale binary classification data across a cluster of machines, trains logistic models in parallel, and combines them at the end of the training phase into a single binary classifier, which can then be used as an ordinal ranker. It requires no synchronization between the parallel learning algorithms during the training period, which makes training on large datasets feasible and efficient. We prove the consistency and asymptotic normality of the models learned by our proposed algorithm. We provide empirical evidence, on various ordinal datasets, that the model learned with our algorithm achieves better estimation and prediction performance than models from several standard divide-and-conquer algorithms.
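The pipeline described in the abstract can be illustrated with a minimal Python sketch. It rests on several assumptions that the abstract does not confirm: the reduction appends a one-hot threshold indicator to each feature vector and labels the copy 1 when y > k, each shard is fit by Newton's method with a tiny ridge term for numerical stability, and the local models are combined by plain averaging. Plain averaging is a standard divide-and-conquer baseline used here as a stand-in, not the authors' combination rule. All function names are hypothetical.

```python
import numpy as np

def expand_ordinal_to_binary(X, y, K):
    """Turn n ordinal examples (labels 1..K) into n * (K - 1) binary examples.

    Assumed encoding: for each threshold k = 1..K-1 the features become
    [x, one_hot(k)] and the binary label is 1 if y > k, else 0.
    """
    n = X.shape[0]
    thresholds = np.arange(1, K)
    X_rep = np.repeat(X, K - 1, axis=0)           # each row repeated K - 1 times
    T = np.tile(np.eye(K - 1), (n, 1))            # threshold indicator columns
    Z = np.hstack([X_rep, T])
    b = (np.repeat(y, K - 1) > np.tile(thresholds, n)).astype(float)
    return Z, b

def fit_logistic(Z, b, n_iter=25, ridge=1e-6):
    """Fit logistic regression on one shard with Newton's method."""
    w = np.zeros(Z.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-Z @ w))
        grad = Z.T @ (p - b) + ridge * w
        hess = (Z * (p * (1.0 - p))[:, None]).T @ Z + ridge * np.eye(Z.shape[1])
        w -= np.linalg.solve(hess, grad)
    return w

def divide_and_conquer_fit(Z, b, n_machines, seed=0):
    """Shuffle, split into shards, fit each shard independently, then average.

    The shards never exchange information during fitting. Averaging is an
    assumed stand-in for the paper's combination step.
    """
    rng = np.random.default_rng(seed)
    shards = np.array_split(rng.permutation(len(b)), n_machines)
    local_models = [fit_logistic(Z[s], b[s]) for s in shards]
    return np.mean(local_models, axis=0)

def predict_rank(X, w, K):
    """Predicted rank = 1 + number of thresholds k with P(y > k | x) > 0.5."""
    d = X.shape[1]
    scores = X @ w[:d]
    return 1 + (scores[:, None] + w[d:][None, :] > 0).sum(axis=1)
```

With n examples and K classes, `expand_ordinal_to_binary` returns n * (K - 1) rows, which is the data growth the abstract refers to. The loop over shards in `divide_and_conquer_fit` runs sequentially here; on a cluster each call to `fit_logistic` would run on its own machine.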
Pages: 1901-1909
Page count: 9