A Parallel and Efficient Algorithm for Learning to Match

Cited by: 3

Authors:

Shang, Jingbo [1,4]

Chen, Tianqi [2]

Li, Hang [3]

Lu, Zhengdong [3]

Yu, Yong [4]

Affiliations:

[1] Univ Illinois, Champaign, IL 61801 USA

[2] Univ Washington, Seattle, WA 98195 USA

[3] Huawei Noah's Ark Lab, Hong Kong, Hong Kong, Peoples R China

[4] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

Source:

2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2014

Keywords:

MATRIX FACTORIZATION;

DOI:

10.1109/ICDM.2014.71

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Many tasks in data mining and related fields can be formalized as matching between objects in two heterogeneous domains, including collaborative filtering, link prediction, image tagging, and web search. Machine learning techniques, referred to as learning-to-match in this paper, have been successfully applied to these problems. Among them, a class of state-of-the-art methods, named feature-based matrix factorization, formalizes the task as an extension of matrix factorization by incorporating auxiliary features into the model. Unfortunately, making those algorithms scale to real-world problems is challenging, and simple parallelization strategies fail due to the complex cross-talk patterns between sub-tasks. In this paper, we tackle this challenge with a novel parallel and efficient algorithm. Our algorithm, based on coordinate descent, can easily handle hundreds of millions of instances and features on a single machine. The key recipe of this algorithm is an iterative relaxation of the objective to facilitate parallel updates of parameters, with guaranteed convergence on minimizing the original objective function. Experimental results demonstrate that the proposed method is effective on a wide range of matching problems, with efficiency significantly improved over the baselines while accuracy remains unchanged.
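
The abstract does not state the model formula, so the following is only a minimal sketch of what "matrix factorization with auxiliary features" can look like, assuming a commonly used form in which each side's latent vector is a feature-weighted sum of per-feature latent vectors and the matching score is their inner product. The names `latent_vector`, `predict`, `alpha`, `beta`, `P`, and `Q` and the toy numbers are hypothetical and not taken from the paper; bias terms, the loss, and the paper's parallel coordinate descent are deliberately left out.

```python
def latent_vector(features, factors):
    """Weighted sum of per-feature latent vectors.

    features: dict mapping feature index -> feature value (sparse vector)
    factors:  list of latent vectors, one list of k floats per feature
    """
    k = len(factors[0])
    out = [0.0] * k
    for idx, value in features.items():
        for d in range(k):
            out[d] += value * factors[idx][d]
    return out


def predict(alpha, beta, P, Q):
    """Matching score: inner product of the two aggregated latent vectors."""
    u = latent_vector(alpha, P)
    v = latent_vector(beta, Q)
    return sum(a * b for a, b in zip(u, v))


# Hypothetical toy factors: 2 features per side, latent dimension k = 2.
P = [[1.0, 0.0], [0.5, 0.5]]
Q = [[2.0, 1.0], [0.0, 4.0]]

# One-hot features on both sides reduce to plain matrix factorization:
# the score is simply the inner product of P[0] and Q[1].
plain = predict({0: 1.0}, {1: 1.0}, P, Q)

# An auxiliary feature on the first side shifts its latent vector.
with_aux = predict({0: 1.0, 1: 2.0}, {1: 1.0}, P, Q)
```

Under this assumed form, plain matrix factorization is the special case where each object carries a single one-hot identifier feature; auxiliary features simply add more terms to the weighted sum.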


Pages: 971-976

Number of pages: 6
