MLlib: Machine learning in Apache Spark

被引:0
|
作者
Meng, Xiangrui [1 ]
Bradley, Joseph [1 ]
Yavuz, Burak [1 ]
Sparks, Evan [2 ]
Venkataraman, Shivaram [2 ]
Liu, Davies [1 ]
Freeman, Jeremy [3 ]
Tsai, D.B. [4 ]
Amde, Manish [5 ]
Owen, Sean [6 ]
Xin, Doris [7 ]
Xin, Reynold [1 ]
Franklin, Michael J. [2 ]
Zadeh, Reza [8 ]
Zaharia, Matei [9 ]
Talwalkar, Ameet [10 ]
机构
[1] Databricks, 160 Spear Street, 13th Floor, San Francisco,CA,94105, United States
[2] UC Berkeley, 465 Soda Hall, Berkeley,CA,94720, United States
[3] HHMI Janelia Research Campus, 19805 Helix Dr, Ashburn,VA,20147, United States
[4] Netflix, 970 University Ave, Los Gatos,CA,95032, United States
[5] Origami Logic, 1134 Crane Street, Menlo Park,CA,94025, United States
[6] Cloudera UK, 33 Creechurch Lane, London,EC3A 5EB, United Kingdom
[7] UIUC, 201 N Goodwin Ave, Urbana,IL,61801, United States
[8] Stanford, Databricks, 475 Via Ortega, Stanford,CA,94305, United States
[9] MIT, Databricks, 160 Spear Street, 13th Floor, San Francisco,CA,94105, United States
[10] UCLA, Databricks, 4732 Boelter Hall, Los Angeles,CA,90095, United States
关键词
Artificial intelligence - Learning systems - Linear algebra - Data handling - High level languages;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [1] MLlib: Machine Learning in Apache Spark
    Meng, Xiangrui
    Bradley, Joseph
    Yavuz, Burak
    Sparks, Evan
    Venkataraman, Shivaram
    Liu, Davies
    Freeman, Jeremy
    Tsai, D. B.
    Amde, Manish
    Owen, Sean
    Xin, Doris
    Xin, Reynold
    Franklin, Michael J.
    Zadeh, Reza
    Zaharia, Matei
    Talwalkar, Ameet
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [2] Big Data Machine Learning using Apache Spark MLlib
    Assefi, Mehdi
    Behravesh, Ehsun
    Liu, Guangchi
    Tafti, Ahmad P.
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3492 - 3498
  • [3] Performance evaluation of DNN with other machine learning techniques in a cluster using Apache Spark and MLlib
    JayaLakshmi, A. N. M.
    Kishore, K. V. Krishna
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (01) : 1311 - 1319
  • [4] Comparative Study of Apache Spark MLlib Clustering Algorithms
    Harifi, Sasan
    Byagowi, Ebrahim
    Khalilian, Madjid
    DATA MINING AND BIG DATA, DMBD 2017, 2017, 10387 : 61 - 73
  • [5] Privacy-Preserving Machine Learning on Apache Spark
    Brito, Claudia V.
    Ferreira, Pedro G.
    Portela, Bernardo L.
    Oliveira, Rui C.
    Paulo, Joao T.
    IEEE ACCESS, 2023, 11 : 127907 - 127930
  • [6] Optimizing Machine Learning on Apache Spark in HPC Environments
    Li, Zhenyu
    Davis, James
    Jarvis, Stephen A.
    PROCEEDINGS OF 2018 IEEE/ACM MACHINE LEARNING IN HPC ENVIRONMENTS (MLHPC 2018), 2018, : 95 - 105
  • [8] On Scalability of Distributed Machine Learning with Big Data on Apache Spark
    Hai, Ameen Abdel
    Forouraghi, Babak
    BIG DATA - BIGDATA 2018, 2018, 10968 : 209 - 219
  • [9] Network Intrusion Detection on Apache Spark with Machine Learning Algorithms
    Kurt, Elif Merve
    Becerikli, Yasar
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2018, 2018, 893 : 130 - 141
  • [10] MLlib*: Fast Training of GLMs using Spark MLlib
    Zhang, Zhipeng
    Jiang, Jiawei
    Wu, Wentao
    Zhang, Ce
    Yu, Lele
    Cui, Bin
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1778 - 1789