Apache Wayang: A Unified Data Analytics Framework

Cited by: 2
Authors
Beedkar, Kaustubh [1, 4]
Contreras-Rojas, Bertty [2]
Gavriilidis, Haralampos [2]
Kaoudi, Zoi [3, 4]
Markl, Volker [2]
Pardo-Meza, Rodrigo [2]
Quiane-Ruiz, Jorge-Arnulfo [3, 4]
Affiliations
[1] Indian Inst Technol Delhi, New Delhi, India
[2] Tech Univ Berlin, Berlin, Germany
[3] IT Univ Copenhagen, Copenhagen, Denmark
[4] Databloom Inc, Miami, FL 33127 USA
DOI
10.1145/3631504.3631510
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The large variety of specialized data processing platforms and the increased complexity of data analytics have led to the need for unifying data analytics within a single framework. Such a framework should free users from the burden of (i) choosing the right platform(s) and (ii) gluing code between the different parts of their pipelines. Apache Wayang (Incubating) is the only open-source framework that provides a systematic solution to unified data analytics by integrating multiple heterogeneous data processing platforms. It achieves that by decoupling applications from the underlying platforms and providing an optimizer so that users do not have to specify the platforms on which their pipeline should run. Wayang provides a unified view and processing model, effectively integrating the hodgepodge of heterogeneous platforms into a single framework with increased usability without sacrificing performance or total cost of ownership. In this paper, we present the architecture of Wayang, describe its main components, and give an outlook on future directions.
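The idea of decoupling a pipeline from its execution platform and letting an optimizer pick the platform can be illustrated with a toy sketch. This is not Wayang's API or its optimizer: the `Operator` class, the platform names `local` and `cluster`, and the cost numbers are all hypothetical, and the sketch ignores, for instance, the cost of moving data between platforms.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Operator:
    """A platform-agnostic pipeline step (hypothetical, for illustration)."""
    name: str
    input_rows: int


# Hypothetical cost models: a fixed startup cost plus a per-row cost.
PLATFORM_COSTS: Dict[str, Dict[str, float]] = {
    "local": {"startup": 1.0, "per_row": 0.010},
    "cluster": {"startup": 500.0, "per_row": 0.001},
}


def estimate_cost(platform: str, op: Operator) -> float:
    """Estimate the cost of running one operator on one platform."""
    model = PLATFORM_COSTS[platform]
    return model["startup"] + model["per_row"] * op.input_rows


def choose_platform(op: Operator) -> str:
    """Pick the platform with the lowest estimated cost for this operator."""
    return min(PLATFORM_COSTS, key=lambda p: estimate_cost(p, op))


def plan(pipeline: List[Operator]) -> Dict[str, str]:
    """Map every operator name to its chosen platform."""
    return {op.name: choose_platform(op) for op in pipeline}


# The user describes only the operators, never the platforms.
pipeline = [
    Operator("filter_small", 1_000),
    Operator("join_large", 10_000_000),
]
assignment = plan(pipeline)
print(assignment)
```

Under these assumed costs, the small operator stays on the low-startup platform while the large one goes to the platform with the cheaper per-row cost, without the user naming a platform anywhere in the pipeline.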
Pages: 30-35
Number of pages: 6