CLIC: An Extensible and Efficient Cross-Platform Data Analytics System

被引:0
|
作者
Chen, Qixiang [1 ]
Chen, Zhijun [1 ]
Zhang, Kai [1 ]
Wang, X. Sean [1 ]
机构
[1] Fudan Univ, Sch Comp Sci Technol, Shanghai 200437, Peoples R China
关键词
Data analysis; data processing; data systems; systems;
D O I
10.1109/TPDS.2023.3298038
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the ever-increasing data volume and application diversity, a modern data analytics job is generally built as a workflow consisting of multiple tasks. For either specific functionalities or higher performance, tasks in a workflow may need to be deployed on different data processing platforms. This article proposes CLIC, a highly extensible system for efficient cross-platform data analytics. To leverage the advantage of diverse platforms while alleviating development efforts, we propose an embedding-based operator encoding scheme and a Graph Convolutional Network model for efficient platform selection. Aiming at flexibly integrating new operators and platforms, CLIC is designed with a highly extensible system architecture that decouples the core functionalities from backend platforms. Experiments show that CLIC can significantly improve the performance of modern data analysis workflows with fast platform selection.
引用
收藏
页码:34 / 45
页数:12
相关论文
共 50 条
  • [31] FusionLearn: a biomarker selection algorithm on cross-platform data
    Gao, Xin
    Zhong, Yuan
    BIOINFORMATICS, 2019, 35 (21) : 4465 - 4468
  • [32] Cross-Platform Data Processing: Use Cases and Challenges
    Kaoudi, Zoi
    Quiane-Ruiz, Jorge-Arnulfo
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1723 - 1726
  • [33] methyLiftover: cross-platform DNA methylation data integration
    Titus, Alexander J.
    Houseman, E. Andres
    Johnson, Kevin C.
    Christensen, Brock C.
    BIOINFORMATICS, 2016, 32 (16) : 2517 - 2519
  • [34] Cross-platform independence
    Clarke, RA
    DR DOBBS JOURNAL, 1999, 24 (12): : 10 - 10
  • [35] Cross-Platform Analysis with Binarized Gene Expression Data
    Tuna, Salih
    Niranjan, Mahesan
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2009, 5780 : 439 - 449
  • [36] A Model for Cross-Platform Searches in Temporal Microarray Data
    Tusch, Guenter
    Tole, Olvi
    Hoinski, Mary Ellen
    ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2015), 2015, 9105 : 153 - 158
  • [37] On cross-platform security
    Gong, L
    COMPUTER SYSTEMS: THEORY, TECHNOLOGY AND APPLICATIONS: A TRIBUTE TO ROGER NEEDHAM, 2004, : 89 - 91
  • [38] Cross-platform design
    Bond, T
    DR DOBBS JOURNAL, 1999, 24 (11): : 10 - 10
  • [39] BCL: A Cross-Platform Distributed Data Structures Library
    Brock, Benjamin
    Buluc, Aydin
    Yelick, Katherine
    PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,
  • [40] A framework for efficient and rapid development of cross-platform audio applications
    Xavier Amatriain
    Pau Arumi
    David Garcia
    Multimedia Systems, 2008, 14 : 15 - 32