Project Daytona: Data Analytics as a Cloud Service

被引:14
|
作者
Barga, Roger S. [1 ]
Ekanayake, Jaliya [1 ]
Lu, Wei [1 ]
机构
[1] Microsoft Corp, Microsoft Res, eXtreme Comp Grp, Redmond, WA 98053 USA
关键词
D O I
10.1109/ICDE.2012.136
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Spreadsheets are established data collection and analysis tools in business, technical computing and academic research. Excel, for example, offers an attractive user interface, provides an easy to use data entry model, and offers substantial interactivity for what-if analysis. However, spreadsheets and other common client applications do not offer scalable computation for large scale data analytics and exploration. Increasingly researchers in domains ranging from the social sciences to environmental sciences are faced with a deluge of data, often sitting in spreadsheets such as Excel or other client applications, and they lack a convenient way to explore the data, to find related data sets, or to invoke scalable analytical models over the data. To address these limitations, we have developed a cloud data analytics service based on Daytona, which is an iterative MapReduce runtime optimized for data analytics. In our model, Excel and other existing client applications provide the data entry and user interaction surfaces, Daytona provides a scalable runtime on the cloud for data analytics, and our service seamlessly bridges the gap between the client and cloud. Any analyst can use our data analytics service to discover and import data from the cloud, invoke cloud scale data analytics algorithms to extract information from large datasets, invoke data visualization, and then store the data back to the cloud all through a spreadsheet or other client application they are already familiar with.
引用
收藏
页码:1317 / 1320
页数:4
相关论文
共 50 条
  • [41] A Review of Big Data Analytics over Cloud
    Dasoriya, Rayan
    2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2017, : 1 - 6
  • [42] Reproducible and Portable Big Data Analytics in the Cloud
    Wang, Xin
    Guo, Pei
    Li, Xingyan
    Gangopadhyay, Aryya
    Busart, Carl E.
    Freeman, Jade
    Wang, Jianwu
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (03) : 2966 - 2982
  • [43] Big data analytics in Cloud computing: an overview
    Berisha, Blend
    Mëziu, Endrit
    Shabani, Isak
    Journal of Cloud Computing, 2022, 11 (01)
  • [44] Big Data Analytics and Intelligence at Alibaba Cloud
    Zhou, Jingren
    OPERATING SYSTEMS REVIEW, 2017, 51 (02) : 1 - 1
  • [45] Visual Analytics Framework for Cloud Infrastructure Data
    Kejariwal, Arun
    Lee, Winston
    Vallis, Owen
    Hochenbaum, Jordan
    Yan, Bryce
    2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 886 - 893
  • [46] Big data analytics in Cloud computing: an overview
    Blend Berisha
    Endrit Mëziu
    Isak Shabani
    Journal of Cloud Computing, 11
  • [47] Providing Healthcare-as-a-Service Using Fuzzy Rule Based Big Data Analytics in Cloud Computing
    Jindal, Anish
    Dua, Amit
    Kumar, Neeraj
    Das, Ashok Kumar
    Vasilakos, Athanasios V.
    Rodrigues, Joel J. P. C.
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2018, 22 (05) : 1605 - 1618
  • [48] Leveraging the capabilities of service-oriented decision support systems: Putting analytics and big data in cloud
    Demirkan, Haluk
    Delen, Dursun
    DECISION SUPPORT SYSTEMS, 2013, 55 (01) : 412 - 421
  • [49] CLUE: System Trace Analytics for Cloud Service Performance Diagnosis
    Zhang, Hui
    Rhee, Junghwan
    Arora, Nipun
    Gamage, Sahan
    Jiang, Guofei
    Yoshihira, Kenji
    Xu, Dongyan
    2014 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2014,
  • [50] Visor: Privacy-Preserving Video Analytics as a Cloud Service
    Poddar, Rishabh
    Ananthanarayanan, Ganesh
    Setty, Srinath
    Volos, Stavros
    Popa, Raluca Ada
    PROCEEDINGS OF THE 29TH USENIX SECURITY SYMPOSIUM, 2020, : 1039 - 1056