SEEDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics

Cited by: 155
Authors
Vartak, Manasi [1]
Rahman, Sajjadur
Madden, Samuel [1]
Parameswaran, Aditya [2]
Polyzotis, Neoklis [3]
Affiliations
[1] MIT, Cambridge, MA 02139 USA
[2] Univ Illinois UIUC, Champaign, IL USA
[3] Google, Mountain View, CA 94043 USA
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2015 / Vol. 8 / No. 13
Funding
U.S. National Science Foundation
DOI
10.14778/2831360.2831371
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Data analysts often build visualizations as the first step in their analytical workflow. However, when working with high-dimensional datasets, identifying visualizations that show relevant or desired trends in data can be laborious. We propose SEEDB, a visualization recommendation engine to facilitate fast visual analysis: given a subset of data to be studied, SEEDB intelligently explores the space of visualizations, evaluates promising visualizations for trends, and recommends those it deems most "useful" or "interesting". The two major obstacles in recommending interesting visualizations are (a) scale: evaluating a large number of candidate visualizations while responding within interactive time scales, and (b) utility: identifying an appropriate metric for assessing interestingness of visualizations. For the former, SEEDB introduces pruning optimizations to quickly identify high-utility visualizations and sharing optimizations to maximize sharing of computation across visualizations. For the latter, as a first step, we adopt a deviation-based metric for visualization utility, while indicating how we may be able to generalize it to other factors influencing utility. We implement SEEDB as a middleware layer that can run on top of any DBMS. Our experiments show that our framework can identify interesting visualizations with high accuracy. Our optimizations lead to multiple orders of magnitude speedup on relational row and column stores and provide recommendations at interactive time scales. Finally, we demonstrate via a user study the effectiveness of our deviation-based utility metric and the value of recommendations in supporting visual analytics.
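To make the idea of a deviation-based utility metric concrete, the following is a minimal sketch, not the paper's implementation. The abstract does not define the metric, so the sketch assumes that "deviation" means the distance between the normalized aggregate distribution of the subset under study and that of a reference dataset, and it picks L1 distance purely for illustration. The function names, the SUM aggregate, and the sample rows are all hypothetical, and none of the pruning or sharing optimizations are modeled.

```python
from collections import defaultdict


def normalized_aggregate(rows, dimension, measure):
    """SUM(measure) grouped by dimension, rescaled so the groups sum to 1."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[dimension]] += row[measure]
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {group: 0.0 for group in totals}
    return {group: value / grand_total for group, value in totals.items()}


def deviation_utility(target_rows, reference_rows, dimension, measure):
    """Assumed utility: L1 distance between target and reference distributions."""
    target = normalized_aggregate(target_rows, dimension, measure)
    reference = normalized_aggregate(reference_rows, dimension, measure)
    groups = set(target) | set(reference)
    return sum(abs(target.get(g, 0.0) - reference.get(g, 0.0)) for g in groups)


def recommend(target_rows, reference_rows, dimensions, measures, k=1):
    """Score every (dimension, measure) view exhaustively and keep the top k."""
    scored = [
        (deviation_utility(target_rows, reference_rows, d, m), d, m)
        for d in dimensions
        for m in measures
    ]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:k]


# Hypothetical sample data: the reference is balanced across both attributes,
# while the subset under study contains only the "east" region.
reference_rows = [
    {"region": "east", "segment": "a", "sales": 10.0},
    {"region": "west", "segment": "b", "sales": 10.0},
    {"region": "east", "segment": "b", "sales": 10.0},
    {"region": "west", "segment": "a", "sales": 10.0},
]
target_rows = [
    {"region": "east", "segment": "a", "sales": 30.0},
    {"region": "east", "segment": "b", "sales": 10.0},
]

top_views = recommend(target_rows, reference_rows, ["region", "segment"], ["sales"])
```

Under these assumptions the view grouped by `region` ranks above the one grouped by `segment`, because the subset's sales are concentrated in a single region while the reference splits them evenly. Scoring every candidate view this way is exactly the cost that the scale obstacle in the abstract refers to.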
Pages: 2182-2193
Page count: 12