OSCAR: A Semantic-based Data Binning Approach

Cited by: 2
Authors
Setlur, Vidya [1]
Correll, Michael [1]
Battersby, Sarah [1]
Affiliations
[1] Tableau Res, Philadelphia, PA 19146 USA
Keywords
Data-driven semantics; binning; constraints; geospatial; choropleth maps
DOI
10.1109/VIS54862.2022.00029
Chinese Library Classification
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Binning is applied to categorize data values or to see distributions of data. Existing binning algorithms often rely on statistical properties of data. However, there are semantic considerations for selecting appropriate binning schemes. Surveys, for instance, gather respondent data for demographic-related questions such as age, salary, number of employees, etc., that are bucketed into defined semantic categories. In this paper, we leverage common semantic categories from survey data and Tableau Public visualizations to identify a set of semantic binning categories. We employ these semantic binning categories in OSCAR: a method for automatically selecting bins based on the inferred semantic type of the field. We conducted a crowdsourced study with 120 participants to better understand user preferences for bins generated by OSCAR vs. binning provided in Tableau. We find that maps and histograms using binned values generated by OSCAR are preferred by users as compared to binning schemes based purely on the statistical properties of the data.
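The contrast the abstract draws, between bins chosen from a field's semantic type and bins derived only from the data's statistics, can be illustrated with a minimal sketch. This is not OSCAR's implementation: the `SEMANTIC_BINS` table, its edge values, and the function names are hypothetical, chosen only to show the idea of a semantic lookup with a statistical fallback.

```python
from bisect import bisect_right

# Hypothetical semantic bin edges. These values are illustrative assumptions,
# not the semantic categories identified in the paper.
SEMANTIC_BINS = {
    "age": [18, 25, 35, 45, 55, 65],
    "salary": [25_000, 50_000, 75_000, 100_000, 150_000],
}


def equal_width_edges(values, n_bins=5):
    """Statistical fallback: interior edges of n_bins equal-width bins."""
    lo, hi = min(values), max(values)
    step = (hi - lo) / n_bins
    return [lo + step * i for i in range(1, n_bins)]


def choose_edges(semantic_type, values):
    """Use semantic edges when the field type is known, else fall back."""
    return SEMANTIC_BINS.get(semantic_type) or equal_width_edges(values)


def bin_index(value, edges):
    """Index of the bin holding value; bin i covers [edges[i-1], edges[i])."""
    return bisect_right(edges, value)


ages = [17, 18, 24, 25, 40, 64, 65, 80]
edges = choose_edges("age", ages)
indices = [bin_index(a, edges) for a in ages]
```

In this sketch a field tagged `"age"` gets survey-style breakpoints, while a field with an unrecognized type falls back to equal-width bins computed from its minimum and maximum. How the semantic type is inferred is left out here; the abstract states only that OSCAR selects bins from the inferred type.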
Pages: 100-104
Page count: 5