Efficient and robust active learning methods for interactive database exploration

被引:0
|
作者
Huang, Enhui [1 ]
Diao, Yanlei [1 ,2 ]
Liu, Anna [2 ]
Peng, Liping [2 ]
Palma, Luciano Di [1 ]
机构
[1] Ecole Polytech, Palaiseau, France
[2] Univ Massachusetts Amherst, Amherst, MA USA
来源
VLDB JOURNAL | 2024年 / 33卷 / 04期
基金
欧洲研究理事会;
关键词
Interactive data exploration; Active learning; Label noise; IMBALANCED DATA; QUERY; EXAMPLE; CLASSIFICATION; SEARCH; NOISE;
D O I
10.1007/s00778-023-00816-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
There is an increasing gap between fast growth of data and the limited human ability to comprehend data. Consequently, there has been a growing demand of data management tools that can bridge this gap and help the user retrieve high-value content from data more effectively. In this work, we propose an interactive data exploration system as a new database service, using an approach called "explore-by-example." Our new system is designed to assist the user in performing highly effective data exploration while reducing the human effort in the process. We cast the explore-by-example problem in a principled "active learning" framework. However, traditional active learning suffers from two fundamental limitations: slow convergence and lack of robustness under label noise. To overcome the slow convergence and label noise problems, we bring the properties of important classes of database queries to bear on the design of new algorithms and optimizations for active learning-based database exploration. Evaluation results using real-world datasets and user interest patterns show that our new system, both in the noise-free case and in the label noise case, significantly outperforms state-of-the-art active learning techniques and data exploration systems in accuracy while achieving the desired efficiency for interactive data exploration.
引用
收藏
页码:931 / 956
页数:26
相关论文
共 50 条
  • [1] Optimization for Active Learning-based Interactive Database Exploration
    Huang, Enhui
    Peng, Liping
    Di Palma, Luciano
    Abdelkafi, Ahmed
    Liu, Anna
    Diao, Yanlei
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 12 (01): : 71 - 84
  • [2] Robust Intrinsically Motivated Exploration and Active Learning
    Baranes, Adrien
    Oudeyer, Pierre-Yves
    2009 IEEE 8TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, 2009, : 124 - 129
  • [3] Active learning methods for interactive image retrieval
    Gosselin, Philippe Henri
    Cord, Matthieu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2008, 17 (07) : 1200 - 1211
  • [4] Coaching the Exploration and Exploitation in Active Learning for Interactive Video Retrieval
    Wei, Xiao-Yong
    Yang, Zhen-Qun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (03) : 955 - 968
  • [5] Query Recommendations for Interactive Database Exploration
    Chatzopoulou, Gloria
    Eirinaki, Magdalini
    Polyzotis, Neoklis
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2009, 5566 : 3 - +
  • [6] AN ACTIVE EXPLORATION METHOD FOR DATA EFFICIENT REINFORCEMENT LEARNING
    Zhao, Dongfang
    Liu, Jiafeng
    Wu, Rui
    Cheng, Dansong
    Tang, Xianglong
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2019, 29 (02) : 351 - 362
  • [7] AIDE: An Active Learning-Based Approach for Interactive Data Exploration
    Dimitriadou, Kyriaki
    Papaemmanouil, Olga
    Diao, Yanlei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (11) : 2842 - 2856
  • [8] An Active Learning Framework for Efficient Robust Policy Search
    Narayanaswami, Sai Kiran
    Sudarsanam, Nandan
    Ravindran, Balaraman
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 1 - 9
  • [9] Interactive methods for graph exploration
    Loubier, Eloise
    JOURNAL OF INTELLIGENCE STUDIES IN BUSINESS, 2012, 2 (01): : 21 - +
  • [10] Probabilistic Database Summarization for Interactive Data Exploration
    Orr, Laurel
    Balazinska, Magdalena
    Suciu, Dan
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (10): : 1154 - 1165