Efficient and robust active learning methods for interactive database exploration

被引:0
|
作者
Huang, Enhui [1 ]
Diao, Yanlei [1 ,2 ]
Liu, Anna [2 ]
Peng, Liping [2 ]
Palma, Luciano Di [1 ]
机构
[1] Ecole Polytech, Palaiseau, France
[2] Univ Massachusetts Amherst, Amherst, MA USA
来源
VLDB JOURNAL | 2024年 / 33卷 / 04期
基金
欧洲研究理事会;
关键词
Interactive data exploration; Active learning; Label noise; IMBALANCED DATA; QUERY; EXAMPLE; CLASSIFICATION; SEARCH; NOISE;
D O I
10.1007/s00778-023-00816-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
There is an increasing gap between fast growth of data and the limited human ability to comprehend data. Consequently, there has been a growing demand of data management tools that can bridge this gap and help the user retrieve high-value content from data more effectively. In this work, we propose an interactive data exploration system as a new database service, using an approach called "explore-by-example." Our new system is designed to assist the user in performing highly effective data exploration while reducing the human effort in the process. We cast the explore-by-example problem in a principled "active learning" framework. However, traditional active learning suffers from two fundamental limitations: slow convergence and lack of robustness under label noise. To overcome the slow convergence and label noise problems, we bring the properties of important classes of database queries to bear on the design of new algorithms and optimizations for active learning-based database exploration. Evaluation results using real-world datasets and user interest patterns show that our new system, both in the noise-free case and in the label noise case, significantly outperforms state-of-the-art active learning techniques and data exploration systems in accuracy while achieving the desired efficiency for interactive data exploration.
引用
收藏
页码:931 / 956
页数:26
相关论文
共 50 条
  • [31] An Exploration of Machine Learning Methods for Robust Boredom Classification Using EEG and GSR Data
    Seo, Jungryul
    Laine, Teemu H.
    Sohn, Kyung-Ah
    SENSORS, 2019, 19 (20)
  • [32] Exploration from Demonstration for Interactive Reinforcement Learning
    Subramanian, Kaushik
    Isbell, Charles L., Jr.
    Thomaz, Andrea L.
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 447 - 456
  • [33] Interactive Metabolism, a simple and robust active learning tool that improves the biochemistry knowledge of undergraduate students
    Pereira Lopes Alves de Franca, Vitoria Costa
    Campos, Wellington Ferreira
    ADVANCES IN PHYSIOLOGY EDUCATION, 2021, 45 (02) : 353 - 364
  • [34] FSDA: A MATLAB toolbox for robust analysis and interactive data exploration
    Riani, Marco
    Perrotta, Domenico
    Torti, Francesca
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 116 : 17 - 32
  • [35] Active teaching for an interactive learning robot
    Katagami, D
    Yamada, S
    RO-MAN 2003: 12TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2003, : 181 - 186
  • [36] Towards Safe and Efficient Analog Circuit Design: Active Learning for Feasibility Region Exploration
    Ata, Sezin Kircali
    Kong, Zhi Hui
    James, Anusha
    Cai, Lile
    Yeo, Kiat Seng
    Aung, Khin Mi Mi
    Foo, Chuan Sheng
    James, Ashish
    2024 IEEE THE 20TH ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, APCCAS 2024, 2024, : 193 - 197
  • [37] Accelerating high-entropy alloy discovery: efficient exploration via active learning
    Sulley, Gloria A.
    Raush, Jonathan
    Montemore, Matthew M.
    Hamm, Jihun
    SCRIPTA MATERIALIA, 2024, 249
  • [38] Corruption Robust Active Learning
    Chen, Yifang
    Du, Simon S.
    Jamieson, Kevin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [39] Active learning for interactive multimedia retrieval
    Huang, Thomas S.
    Dagli, Charlie K.
    Rajaram, Shyamsundar
    Chang, Edward Y.
    Mandel, Michael I.
    Poliner, Graham E.
    Ellis, Daniel P. W.
    PROCEEDINGS OF THE IEEE, 2008, 96 (04) : 648 - 667
  • [40] On Robust Grouping Active Learning
    Li, Changsheng
    Yang, Chen
    Liang, Lingyan
    Yuan, Ye
    Wang, Guoren
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (01): : 103 - 112