QuMinS: Fast and scalable querying, mining and summarizing multi-modal databases

被引:0
|
作者
Cordeiro, Robson L. F. [1 ]
Guo, Fan [2 ]
Haverkamp, Donna S. [3 ]
Horne, James H. [3 ]
Hughes, Ellen K. [3 ]
Kim, Gunhee [2 ]
Romani, Luciana A. S. [4 ]
Coltri, Priscila P. [5 ]
Souza, Tamires T. [1 ]
Traina, Agma J. M. [1 ]
Traina, Caetano, Jr. [1 ]
Faloutsos, Christos [2 ]
机构
[1] Univ Sao Paulo, BR-13560970 Sao Carlos, SP, Brazil
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Sci Applicat Int Corp, Mclean, VA 22102 USA
[4] Embrapa Agr Informat, BR-13083886 Campinas, SP, Brazil
[5] Univ Estadual Campinas, BR-13083970 Campinas, SP, Brazil
基金
美国国家科学基金会; 巴西圣保罗研究基金会;
关键词
Low-labor labeling; Summarization; Outlier detection; Query by example; Clustering; Satellite imagery; IMAGE ANNOTATION; RANDOM-WALK; CLASSIFICATION; RECOGNITION; OBJECT; GRAPH;
D O I
10.1016/j.ins.2013.11.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a large image set, in which very few images have labels, how to guess labels for the remaining majority? How to spot images that need brand new labels different from the predefined ones? How to summarize these data to route the user's attention to what really matters? Here we answer all these questions. Specifically, we propose QuMinS, a fast, scalable solution to two problems: (i) Low-labor labeling (LLL) - given an image set, very few images have labels, find the most appropriate labels for the rest; and (ii) Mining and attention routing - in the same setting, find clusters, the top-N-O outlier images, and the N-R images that best represent the data. Experiments on satellite images spanning up to 2.25 GB show that, contrasting to the state-of-the-art labeling techniques, QuMinS scales linearly on the data size, being up to 40 times faster than top competitors (GCap), still achieving better or equal accuracy, it spots images that potentially require unpredicted labels, and it works even with tiny initial label sets, i.e., nearly five examples. We also report a case study of our method's practical usage to show that QuMinS is a viable tool for automatic coffee crop detection from remote sensing images. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:211 / 229
页数:19
相关论文
共 50 条
  • [21] A multi-modal heterogeneous data mining algorithm using federated learning
    Wei, Xianyong
    JOURNAL OF ENGINEERING-JOE, 2021, 2021 (08): : 458 - 466
  • [22] Mining heterogeneous clinical notes by multi-modal latent topic model
    Wen, Zhi
    Nair, Pratheeksha
    Deng, Chih-Ying
    Lu, Xing Han
    Moseley, Edward
    George, Naomi
    Lindvall, Charlotta
    Li, Yue
    PLOS ONE, 2021, 16 (04):
  • [23] Cascades: Scalable, flexible and composable middleware for multi-modal sensor networking applications
    Huang, J
    Feng, WC
    Bulusu, N
    Feng, WC
    MULTIMEDIA COMPUTING AND NETWORKING 2006, 2006, 6071
  • [24] Visual mining of multi-modal social networks at different abstraction levels
    Singh, Lisa
    Beard, Mitchell
    Getoor, Lise
    11TH INTERNATIONAL CONFERENCE INFORMATION VISUALIZATION, 2007, : 672 - +
  • [25] A multi-modal heterogeneous data mining algorithm using federated learning
    Wei, Xianyong
    Journal of Engineering, 2021, 2021 (08): : 458 - 466
  • [26] An Embedded, Multi-Modal Sensor System for Scalable Robotic and Prosthetic Hand Fingers
    Weiner, Pascal
    Neef, Caterina
    Shibata, Yoshihisa
    Nakamura, Yoshihiko
    Asfour, Tamim
    SENSORS, 2020, 20 (01)
  • [27] Symbolization and Data Mining of Multi-modal Signals using Bag of Systems
    Sannomiya, Chihiro
    Tanaka, Yusuke
    Kamakura, Hironori
    Kurihara, Keisuke
    Neyama, Ryo
    Nawa, Kazunari
    2016 IEEE INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2016, : 233 - 238
  • [28] Popularity Prediction of Social Media based on Multi-Modal Feature Mining
    Hsu, Chih-Chung
    Kang, Li-Wei
    Lee, Chia-Yen
    Lee, Jun-Yi
    Zhang, Zhong-Xuan
    Wu, Shao-Min
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2687 - 2691
  • [29] Robust scalable initialization for Bayesian variational inference with multi-modal Laplace approximations
    Bridgman, Wyatt
    Jones, Reese E.
    Khalil, Mohammad
    PROBABILISTIC ENGINEERING MECHANICS, 2023, 74
  • [30] Heterogeneous Translated Hashing: A Scalable Solution Towards Multi-Modal Similarity Search
    Wei, Ying
    Song, Yangqiu
    Zhen, Yi
    Liu, Bo
    Yang, Qiang
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2016, 10 (04)