Comparison of Visual Datasets for Machine Learning

被引:23
|
作者
Gauen, Kent [1 ]
Dailey, Ryan [1 ]
Laiman, John [1 ]
Zi, Yuxiang [1 ]
Asokan, Nirmal [1 ]
Lu, Yung-Hsiang [1 ]
Thiruvathukal, George K. [2 ]
Shyu, Mei-Ling [3 ]
Chen, Shu-Ching [4 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Loyola Univ, Dept Comp Sci, Chicago, IL 60611 USA
[3] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL 33124 USA
[4] Florida Int Univ, Sch Comp & Informat Sci, Miami, FL 33199 USA
基金
美国国家科学基金会;
关键词
OBJECT;
D O I
10.1109/IRI.2017.59
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the greatest technological improvements in recent years is the rapid progress using machine learning for processing visual data. Among all factors that contribute to this development, datasets with labels play crucial roles. Several datasets are widely reused for investigating and analyzing different solutions in machine learning. Many systems, such as autonomous vehicles, rely on components using machine learning for recognizing objects. This paper compares different visual datasets and frameworks for machine learning. The comparison is both qualitative and quantitative and investigates object detection labels with respect to size, location, and contextual information. This paper also presents a new approach creating datasets using real-time, geo-tagged visual data, greatly improving the contextual information of the data. The data could be automatically labeled by cross-referencing information from other sources (such as weather).
引用
收藏
页码:346 / 355
页数:10
相关论文
共 50 条
  • [41] Robust machine learning applied to terascale astronomical datasets
    Ball, Nicholas M.
    Brunner, Robert J.
    Myers, Adam D.
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVII, 2008, 394 : 201 - +
  • [42] Open Graph Benchmark: Datasets for Machine Learning on Graphs
    Hu, Weihua
    Fey, Matthias
    Zitnik, Marinka
    Dong, Yuxiao
    Ren, Hongyu
    Liu, Bowen
    Catasta, Michele
    Leskovec, Jure
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [43] Anomaly Detection in ICS Datasets with Machine Learning Algorithms
    Mubarak, Sinil
    Habaebi, Mohamed Hadi
    Islam, Md Rafiqul
    Rahman, Farah Diyana Abdul
    Tahir, Mohammad
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 37 (01): : 33 - 46
  • [44] Contemplation of Machine Learning Algorithm under Distinct Datasets
    Shah, Kushagra
    Chaturvedi, Pradhyumn
    Jain, Akagra
    2018 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATION AND TELECOMMUNICATION (ICACAT), 2018,
  • [45] OpenCL based machine learning labeling of biomedical datasets
    Amoros, Oscar
    Escalera, Sergio
    Puig, Anna
    MEDICAL IMAGING 2011: VISUALIZATION, IMAGE-GUIDED PROCEDURES, AND MODELING, 2011, 7964
  • [46] A survey on datasets for fairness-aware machine learning
    Tai Le Quy
    Roy, Arjun
    Iosifidis, Vasileios
    Zhang, Wenbin
    Ntoutsi, Eirini
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 12 (03)
  • [47] Surgical Tool Datasets for Machine Learning Research: A Survey
    Mark Rodrigues
    Michael Mayo
    Panos Patros
    International Journal of Computer Vision, 2022, 130 : 2222 - 2248
  • [48] Machine learning to analyze single-case graphs: A comparison to visual inspection
    Lanovaz, Marc J.
    Hranchuk, Kieva
    JOURNAL OF APPLIED BEHAVIOR ANALYSIS, 2021, 54 (04) : 1541 - 1552
  • [49] Machine learning for discovering missing or wrong protein function annotations A comparison using updated benchmark datasets
    Nakano, Felipe Kenji
    Lietaert, Mathias
    Vens, Celine
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [50] Classification of Vocal Cord Disorders: Comparison Across Voice Datasets, Speech Tasks, and Machine Learning Methods
    Chen, Ching-Chieh
    Hsu, Wei-Cheng
    Lin, Tzu-Han
    Chen, Kuan-Dar
    Tsou, Yung-An
    Liu, Yi-Wen
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1868 - 1873