HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge Understanding

Authors
Zheng, Hao [1]
Lee, Regina [1]
Lu, Yuqian [1]
Affiliations
[1] University of Auckland, Department of Mechanical and Mechatronics Engineering, Auckland, New Zealand
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Understanding comprehensive assembly knowledge from videos is critical for futuristic ultra-intelligent industry. To enable technological breakthroughs, we present HA-ViD, an assembly video dataset that features representative industrial assembly scenarios, a natural procedural knowledge acquisition process, and consistent human-robot shared annotations. Specifically, HA-ViD captures diverse collaboration patterns of real-world assembly, natural human behaviors and learning progression during assembly, and granulates action annotations into subject, action verb, manipulated object, target object, and tool. We provide 3222 multi-view and multi-modality videos, 1.5M frames, 96K temporal labels and 2M spatial labels. We benchmark four foundational video understanding tasks: action recognition, action segmentation, object detection and multi-object tracking. Importantly, we analyze their performance and the further reasoning steps for comprehending knowledge in assembly progress, process efficiency, task collaboration, skill parameters and human intention. Details of HA-ViD are available at: https://iai-hrc.github.io/ha-vid.
Pages: 13