Multi-modal RGB–Depth–Thermal Human Body Segmentation

被引:0
|
作者
Cristina Palmero
Albert Clapés
Chris Bahnsen
Andreas Møgelmose
Thomas B. Moeslund
Sergio Escalera
机构
[1] UB,Dept. Matemàtica Aplicada i Anàlisi
[2] Computer Vision Center,undefined
[3] Aalborg University,undefined
来源
关键词
Human body segmentation; RGB; Depth; Thermal;
D O I
暂无
中图分类号
学科分类号
摘要
This work addresses the problem of human body segmentation from multi-modal visual cues as a first stage of automatic human behavior analysis. We propose a novel RGB–depth–thermal dataset along with a multi-modal segmentation baseline. The several modalities are registered using a calibration device and a registration algorithm. Our baseline extracts regions of interest using background subtraction, defines a partitioning of the foreground regions into cells, computes a set of image features on those cells using different state-of-the-art feature extractions, and models the distribution of the descriptors per cell using probabilistic models. A supervised learning algorithm then fuses the output likelihoods over cells in a stacked feature vector representation. The baseline, using Gaussian mixture models for the probabilistic modeling and Random Forest for the stacked learning, is superior to other state-of-the-art methods, obtaining an overlap above 75 % on the novel dataset when compared to the manually annotated ground-truth of human segmentations.
引用
收藏
页码:217 / 239
页数:22
相关论文
共 50 条
  • [41] Comprehensive Multi-Modal Interactions for Referring Image Segmentation
    Jain, Kanishk
    Gandhi, Vineet
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3427 - 3435
  • [42] Unpaired multi-modal tumor segmentation with structure adaptation
    Pei Zhou
    Houjin Chen
    Yanfeng Li
    Yahui Peng
    Applied Intelligence, 2023, 53 : 3639 - 3651
  • [43] Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation
    Kim, Kyungmin
    SENSORS, 2024, 24 (23)
  • [44] MULTI-MODAL SEMANTIC MESH SEGMENTATION IN URBAN SCENES
    Laupheimer, Dominik
    Haala, Norbert
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 267 - 274
  • [45] Evaluation of Multi-modal Image Segmentation for Radiotherapy Planning
    Malladi, Sreeja
    Purohit, Sanket
    Brahme, Advait
    Scott, Julia A.
    IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, MEDICAL IMAGING 2024, 2024, 12931
  • [46] Unpaired Multi-Modal Segmentation via Knowledge Distillation
    Dou, Qi
    Liu, Quande
    Heng, Pheng Ann
    Glocker, Ben
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (07) : 2415 - 2425
  • [47] PSALM: Pixelwise SegmentAtion with Large Multi-modal Model
    Zhang, Zheng
    Ma, Yeyao
    Zhang, Enming
    Bai, Xiang
    COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 74 - 91
  • [48] Multi-Modal RGB-D Scene Recognition Across Domains
    Ferreri, Andrea
    Bucci, Silvia
    Tommasi, Tatiana
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2199 - 2208
  • [49] The origin of human multi-modal communication
    Levinson, Stephen C.
    Holler, Judith
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2014, 369 (1651)
  • [50] Lightweight Multi-modal Representation Learning for RGB Salient Object Detection
    Xiao, Yun
    Huang, Yameng
    Li, Chenglong
    Liu, Lei
    Zhou, Aiwu
    Tang, Jin
    COGNITIVE COMPUTATION, 2023, 15 (06) : 1868 - 1883