Multi-modal RGB–Depth–Thermal Human Body Segmentation

被引：0

作者：

Cristina Palmero

Albert Clapés

Chris Bahnsen

Andreas Møgelmose

Thomas B. Moeslund

Sergio Escalera

机构：

[1] UB,Dept. Matemàtica Aplicada i Anàlisi

[2] Computer Vision Center,undefined

[3] Aalborg University,undefined

来源：

International Journal of Computer Vision | 2016年 / 118卷

关键词：

Human body segmentation; RGB; Depth; Thermal;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This work addresses the problem of human body segmentation from multi-modal visual cues as a first stage of automatic human behavior analysis. We propose a novel RGB–depth–thermal dataset along with a multi-modal segmentation baseline. The several modalities are registered using a calibration device and a registration algorithm. Our baseline extracts regions of interest using background subtraction, defines a partitioning of the foreground regions into cells, computes a set of image features on those cells using different state-of-the-art feature extractions, and models the distribution of the descriptors per cell using probabilistic models. A supervised learning algorithm then fuses the output likelihoods over cells in a stacked feature vector representation. The baseline, using Gaussian mixture models for the probabilistic modeling and Random Forest for the stacked learning, is superior to other state-of-the-art methods, obtaining an overlap above 75 % on the novel dataset when compared to the manually annotated ground-truth of human segmentations.

引用

页码：217 / 239

页数：22

共 50 条

[41] Comprehensive Multi-Modal Interactions for Referring Image Segmentation
Jain, Kanishk
Gandhi, Vineet
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3427 - 3435
[42] Unpaired multi-modal tumor segmentation with structure adaptation
Pei Zhou
Houjin Chen
Yanfeng Li
Yahui Peng
Applied Intelligence, 2023, 53 : 3639 - 3651
[43] Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation
Kim, Kyungmin
SENSORS, 2024, 24 (23)
[44] MULTI-MODAL SEMANTIC MESH SEGMENTATION IN URBAN SCENES
Laupheimer, Dominik
Haala, Norbert
XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 267 - 274
[45] Evaluation of Multi-modal Image Segmentation for Radiotherapy Planning
Malladi, Sreeja
Purohit, Sanket
Brahme, Advait
Scott, Julia A.
IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, MEDICAL IMAGING 2024, 2024, 12931
[46] Unpaired Multi-Modal Segmentation via Knowledge Distillation
Dou, Qi
Liu, Quande
Heng, Pheng Ann
Glocker, Ben
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (07) : 2415 - 2425
[47] PSALM: Pixelwise SegmentAtion with Large Multi-modal Model
Zhang, Zheng
Ma, Yeyao
Zhang, Enming
Bai, Xiang
COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 74 - 91
[48] Multi-Modal RGB-D Scene Recognition Across Domains
Ferreri, Andrea
Bucci, Silvia
Tommasi, Tatiana
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2199 - 2208
[49] The origin of human multi-modal communication
Levinson, Stephen C.
Holler, Judith
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2014, 369 (1651)
[50] Lightweight Multi-modal Representation Learning for RGB Salient Object Detection
Xiao, Yun
Huang, Yameng
Li, Chenglong
Liu, Lei
Zhou, Aiwu
Tang, Jin
COGNITIVE COMPUTATION, 2023, 15 (06) : 1868 - 1883

← 1 2 3 4 5 →