MCTracker: Satellite video multi-object tracking considering inter-frame motion correlation and multi-scale cascaded feature enhancement

被引:1
|
作者
Wang, Bin [1 ]
Sui, Haigang [1 ]
Ma, Guorui [1 ]
Zhou, Yuan [1 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
关键词
Satellite video; Inter -frame motion correlation; Multi -scale feature; Spatio-temporal fusion; Multi -object tracking;
D O I
10.1016/j.isprsjprs.2024.06.006
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
The video data captured by satellites, colloquially referred to as "gazing," holds significant value for analyzing object status and dynamically tracking movements. Such data and their associated applications play pivotal roles in disaster response, urban traffic monitoring, national defense, and security operations. The multi-object tracking task in satellite video surveillance is inherently more complex than single-object tracking, presenting challenges such as tracking dense small objects and limited capability in multi-object tracking in ambiguous environments. This study proposes a satellite video multi-object tracking (MCTracker) method that considers inter-frame motion correlation and multi-scale cascaded feature enhancement, achieving outstanding performance in multi-object tracking tasks. This study builds upon a differentially encoded backbone network incorporating both global and local information, proposing an Inter-frame Motion Correlation Module (IFMCM) to enhance inter-frame dynamic continuity. A Mixed-scale Enhancement Block (MSEB) is designed to address the challenge of detecting and tracking small objects under multiscale effects. Additionally, a Spatio-temporal Fusion Module (STFM) is introduced to improve the feature fusion representation capability among modules. Through experiments conducted on two open-source datasets, AIR-MOT and SAT-MTB, it has been demonstrated that the proposed MCTracker exhibits efficient synergistic consistency in detection and ReID tasks. On the AIR-MOT dataset, the object tracking accuracy, as measured by MOTA, reaches 59.4 %, achieving optimal performance. In the SAT-MTB dataset's four-class multiple object tracking task, particularly in tracking small-scale objects such as cars and ships, the proposed method demonstrates comprehensive optimal performance, yielding robust tracking results. The satellite video-based multiple object tracking approach proposed in this study holds significant reference value for further exploration of object motion states and trajectory tracking.
引用
收藏
页码:82 / 103
页数:22
相关论文
共 50 条
  • [21] A Multi-Scale Position Feature Transform Network for Video Frame Interpolation
    Cheng, Xianhang
    Chen, Zhenzhong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 3968 - 3981
  • [22] Video Object Segmentation Algorithm Based on Multi-scale Feature Enhancement and Global-Local Feature Aggregation
    Hou, Zhiqiang
    Dong, Jiale
    Ma, Sugang
    Wang, Chenxu
    Yang, Xiaobao
    Wang, Yunchen
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (11): : 4198 - 4207
  • [23] SCGTracker: object feature embedding enhancement based on graph attention networks for multi-object tracking
    Feng, Xin
    Jiao, Xiaoning
    Wang, Siping
    Zhang, Zhixian
    Liu, Yan
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 5513 - 5527
  • [24] Spatio-Temporal Correlation Graph for Association Enhancement in Multi-object Tracking
    Zhong, Zhijie
    Sheng, Hao
    Zhang, Yang
    Wu, Yubin
    Chen, Jiahui
    Ke, Wei
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 394 - 405
  • [25] Intra-Frame Graph Structure and Inter-Frame Bipartite Graph Matching with ReID-Based Occlusion Resilience for Point Cloud Multi-Object Tracking
    Sun, Shaoyu
    Shi, Chunhao
    Wang, Chunyang
    Zhou, Qing
    Sun, Rongliang
    Xiao, Bo
    Ding, Yueyang
    Xi, Guan
    ELECTRONICS, 2024, 13 (15)
  • [26] LocaLock: Enhancing Multi-Object Tracking in Satellite Videos via Local Feature Matching
    Kong, Lingyu
    Yan, Zhiyuan
    Shi, Hanru
    Zhang, Ting
    Wang, Lei
    REMOTE SENSING, 2025, 17 (03)
  • [27] Multi-scale Deep Feature Transfer for Automatic Video Object Segmentation
    Zhen Yang
    Qingxuan Shi
    Yichuan Fang
    Neural Processing Letters, 2023, 55 : 11701 - 11719
  • [28] A Multi-Scale Learnable Feature Alignment Network for Video Object Detection
    Wang, Rui
    2024 IEEE 21ST INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SMART SYSTEMS, MASS 2024, 2024, : 496 - 501
  • [29] Multi-scale Deep Feature Transfer for Automatic Video Object Segmentation
    Yang, Zhen
    Shi, Qingxuan
    Fang, Yichuan
    NEURAL PROCESSING LETTERS, 2023, 55 (08) : 11701 - 11719
  • [30] Multi-Object Tracking in Video Sequences Based on Background Subtraction and SIFT Feature Matching
    Rahman, Md. Saidur
    Saha, Aparna
    Khanum, Snigdha
    ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 457 - 462