CMFTNet: Multiple fish tracking based on counterpoised JointNet

被引:27
|
作者
Li, Weiran [1 ,2 ,3 ,4 ,5 ]
Li, Fei [1 ,2 ,3 ,4 ,5 ]
Li, Zhenbo [1 ,2 ,3 ,4 ,5 ,6 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr & Rural Affairs, Natl Innovat Ctr Digital Fishery, Beijing 100083, Peoples R China
[3] Minist Agr & Rural Affairs, Key Lab Agr Informat Acquisit Technol, Beijing 100083, Peoples R China
[4] Beijing Engn & Technol Res Ctr Internet Things Agr, Beijing 100083, Peoples R China
[5] Minist Agr & Rural Affairs, Key Lab Smart Farming Aquat Anim & Livestock, Beijing 100083, Peoples R China
[6] China Agr Univ, POB 121,17 Tsinghua East Rd, Beijing 100083, Peoples R China
基金
国家重点研发计划;
关键词
Fish tracking; Multiple object tracking; Joint detection and embedding tracking; Computer vision; Deep learning;
D O I
10.1016/j.compag.2022.107018
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
The analysis of fish motion is remarkably applied to investigate physiological behavior and water quality status. Multiple fish tracking methods based on computer vision have the advantages of contactless, information interpretability, single equipment, and high durability. However, the existed approaches cannot cope with complex scenarios, occlusions, and inconstant scales well. To solve the issues, we propose a multi-object video tracking model specifically for fish schools in aquaculture ponds, called CMFTNet. Firstly, we deploy the Joint Detection and Embedding paradigm to share the features for multiple fish detection and tracking tasks. It utilizes the anchor-free method to solve the problem of mutual occlusion of fish schools. Then, we embed the deformable convolution in the updated backbone to intensify the context features of fish in complex environments. Finally, we evaluate the influence of feature dimensions and propose a weight counterpoised loss that outperforms the previous aggregation methods on dual-branch. Extensive experiments show that CMFTNet achieves the best result both on precision and efficiency. The model reaches 65.5% MOTA and 27.4% IDF1 on the OptMFT dataset. The source codes and pre-trained models are available at: https://github.com/vranlee/CMFTNet.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] PLCFishMOT: multiple fish fry tracking utilizing particle filtering and attention mechanism
    Tan, Huachao
    Cheng, Yuan
    Liu, Dan
    Yuan, Guihong
    Jiang, Yanbo
    Gao, Hongyong
    Bi, Hai
    AQUACULTURE INTERNATIONAL, 2025, 33 (01)
  • [32] The Caltech Fish Counting Dataset: A Benchmark for Multiple-Object Tracking and Counting
    Kay, Justin
    Kulits, Peter
    Stathatos, Suzanne
    Deng, Siqi
    Young, Erik
    Beery, Sara
    Van Horn, Grant
    Perona, Pietro
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 290 - 311
  • [33] A Survey of Multiple Pedestrian Tracking Based on Tracking-by-Detection Framework
    Sun, Zhihong
    Chen, Jun
    Chao, Liang
    Ruan, Weijian
    Mukherjee, Mithun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 1819 - 1833
  • [34] Multiple-hypothesis tracking and graph-based tracking extensions
    Coraluppi, Stefano P.
    Carthel, Craig A.
    Willsky, Alan S.
    Journal of Advances in Information Fusion, 2019, 14 (02): : 152 - 166
  • [35] Human Tracking based on Multiple View Homography
    Seo, Dong-Wook
    Chae, Hyun-Uk
    Kim, Byeong-Woo
    Choi, Won-Ho
    Jo, Kang-Hyun
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2009, 15 (13) : 2463 - 2484
  • [36] Online Tracking Based on Multiple Appearances Model
    Tang, Shuo
    Zhang, Long-fei
    Yan, Jia-li
    Tan, Xiang-wei
    Ding, Gang-yi
    2016 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI 2016), 2016, : 634 - 637
  • [37] Multiple Moving Targets Tracking Based on the Video
    Li, Yucheng
    Wang, Fenyan
    2009 INTERNATIONAL SYMPOSIUM ON COMPUTER NETWORK AND MULTIMEDIA TECHNOLOGY (CNMT 2009), VOLUMES 1 AND 2, 2009, : 434 - 437
  • [38] TRACKING MULTIPLE TARGETS BASED ON STEREO VISION
    Ganoun, Ali
    Veit, Thomas
    Aubert, Didier
    VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2009, : 470 - +
  • [39] PSO-Based Multiple People Tracking
    Chen Ching-Han
    Yan Miao-Chun
    DIGITAL INFORMATION AND COMMUNICATION TECHNOLOGY AND ITS APPLICATIONS, PT I, 2011, 166 : 267 - 276
  • [40] Compressive Visual Tracking Based on Multiple Features
    Wu, Xiang
    Zhao, Gao-peng
    Bo, Yu-ming
    2015 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS: TECHNIQUES AND APPLICATIONS (EETA 2015), 2015, : 249 - 253