A benchmark dataset in chemical apparatus: recognition and detection

Cited by: 1
Authors
Zou, Le [1]
Ding, Ze-Sheng [1]
Ran, Shuo-Yi [2]
Wu, Zhi-Ze [2]
Wei, Yun-Sheng [3]
He, Zhi-Huang [1]
Wang, Xiao-Feng [1]
Affiliations
[1] Hefei Univ, Sch Artificial Intelligence & Big Data, Hefei 230601, Peoples R China
[2] Hefei Univ, Inst Appl Optimizat, Sch Artificial Intelligence & Big Data, Hefei 230601, Anhui, Peoples R China
[3] Hefei Univ, Sch Energy Mat & Chem Engn, Hefei 230601, Anhui, Peoples R China
Keywords
Deep learning; Chemical apparatus; Object detection; Image recognition; Benchmark dataset; Convolutional neural network; Artificial intelligence
DOI
10.1007/s11042-023-16563-8
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Robots that perform chemical experiments autonomously have already been implemented; they use the same chemical apparatus as human chemists and can carry out complex experiments unattended. However, most robots in chemistry are still pre-programmed and cannot adapt to diverse environments or to changes in an object's position and angle. To address this issue, we propose a computer vision method for recognizing and detecting chemical apparatus automatically. Accurately identifying and localizing such apparatus in chemistry lab images is the most important task. We acquired 2246 images from real chemistry laboratories, containing a total of 33,108 apparatus instances across 21 classes. We present the Chemical Apparatus Benchmark Dataset (CABD), which comprises a chemical apparatus image recognition dataset and a chemical apparatus object detection dataset. On CABD, we evaluated five image recognition models (AlexNet, VGG16, GoogLeNet, ResNet50 and MobileNetV2) and four state-of-the-art object detection methods (Faster R-CNN with three backbones, Single Shot MultiBox Detector (SSD), YOLOv3-SPP and YOLOv5). The results can serve as a baseline for future research. Experiments show that ResNet50 achieves the highest accuracy (99.9%) on the chemical apparatus image recognition dataset, while Faster R-CNN (ResNet50-fpn) and YOLOv5 perform best in terms of mAP (99.0%) and AR (94.5%) on the chemical apparatus object detection dataset.
Pages: 26419-26437
Page count: 19
Journal
Multimedia Tools and Applications, 2024, 83: 26419-26437