Multi-modal sorting in plastic and wood waste streams

Cited by: 11
Authors
Konstantinidis, Fotios K. [1, 2]
Sifnaios, Savvas [2]
Arvanitakis, George [2]
Tsimiklis, Georgios [2]
Mouroutsos, Spyridon G. [3]
Amditis, Angelos [2]
Gasteratos, Antonios [1]
Affiliations
[1] Democritus Univ Thrace, Sch Engn, Dept Prod & Management Engn, Lab Robot & Automat, GR-67132 Xanthi, Greece
[2] Natl Tech Univ Athens, Inst Commun & Comp Syst, 9 Iroon Polytech Str, GR-15773 Athens, Greece
[3] Democritus Univ Thrace, Dept Elect & Comp Engn, Lab Mechatron & Syst Automat, GR-67100 Kimmeria, Greece
Keywords
Machine vision; Multi-spectral; Multi-modal learning; Waste management; Waste sorting; Plastic-wood wastes;
DOI
10.1016/j.resconrec.2023.107244
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Addressing the escalating waste crisis necessitates innovative waste management strategies, particularly valorisation techniques, the efficiency of which is dictated by the purity of the feedstock. To mitigate the segregation challenges encountered in complex and non-homogeneous waste streams, this work proposes a vision-based architecture capable of effectively sorting parts based on shape and material-related properties. The proposed work encapsulates a novel deep learning multi-modal approach, in which multiple parallel auto-encoders extract spatio-spectral information from the RGB and multi-spectral sensors and project it into a common latent space. By decoding the latent space representations, the class of each object is identified, thus guiding the robotic sub-system accordingly. To support the proposed deep architecture, a dataset called the Multispectral Mixed Waste Dataset (MMWD) was produced, containing multi-spectral data from the visible (16 bands) and near-infrared (25 bands) regions of the electromagnetic spectrum, together with RGB (3 bands) data. The dataset includes the following seven plastic and wood wastes: Polypropylene (PP), PolyEthylene Terephthalate (PET), Low-Density PolyEthylene (LDPE), High-Density PolyEthylene (HDPE), Medium Density Fibreboard (MDF), Melamine Faced Chipboards (MFC), and Oak Veneer samples. For the localisation of waste along the conveyor belt, YOLO v8 is used and achieves 99.5% mean average precision (mAP50). In the classification task, where the multi-modal approach was followed, the overall accuracy achieved is 96%, with the prediction recall exceeding 95% for the majority of the classes under examination.
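The idea of parallel encoders feeding a common latent space can be illustrated with a minimal sketch. Only the band counts (3, 16, 25) and the seven class names come from the abstract. The latent size, the linear-plus-tanh encoders, the fusion by averaging, the linear classification head, and the random (untrained) weights are all assumptions made for illustration, and this is not the authors' architecture.

```python
import math
import random

# Band counts and class names are taken from the abstract.
BANDS = {"rgb": 3, "visible": 16, "nir": 25}
CLASSES = ["PP", "PET", "LDPE", "HDPE", "MDF", "MFC", "Oak Veneer"]
LATENT_DIM = 8  # hypothetical size of the common latent space


def make_matrix(rows, cols, rng):
    """Random weights standing in for trained parameters (assumption)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def encode(matrix, spectrum):
    """One encoder branch: linear projection followed by tanh (assumption)."""
    return [math.tanh(v) for v in matvec(matrix, spectrum)]


def softmax(scores):
    """Numerically stable softmax over class scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(sample, encoders, head):
    """Encode each modality, average in the shared latent space, score classes."""
    latents = [encode(encoders[name], sample[name]) for name in BANDS]
    fused = [sum(vals) / len(vals) for vals in zip(*latents)]  # assumed fusion rule
    probs = softmax(matvec(head, fused))
    return CLASSES[probs.index(max(probs))], probs


rng = random.Random(0)
encoders = {name: make_matrix(LATENT_DIM, n, rng) for name, n in BANDS.items()}
head = make_matrix(len(CLASSES), LATENT_DIM, rng)

# A synthetic per-object spectrum for each sensor (not MMWD data).
sample = {name: [rng.random() for _ in range(n)] for name, n in BANDS.items()}
label, probs = classify(sample, encoders, head)
print(label, [round(p, 3) for p in probs])
```

Because the weights are random, the predicted label is meaningless. The sketch only shows how inputs of different widths (3, 16 and 25 bands) can be mapped to vectors of one shared size before a single classification step.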
Pages: 11