An Intelligent Industrial Visual Monitoring and Maintenance Framework Empowered by Large-Scale Visual and Language Models

被引:2
|
作者
Wang, Huan [1 ]
Li, Chenxi [2 ]
Li, Yan-Fu [1 ]
Tsung, Fugee [3 ,4 ]
机构
[1] Tsinghua University, Department of Industrial Engineering, Beijing,100084, China
[2] University of Electronic Science and Technology of China, Glasgow College, Chengdu,611731, China
[3] Department of Industrial Engineering and Decision Analytics, Hong Kong University of Science and Technology, Hong Kong, Hong Kong
[4] Data Science and Analytics Thrust, Hong Kong University of Science and Technology (Guangzhou), Nansha, Guangzhou,511400, China
关键词
Computational linguistics - Embedded systems - Job analysis - Maintenance - Visual languages;
D O I
10.1109/TICPS.2024.3414292
中图分类号
学科分类号
摘要
引用
收藏
页码:166 / 175
相关论文
共 50 条
  • [41] Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications
    Zhang, Shiliang
    Tian, Qi
    Hua, Gang
    Huang, Qingming
    Gao, Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (09) : 2664 - 2677
  • [42] Propagating waves in visual cortex: A large-scale model of turtle visual cortex
    Nenadic, Z
    Ghosh, BK
    Ulinski, P
    JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2003, 14 (02) : 161 - 184
  • [43] Visual Informatics Tools for Supporting Large-Scale Collaborative Wildlife Monitoring with Citizen Scientists
    He, Zhihai
    Kays, Roland
    Zhang, Zhi
    Ning, Guanghan
    Huang, Chen
    Han, Tony X.
    Millspaugh, Josh
    Forrester, Tavis
    McShea, William
    IEEE CIRCUITS AND SYSTEMS MAGAZINE, 2016, 16 (01) : 73 - 86
  • [44] Intelligent Maintenance of Moving Joints on Large-scale Machine Tool
    Wang, Mulan
    Chen, Xuanyu
    Ding, Wenzheng
    Zhu, Hao
    Zhu, Lei
    PROCEEDINGS OF THE 2016 6TH INTERNATIONAL CONFERENCE ON MECHATRONICS, COMPUTER AND EDUCATION INFORMATIONIZATION (MCEI 2016), 2016, 130 : 1357 - 1361
  • [45] A Design of Interface for Visual-Impaired People to Access Visual Information from Images Featuring Large Language Models and Visual Language Models
    Zhang, Zhe-Xin
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [46] Large-scale Text-to-Image Generation Models for Visual Artists' Creative Works
    Ko, Hyung-Kwon
    Park, Gwanmo
    Jeon, Hyeon
    Jo, Jaemin
    Kim, Juho
    Seo, Jinwook
    PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023, 2023, : 919 - 933
  • [47] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
    Torii, Akihiko
    Taira, Hajime
    Sivic, Josef
    Pollefeys, Marc
    Okutomi, Masatoshi
    Pajdla, Tomas
    Sattler, Torsten
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 814 - 829
  • [48] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
    Sattler, Torsten
    Torii, Akihiko
    Sivic, Josef
    Pollefeys, Marc
    Taira, Hajime
    Okutomi, Masatoshi
    Pajdla, Tomas
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6175 - 6184
  • [49] Can surgical computer vision benefit from large-scale visual foundation models?
    Rabbani, Navid
    Bartoli, Adrien
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2024, 19 (06) : 1157 - 1163
  • [50] CUBE: A scalable framework for large-scale industrial simulations
    Jansson, Niclas
    Bale, Rahul
    Onishi, Keiji
    Tsubokura, Makoto
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, 33 (04): : 678 - 698