Edge-Optimized Model for Multimedia Classification using Linguistic Metadata

被引:0
|
作者
Bharitkar, Sunil [1 ]
Paez, Thaddeus [2 ]
机构
[1] Samsung Res Amer, Digital Media Solut Grp Audio Lab, Mountain View, CA 94043 USA
[2] Samsung Elect, Samsung Res Tijuana, Mexico City, DF, Mexico
关键词
Metadata; text analysis; on-device classification; bag-of-words; latent semantic analysis; low-rank approximation; Transformers; RetNet; LSTM; Bayesian optimization;
D O I
10.1109/ICASSPW62465.2024.10626175
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Language models are relevant for text analysis. Transfer learning enables fine-tuning pre-trained large-language model (LLM) architectures for various classification and prediction tasks. However, these fine-tuned LLMs are computationally intensive, have large memory requirements, and have high inference latency, as shown in this paper, which can prevent the deployment of such models for real-time applications on edge devices. This paper presents results from a joint optimization between a low-rank factorization of a text embedding model and a recurrent long short-term memory (LSTM) model using linguistic metadata for a seventeen-class multimedia classification problem. A comparative study shows that our approach exceeds the performance of state-of-the-art large-language models in latency and number of parameters while performing approximately with the same accuracy as larger models, enabling real-time inference on an edge device. Consequently, the model performs real-time inference on a consumer TV for multimedia classification.
引用
收藏
页码:269 / 273
页数:5
相关论文
共 50 条
  • [31] Optimized classification model for plant diseases using generative adversarial networks
    Shweta Lamba
    Preeti Saini
    Jagpreet Kaur
    Vinay Kukreja
    Innovations in Systems and Software Engineering, 2023, 19 : 103 - 115
  • [32] ThanosNet: A Novel Trash Classification Method Using Metadata
    Sun, Alan
    Xiao, Harry
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1394 - 1401
  • [33] Cognitive Linguistic Corpus Classification and Terminology Database Design Based on Multimedia Technology
    Cuil, Weihui
    Cao, Yanwen
    Ail, Hua
    Shil, Juntao
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (01) : 91 - 105
  • [34] Improving Scientific Data Extraction using Metadata Classification
    Chang, Yue Shan
    Lai, Hsuan-Jen
    Cheng, Hsiang-Tai
    2009 10TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (ISPAN 2009), 2009, : 669 - +
  • [35] Improving plankton image classification using context metadata
    Ellen, Jeffrey S.
    Graff, Casey A.
    Ohman, Mark D.
    LIMNOLOGY AND OCEANOGRAPHY-METHODS, 2019, 17 (08): : 439 - 461
  • [36] A fault detection model for edge computing security using imbalanced classification
    Liang, Peifeng
    Liu, Gang
    Xiong, Zenggang
    Fan, Honghui
    Zhu, Hongjin
    Zhang, Xuemin
    JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 133
  • [37] Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
    Golub, Koraljka
    Hagelback, Johan
    Ardo, Anders
    JOURNAL OF DATA AND INFORMATION SCIENCE, 2020, 5 (01) : 18 - 38
  • [38] Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
    Koraljka Golub
    Johan Hagelbck
    Anders Ard
    JournalofDataandInformationScience, 2020, 5 (01) : 18 - 38
  • [39] Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
    Koraljka Golub
    Johan Hagelb?ck
    Anders Ard?
    Journal of Data and Information Science, 2020, (01) : 18 - 38
  • [40] Linguistic Model for Classification Measurements of the Distributions of Signals
    A. A. Gorshenkov
    Yu. N. Klikushin
    V. Yu. Kobenko
    Measurement Techniques, 2013, 56 : 31 - 36