Edge-Optimized Model for Multimedia Classification using Linguistic Metadata

被引:0
|
作者
Bharitkar, Sunil [1 ]
Paez, Thaddeus [2 ]
机构
[1] Samsung Res Amer, Digital Media Solut Grp Audio Lab, Mountain View, CA 94043 USA
[2] Samsung Elect, Samsung Res Tijuana, Mexico City, DF, Mexico
关键词
Metadata; text analysis; on-device classification; bag-of-words; latent semantic analysis; low-rank approximation; Transformers; RetNet; LSTM; Bayesian optimization;
D O I
10.1109/ICASSPW62465.2024.10626175
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Language models are relevant for text analysis. Transfer learning enables fine-tuning pre-trained large-language model (LLM) architectures for various classification and prediction tasks. However, these fine-tuned LLMs are computationally intensive, have large memory requirements, and have high inference latency, as shown in this paper, which can prevent the deployment of such models for real-time applications on edge devices. This paper presents results from a joint optimization between a low-rank factorization of a text embedding model and a recurrent long short-term memory (LSTM) model using linguistic metadata for a seventeen-class multimedia classification problem. A comparative study shows that our approach exceeds the performance of state-of-the-art large-language models in latency and number of parameters while performing approximately with the same accuracy as larger models, enabling real-time inference on an edge device. Consequently, the model performs real-time inference on a consumer TV for multimedia classification.
引用
收藏
页码:269 / 273
页数:5
相关论文
共 50 条
  • [11] The multimedia metadata interoperability using MOXMDR+
    Park H.-K.
    Moon S.-J.
    International Journal of Multimedia and Ubiquitous Engineering, 2016, 11 (09): : 255 - 264
  • [12] Annotation of multimedia data using semantic metadata
    An, Hyoung-Keun
    Koh, Jae-Jin
    IFOST 2006: 1ST INTERNATIONAL FORUM ON STRATEGIC TECHNOLOGY, PROCEEDINGS: E-VEHICLE TECHNOLOGY, 2006, : 269 - +
  • [13] iSEC: An Optimized Deep Learning Model for Image Classification on Edge Computing
    Kristiani, Endah
    Yang, Chao-Tung
    Huang, Chin-Yin
    IEEE ACCESS, 2020, 8 (08): : 27267 - 27276
  • [14] Edge-to-Edge Resource Discovery using Metadata Replication
    Murturi, Ilir
    Avasalcai, Cosmin
    Tsigkanos, Christos
    Dustdar, Schahram
    2019 IEEE 3RD INTERNATIONAL CONFERENCE ON FOG AND EDGE COMPUTING (ICFEC), 2019,
  • [15] Environmental microorganism classification using optimized deep learning model
    Liang, Chih-Ming
    Lai, Chun-Chi
    Wang, Szu-Hong
    Lin, Yu-Hao
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2021, 28 (24) : 31920 - 31932
  • [16] Environmental microorganism classification using optimized deep learning model
    Chih-Ming Liang
    Chun-Chi Lai
    Szu-Hong Wang
    Yu-Hao Lin
    Environmental Science and Pollution Research, 2021, 28 : 31920 - 31932
  • [17] CLASSIFICATION OF SENTIMENT USING OPTIMIZED HYBRID DEEP LEARNING MODEL
    Touate, Chaima Ahle
    EL Ayachi, Rachid
    Biniz, Mohamed
    COMPUTING AND INFORMATICS, 2023, 42 (03) : 651 - 666
  • [18] Fish Species Classification using Optimized Deep Learning Model
    Mol, J. M. Jini
    Jose, S. Albin
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 825 - 837
  • [19] Fish Species Classification using Optimized Deep Learning Model
    Mol, J. M. Jini
    Jose, S. Albin
    International Journal of Advanced Computer Science and Applications, 2022, 13 (09): : 825 - 837
  • [20] Hierarchical Model for Multimedia Content Classification
    Bharitkar, Sunil
    Lopes, Andre
    Flores, Walter
    2019 IEEE 9TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE-BERLIN), 2019, : 132 - 137