Machine Learning Applied to Media Libraries for Insights, Search, and Segmentation

被引:0
|
作者
Gonsalves R.
Montajabi Z.
Mathur S.
Bouguila N.
机构
来源
SMPTE Motion Imaging Journal | 2023年 / 132卷 / 03期
关键词
Artificial intelligence (AI); CLIP4Clip; machine learning (ML); media search; Microsoft CLAP; multilingual CLIP; OpenAI contrastive language-image pretraining (CLIP); semantic search; Wav2CLIP; wav2vec2;
D O I
10.5594/JMI.2023.3245685
中图分类号
学科分类号
摘要
Recent advances in machine learning (ML) have produced a new form of semantic indexing that lets users enhance searches and gain new insights into their media libraries. Unlike typical search systems that use extracted metadata, semantic indexing allows users to find relevant material without the need to tag the media with selections from a predefined taxonomy. With semantic search, users can simply enter unstructured text, and the system will find the best matching media clips. This article extends the use of this technology to gather analytics on the data, which can then be further correlated to generate various insights. This new form of media indexing can be performed with the contrastive language-image pretraining (CLIP) model from OpenAI for images and similar models for video and audio. These models encode media into embeddings that can be searched to find the closest semantic similarity, enhanced with learned cultural knowledge. These systems have the benefit of finding media based on keywords, synonyms, and summaries. The systems can also be used for analytics and insights, such as segmentation, shot detection, and creating a 2D map to display correlations. This article ends with a discussion of the next steps, including using knowledge graphs for semantic search. © 2002 Society of Motion Picture and Television Engineers, Inc.
引用
收藏
页码:27 / 38
页数:11
相关论文
共 50 条
  • [21] Automotive market segmentation by machine learning
    Polpinij, J
    Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, Vols 1and 2, 2004, : 404 - 408
  • [22] Various Frameworks and Libraries of Machine Learning and Deep Learning: A Survey
    Wang, Zhaobin
    Liu, Ke
    Li, Jian
    Zhu, Ying
    Zhang, Yaonan
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2024, 31 (01) : 1 - 24
  • [23] Machine Learning for Streaming Media
    Lamkhede, Sudarshan
    Chandar, Praveen
    Radosavljevic, Vladan
    Goyal, Amit
    Luo, Lan
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 759 - 759
  • [24] Deep Learning Techniques Applied for Road Segmentation
    Munteanu, Alexandru
    Selea, Teodora
    Neagul, Marian
    2019 21ST INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2019), 2020, : 297 - 303
  • [25] Insights into Vocational Learning from an Applied Learning Perspective
    Pridham, Bruce
    O'Mallon, Simon
    Prain, Vaughan
    VOCATIONS AND LEARNING, 2012, 5 (02) : 77 - 97
  • [26] Insights into Vocational Learning from an Applied Learning Perspective
    Bruce Pridham
    Simon O’Mallon
    Vaughan Prain
    Vocations and Learning, 2012, 5 : 77 - 97
  • [27] Learning in digital libraries: An information search process approach
    Kuhlthau, CC
    LIBRARY TRENDS, 1997, 45 (04) : 708 - 724
  • [28] Machine Learning Applied to Prevention and Mental
    Kcomt Ponce, Edwin
    Flores Cruz, Melissa
    Andrade-Arenas, Laberiano
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (01) : 823 - 831
  • [29] Machine Learning: An Applied Mathematics Introduction
    Lleo, Sebastien
    QUANTITATIVE FINANCE, 2020, 20 (03) : 359 - 360
  • [30] Special Issue on Applied Machine Learning
    Dudek, Grzegorz
    APPLIED SCIENCES-BASEL, 2022, 12 (04):