Machine Learning Applied to Media Libraries for Insights, Search, and Segmentation

Cited by: 0
Authors
Gonsalves R.
Montajabi Z.
Mathur S.
Bouguila N.
Source
SMPTE Motion Imaging Journal | 2023 / Vol. 132 / No. 03
Keywords
Artificial intelligence (AI); CLIP4Clip; machine learning (ML); media search; Microsoft CLAP; multilingual CLIP; OpenAI contrastive language-image pretraining (CLIP); semantic search; Wav2CLIP; wav2vec2
DOI
10.5594/JMI.2023.3245685
Abstract
Recent advances in machine learning (ML) have produced a new form of semantic indexing that lets users enhance searches and gain new insights into their media libraries. Unlike typical search systems that use extracted metadata, semantic indexing allows users to find relevant material without the need to tag the media with selections from a predefined taxonomy. With semantic search, users can simply enter unstructured text, and the system will find the best matching media clips. This article extends the use of this technology to gather analytics on the data, which can then be further correlated to generate various insights. This new form of media indexing can be performed with the contrastive language-image pretraining (CLIP) model from OpenAI for images and similar models for video and audio. These models encode media into embeddings that can be searched to find the closest semantic similarity, enhanced with learned cultural knowledge. These systems have the benefit of finding media based on keywords, synonyms, and summaries. The systems can also be used for analytics and insights, such as segmentation, shot detection, and creating a 2D map to display correlations. This article ends with a discussion of the next steps, including using knowledge graphs for semantic search. © 2002 Society of Motion Picture and Television Engineers, Inc.
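The embedding search described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three-dimensional vectors and clip names below are made-up stand-ins for embeddings that a model such as CLIP would produce, and `cosine_similarity` and `search` are hypothetical helper names. The sketch only shows the ranking step, which scores each stored embedding against the query embedding and returns the closest clips.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def search(query_embedding, library, top_k=2):
    """Return the ids of the top_k clips most similar to the query."""
    scored = [
        (cosine_similarity(query_embedding, embedding), clip_id)
        for clip_id, embedding in library.items()
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [clip_id for _, clip_id in scored[:top_k]]

# Toy embeddings (assumed values); real ones would come from an
# image, video, or audio encoder and have hundreds of dimensions.
library = {
    "beach_sunset.mp4": [0.9, 0.1, 0.0],
    "city_traffic.mp4": [0.1, 0.9, 0.1],
    "ocean_waves.wav": [0.8, 0.0, 0.3],
}

# Stand-in for the text encoder's output for an unstructured query.
query = [1.0, 0.0, 0.1]

results = search(query, library)
print(results)
```

Because the query and the media share one embedding space in this setup, no predefined taxonomy or manual tagging is involved in the ranking itself.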
Pages: 27-38
Page count: 11