Concept-based and embedding-based models in lifelog retrieval: an empirical comparison of performance

被引：0

作者：

Nguyen, Manh-Duy ^{[1
]}

Nguyen, Binh T. ^{[2
,3
]}

Gurrin, Cathal ^{[1
]}

机构：

[1] Dublin City Univ, Sch Comp, Dublin, Ireland

[2] Univ Sci, Ho Chi Minh City, Vietnam

[3] Vietnam Natl Univ, Ho Chi Minh City, Vietnam

来源：

INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL | 2025年 / 14卷 / 02期

基金：

爱尔兰科学基金会;

关键词：

Lifelog; Interactive retrieval system; Multimodal retrieval; Fusion model;

D O I：

10.1007/s13735-025-00359-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many lifelog retrieval systems have been introduced that apply various approaches to their search engines. The traditional method was to match concepts, which are visual objects detected in images and semantic queries. This concept-based approach has been applied in many retrieval systems, achieving the top performance in lifelog search challenges. Many novel embedding-based cross-modality retrieval models, such as CLIP, BLIP, or HADA, have been developed recently and obtained state-of-the-art (SOTA) results in the image-text retrieval task. These models have recently been applied in several lifelog search challenges. However, there is no comprehensive comparison between them since many benchmarking evaluations contain bias factors such as different user interfaces of participated lifelog retrieval systems. In this paper, we conducted non-biased experiments in both automatic (non-interactive) and interactive configurations to evaluate the performance of many SOTA retrieval models, including the traditional concept-based approach, in the lifelog retrieval task. Furthermore, we retrained the models in a lifelog Q&A dataset to assess whether retraining on a small lifelog dataset could improve the performance. The result showed that embedding-based search engines outperformed the concept-based approach by a large margin in both settings. The finding opens the opportunity to apply the embedding-based models as a new generation of lifelog retrieval models instead of the conventional concept-based approach. The source code and detailed result are available online https://github.com/m2man/Comparing-models-in-Lifelog-Retrieval-Task.

引用

页数：9

共 50 条

[41] Spectral embedding-based multiview features fusion for content-based image retrieval
Feng, Lin
Yu, Laihang
Zhu, Hai
JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (05)
[42] Contextual Path Retrieval: A Contextual Entity Relation Embedding-based Approach
Lo, Pei-Chi
Lim, Ee-Peng
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (01)
[43] Improving Embedding-Based Retrieval in Friend Recommendation with ANN Query Expansion
Kung, Pau Perng-Hwa
Fan, Zihao
Zhao, Tong
Liu, Yozen
Lai, Zhixin
Shi, Jiahui
Wu, Yan
Yu, Jun
Shah, Neil
Venkataraman, Ganesh
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2930 - 2934
[44] Embedding-based Query Expansion for Weighted Sequential Dependence Retrieval Model
Balaneshin-kordan, Saeid
Kotov, Alexander
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1213 - 1216
[45] QuadrupletBERT: An Efficient Model For Embedding-Based Large-Scale Retrieval
Liu, Peiyang
Wang, Sen
Wang, Xi
Ye, Wei
Zhang, Shikun
2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3734 - 3739
[46] Understanding and Enhancing Robustness of Concept-Based Models
Sinha, Sanchit
Huai, Mengdi
Sun, Jianhui
Zhang, Aidong
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 15127 - 15135
[47] Kelpie: an Explainability Framework for Embedding-based Link Prediction Models
Rossi, Andrea
Firmani, Donatella
Merialdo, Paolo
Teofili, Tommaso
PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (12): : 3566 - 3569
[48] Evaluating the Robustness of Embedding-Based Topic Models to OCR Noise
Zosa, Elaine
Mutuvi, Stephen
Granroth-Wilding, Mark
Doucet, Antoine
TOWARDS OPEN AND TRUSTWORTHY DIGITAL SOCIETIES, ICADL 2021, 2021, 13133 : 392 - 400
[49] Architecture of a Concept-Based Information Retrieval System for Educational Resources
Perez-Rodriguez, Roberto
Anido-Rifon, Luis
Gomez-Carballa, Miguel
Mourino-Garcia, Marcos
2014 INTERNATIONAL SYMPOSIUM ON COMPUTERS IN EDUCATION (SIIE), 2014, : 99 - 104
[50] Architecture of a concept-based information retrieval system for educational resources
Perez-Rodriguez, Roberto
Anido-Rifon, Luis
Gomez-Carballa, Miguel
Mourino-Garcia, Marcos
SCIENCE OF COMPUTER PROGRAMMING, 2016, 129 : 72 - 91

← 1 2 3 4 5 →