ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network

被引:56
|
作者
Min, Weiqing [1 ,2 ]
Liu, Linhu [1 ,2 ]
Wang, Zhiling [1 ,2 ]
Luo, Zhengdong [1 ,2 ]
Wei, Xiaoming [3 ]
Wei, Xiaolin [3 ]
Jiang, Shuqiang [1 ,2 ]
机构
[1] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Meituan Dianping Grp, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Food Recognition; Food Datasets; Benchmark; Deep Learning;
D O I
10.1145/3394171.3414031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Food recognition has received more and more attention in the multimedia community for its various real-world applications, such as diet management and self-service restaurants. A large-scale ontology of food images is urgently needed for developing advanced large-scale food recognition algorithms, as well as for providing the benchmark dataset for such algorithms. To encourage further progress in food recognition, we introduce the dataset ISIA Food-500 with 500 categories from the list in the Wikipedia and 399,726 images, a more comprehensive food dataset that surpasses existing popular benchmark datasets by category coverage and data volume. Furthermore, we propose a stacked global-local attention network, which consists of two sub-networks for food recognition. One sub-network first utilizes hybrid spatial-channel attention to extract more discriminative features, and then aggregates these multi-scale discriminative features from multiple layers into global-level representation (e.g., texture and shape information about food). The other one generates attentional regions (e.g., ingredient relevant regions) from different regions via cascaded spatial transformers, and further aggregates these multi-scale regional features from different layers into local-level representation. These two types of features are finally fused as comprehensive representation for food recognition. Extensive experiments on ISIA Food-500 and other two popular benchmark datasets demonstrate the effectiveness of our proposed method, and thus can be considered as one strong baseline. The dataset, code and models can be found at http://123.57.42.89/FoodComputing-Dataset/ISIA-Food500.html.
引用
收藏
页码:393 / 401
页数:9
相关论文
共 50 条
  • [31] Global Coverage of Mandatory Large-Scale Food Fortification Programs: A Systematic Review and Meta-Analysis
    Rohner, Fabian
    Wirth, James P.
    Zeng, Wu
    Petry, Nicolai
    Donkor, William E. S.
    Neufeld, Lynnette M.
    Mkambula, Penjani
    Groll, Sydney
    Mbuya, Mduduzi NN.
    Friesen, Valerie M.
    ADVANCES IN NUTRITION, 2023, 14 (05) : 1197 - 1210
  • [32] Impact of large-scale agricultural investments on the food security status of local community in Gambella region, Ethiopia
    Guyalo, Amanuel Kussia
    Alemu, Esubalew Abate
    Degaga, Degefa Tolossa
    AGRICULTURE & FOOD SECURITY, 2022, 11 (01):
  • [33] Cross-Modal Object Tracking via Modality-Aware Fusion Network and a Large-Scale Dataset
    Liu, Lei
    Zhang, Mengya
    Li, Cheng
    Li, Chenglong
    Tang, Jin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 14
  • [34] GGLA-NeXtE2NET: A Dual-Branch Ensemble Network With Gated Global-Local Attention for Enhanced Brain Tumor Recognition
    Saeed, Adnan
    Shehzad, Khurram
    Bhatti, Shahzad Sarwar
    Ahmed, Saim
    Azar, Ahmad Taher
    IEEE ACCESS, 2025, 13 : 7234 - 7257
  • [35] Receptive-Field and Direction Induced Attention Network for Infrared Dim Small Target Detection With a Large-Scale Dataset IRDST
    Sun, Heng
    Bai, Junxiang
    Yang, Fan
    Bai, Xiangzhi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [36] Global Meets Local: Dual Activation Hashing Network for Large-Scale Fine-Grained Image Retrieval
    Jiang, Xin
    Tang, Hao
    Li, Zechao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6266 - 6279
  • [37] Large-scale agricultural investments and local food security - Evidence from a mixed-method case study in Benin
    Muder, Anika
    Luckmann, Jonas
    Schmid, Julia C.
    FOOD SECURITY, 2024, 16 (02) : 511 - 531
  • [38] Large-scale agricultural investments and local food security – Evidence from a mixed-method case study in Benin
    Anika Muder
    Jonas Luckmann
    Julia C. Schmid
    Food Security, 2024, 16 : 511 - 531
  • [40] Marine Vessel Re-Identification: A Large-Scale Dataset and Global-and-Local Fusion-Based Discriminative Feature Learning
    Qiao, Dalei
    Liu, Guangzhong
    Dong, Feng
    Jiang, She-Xiang
    Dai, Likun
    IEEE ACCESS, 2020, 8 : 27744 - 27756