ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network

被引:56
|
作者
Min, Weiqing [1 ,2 ]
Liu, Linhu [1 ,2 ]
Wang, Zhiling [1 ,2 ]
Luo, Zhengdong [1 ,2 ]
Wei, Xiaoming [3 ]
Wei, Xiaolin [3 ]
Jiang, Shuqiang [1 ,2 ]
机构
[1] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Meituan Dianping Grp, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Food Recognition; Food Datasets; Benchmark; Deep Learning;
D O I
10.1145/3394171.3414031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Food recognition has received more and more attention in the multimedia community for its various real-world applications, such as diet management and self-service restaurants. A large-scale ontology of food images is urgently needed for developing advanced large-scale food recognition algorithms, as well as for providing the benchmark dataset for such algorithms. To encourage further progress in food recognition, we introduce the dataset ISIA Food-500 with 500 categories from the list in the Wikipedia and 399,726 images, a more comprehensive food dataset that surpasses existing popular benchmark datasets by category coverage and data volume. Furthermore, we propose a stacked global-local attention network, which consists of two sub-networks for food recognition. One sub-network first utilizes hybrid spatial-channel attention to extract more discriminative features, and then aggregates these multi-scale discriminative features from multiple layers into global-level representation (e.g., texture and shape information about food). The other one generates attentional regions (e.g., ingredient relevant regions) from different regions via cascaded spatial transformers, and further aggregates these multi-scale regional features from different layers into local-level representation. These two types of features are finally fused as comprehensive representation for food recognition. Extensive experiments on ISIA Food-500 and other two popular benchmark datasets demonstrate the effectiveness of our proposed method, and thus can be considered as one strong baseline. The dataset, code and models can be found at http://123.57.42.89/FoodComputing-Dataset/ISIA-Food500.html.
引用
收藏
页码:393 / 401
页数:9
相关论文
共 50 条
  • [21] The role of supply chains for the sustainability transformation of global food systems: A large-scale, systematic review of food cold chains
    Trotter, Philipp A.
    Becker, Tristan
    Renaldi, Renaldi
    Wang, Xinfang
    Khosla, Radhika
    Walther, Grit
    JOURNAL OF INDUSTRIAL ECOLOGY, 2023, 27 (06) : 1429 - 1446
  • [22] Large-scale seasonal forecasts of river discharge by coupling local and global datasets with a stacked neural network: Case for the Loire River system
    Vu, M. T.
    Jardani, A.
    Krimissa, M.
    Zaoui, F.
    Massei, N.
    SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 897
  • [23] Speech Emotion Recognition Using Multi-Scale Global-Local Representation Learning with Feature Pyramid Network
    Wang, Yuhua
    Huang, Jianxing
    Zhao, Zhengdao
    Lan, Haiyan
    Zhang, Xinjia
    APPLIED SCIENCES-BASEL, 2024, 14 (24):
  • [24] Agroecology for a Sustainable Agriculture and Food System: From Local Solutions to Large-Scale Adoption
    Ewert, Frank
    Baatz, Roland
    Finger, Robert
    ANNUAL REVIEW OF RESOURCE ECONOMICS, 2023, 15 : 351 - 381
  • [25] Consumers' environmental responsibility and their purchase of local food: evidence from a large-scale survey
    Bimbo, Francesco
    Russo, Carlo
    Di Fonzo, Antonella
    Nardone, Gianluca
    BRITISH FOOD JOURNAL, 2021, 123 (05): : 1853 - 1874
  • [26] Capturing the fast-food landscape in England using large-scale network analysis
    Baniukiewicz, Magda
    Dick, Zachariah L.
    Giabbanelli, Philippe J.
    EPJ DATA SCIENCE, 2018, 7
  • [27] Capturing the fast-food landscape in England using large-scale network analysis
    Magda Baniukiewicz
    Zachariah L. Dick
    Philippe J. Giabbanelli
    EPJ Data Science, 7
  • [28] Global-local finite element stress analysis of thick laminate multi-bolt joints in large-scale structures
    Liu, L.
    Chen, K.
    FINITE ELEMENTS IN ANALYSIS AND DESIGN, 2013, 75 : 31 - 37
  • [29] Where is the Global Corporate Elite? A Large-scale Network Study of Local and Nonlocal Interlocking Directorates
    Heemskerk, Eelke M.
    Takes, Frank W.
    Garcia-Bernardo, Javier
    Huijzer, M. Jouke
    SOCIOLOGICA-ITALIAN JOURNAL OF SOCIOLOGY ON LINE, 2016, (02):
  • [30] Large-scale point cloud semantic segmentation via local perception and global descriptor vector
    Zeng, Ziyin
    Xu, Yongyang
    Xie, Zhong
    Tang, Wei
    Wan, Jie
    Wu, Weichao
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246