- [51] Liu H, Gu L, Chi ZX, Wang Y, Yu YH, Chen J, Tang J., Few-shot class-incremental learning via entropy-regularized data-free replay, Proc. of the 17th European Conf. on Computer Vision, pp. 146-162, (2022)
- [52] Cheraghian A, Rahman S, Fang PF, Roy SK, Petersson L, Harandi M., Semantic-aware knowledge distillation for few-shot class-incremental learning, Proc. of the 2021 IEEE/CVF Conf. on Computer Vision and Pattern Recognition, pp. 2534-2543, (2021)
- [53] Dong SL, Hong XP, Tao XY, Chang XY, Wei X, Gong YH., Few-shot class-incremental learning via relation knowledge distillation, Proc. of the 35th AAAI Conf. on Artificial Intelligence, pp. 1255-1263, (2021)
- [54] Radford A, Kim JW, Hallacy C, Ramesh A, Goh G, Agarwal S, Sastry G, Askell A, Mishkin P, Clark J, Krueger G, Sutskever I., Learning transferable visual models from natural language supervision, Proc. of the 38th Int’l Conf. on Machine Learning. PMLR, pp. 8748-8763, (2021)
- [55] Alayrac JB, Donahue J, Luc P, Miech A, Barr I, Hasson Y, Lenc K, Mensch A, Millican K, Reynolds M, Ring R, Rutherford E, Cabi S, Han TD, Gong ZT, Samangooei S, Monteiro M, Menick JL, Borgeaud S, Brock A, Nematzadeh A, Sharifzadeh S, Binkowski M, Barreira R, Vinyals O, Zisserman A, Simonyan K., Flamingo: A visual language model for few-shot learning, Proc. of the 36th Int’l Conf. on Neural Information Processing Systems, pp. 23716-23736, (2022)
- [56] Jia C, Yang YF, Xia Y, Chen YT, Parekh Z, Pham H, Le QV, Sung YH, Li Z, Duerig T., Scaling up visual and vision-language representation learning with noisy text supervision, Proc. of the 38th Int’l Conf. on Machine Learning. PMLR, pp. 4904-4916, (2021)
- [57] Krizhevsky A., Learning multiple layers of features from tiny images, (2009)
- [58] Wah C, Branson S, Welinder P, Perona P, Belongie S., The Caltech-UCSD Birds-200-2011 dataset, (2011)
- [59] Krause J, Stark M, Deng J, Fei-Fei L., 3D object representations for fine-grained categorization, Proc. of the 2013 IEEE Int’l Conf. on Computer Vision Workshops, pp. 554-561, (2013)
- [60] Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai XH, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S, Uszkoreit J, Houlsby N., An image is worth 16x16 words: Transformers for image recognition at scale, Proc. of the 9th Int’l Conf. on Learning Representations, (2021)