Human-like Controllable Image Captioning with Verb-specific Semantic Roles

被引:42
|
作者
Chen, Long [2 ,3 ]
Jiang, Zhihong [1 ]
Xiao, Jun [1 ]
Liu, Wei [4 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Tencent AI Lab, Bellevue, WA USA
[3] Columbia Univ, New York, NY 10027 USA
[4] Tencent Data Platform, New York, NY USA
基金
浙江省自然科学基金; 中国国家自然科学基金;
关键词
D O I
10.1109/CVPR46437.2021.01657
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Controllable Image Captioning (CIC) - generating image descriptions following designated control signals- has received unprecedented attention over the last few years. To emulate the human ability in controlling caption generation, current CIC studies focus exclusively on control signals concerning objective properties, such as contents of interest or descriptive patterns. However, we argue that almost all existing objective control signals have overlooked two indispensable characteristics of an ideal control signal: 1) Event-compatible: all visual contents referred to in a single sentence should be compatible with the described activity. 2) Sample-suitable: the control signals should be suitable for a specific image sample. To this end, we propose a new control signal for CIC: Verb-specific Semantic Roles (VSR). VSR consists of a verb and some semantic roles, which represents a targeted activity and the roles of entities involved in this activity. Given a designated VSR, we first train a grounded semantic role labeling (GSRL) model to identify and ground all entities for each role. Then, we propose a semantic structure planner (SSP) to learn human-like descriptive semantic structures. Lastly, we use a roleshift captioning model to generate the captions. Extensive experiments and ablations demonstrate that our framework can achieve better controllability than several strong baselines on two challenging CIC benchmarks. Besides, we can generate multi-level diverse captions easily.
引用
收藏
页码:16841 / 16851
页数:11
相关论文
共 40 条
  • [31] Acute vascular effects of sex steroid hormones in mice with advanced human-like atherosclerosis:: Gender-specific role of nitric oxide synthase
    Traupe, T
    Forte, S
    Ortmann, J
    Strässle, R
    Vetter, W
    Barton, M
    HYPERTENSION, 2003, 42 (04) : 633 - 633
  • [32] MafA-deficient and beta cell-specific MafK-overexpressing hybrid transgenic mice develop human-like severe diabetic nephropathy
    Shimohata, Homare
    Yoh, Keigyou
    Fujita, Akiko
    Morito, Naoki
    Ojima, Masami
    Tanaka, Hiromi
    Hirayama, Kouichi
    Kobayashi, Masaki
    Kudo, Takashi
    Yamagata, Kunihiro
    Takahashi, Satoru
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2009, 389 (02) : 235 - 240
  • [33] PET/CT And SPECT/CT Based Identification of Novel Rodent BAT and Beige Depots-An Image Guided Exploration Of Human-like Thermogenic Tissues in Mice
    Oz, O. K.
    Zhang, F.
    Hao, G.
    Hassan, G.
    Shao, M.
    An, Y.
    Wang, Q.
    Kusminski, C.
    Nham, K.
    Zhai, Q.
    Scherer, P.
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2017, 44 : S241 - S241
  • [34] Inhibition of vascular endothelial growth factor receptor under hypoxia causes severe, human-like pulmonary arterial hypertension in mice: Potential roles of interleukin-6 and endothelin
    Tran Van Hung
    Emoto, Noriaki
    Vignon-Zellweger, Nicolas
    Nakayama, Kazuhiko
    Yagi, Keiko
    Suzuki, Yoko
    Hirata, Ken-ichi
    LIFE SCIENCES, 2014, 118 (02) : 313 - 328
  • [35] Isoproterenol exacerbates a long QT phenotype in Kcnq1-deficient neonatal mice:: Possible roles for human-like Kcnq1 isoform 1 and slow delayed rectifier K+ current
    Knollmann, BC
    Casimiro, MC
    Katchman, AN
    Sirenko, SG
    Schober, T
    Rong, Q
    Pfeifer, K
    Ebert, SN
    JOURNAL OF PHARMACOLOGY AND EXPERIMENTAL THERAPEUTICS, 2004, 310 (01): : 311 - 318
  • [36] Human apolipoprotein E2, E3, and E4 isoform-specific transgenic mice: Human-like pattern of glial and neuronal immunoreactivity in central nervous system not observed in wild-type mice
    Xu, PT
    Schmechel, D
    RothrockChristian, T
    Burkhart, DS
    Qiu, HL
    Popko, B
    Sullivan, P
    Maeda, N
    Saunders, AM
    Roses, AD
    Gilbert, JR
    NEUROBIOLOGY OF DISEASE, 1996, 3 (03) : 229 - 245
  • [37] ESTABLISHMENT OF PFIC 3 MOUSE MODEL CARRYING HUMAN-LIKE BILE ACID COMPOSITION BY IN VIVO LIVER-SPECIFIC GENE DELETION USING ADENO-ASSOCIATED VIRUS AND CRISPR/Cas9 SYSTEM
    Tsuruya, Kota
    Kamiya, Akihide
    Mishima, Yusuke
    Arase, Yoshitaka
    Honda, Akira
    Kagawa, Tatehiro
    HEPATOLOGY, 2023, 78 : S113 - S115
  • [38] ESTABLISHMENT OF AN FXR-RELATED PFIC MOUSE MODEL CARRYING HUMAN-LIKE BILE ACID COMPOSITION BY IN VIVO LIVER-SPECIFIC GENE DELETION USING ADENO-ASSOCIATED VIRUS AND CRISPR/CAS9 SYSTEM
    Mishima, Yusuke
    Kamiya, Akihide
    Tsuruya, Kota
    Arase, Yoshitaka
    Honda, Akira
    Kagawa, Tatehiro
    HEPATOLOGY, 2024, 80 : S1808 - S1809
  • [39] Multiple roles for heparin-binding epidermal growth factor-like growth factor are suggested by its cell-specific expression during the human endometrial cycle and early placentation
    Leach, RE
    Khalifa, R
    Ramirez, ND
    Das, SK
    Wang, J
    Dey, SK
    Romero, R
    Armant, DR
    JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 1999, 84 (09): : 3355 - 3363
  • [40] HUMAN H4-HISTONE GENE-TRANSCRIPTION REQUIRES THE PROLIFERATION-SPECIFIC NUCLEAR FACTOR HINF-D - AUXILIARY ROLES FOR HINF-C (SP1-LIKE) AND HINF-A (HIGH MOBILITY GROUP-LIKE)
    VANWIJNEN, AJ
    WRIGHT, KL
    LIAN, JB
    STEIN, JL
    STEIN, GS
    JOURNAL OF BIOLOGICAL CHEMISTRY, 1989, 264 (25) : 15034 - 15042