Seeing What You're Told: Sentence-Guided Activity Recognition In Video

被引:12
|
作者
Siddharth, N. [1 ]
Barbu, Andrei [2 ]
Siskind, Jeffrey Mark [3 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] MIT, Cambridge, MA 02139 USA
[3] Purdue Univ, W Lafayette, IN 47907 USA
关键词
D O I
10.1109/CVPR.2014.99
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, providing a medium for top-down and bottom-up integration as well as multi-modal integration between vision and language. We show how the roles played by participants (nouns), their characteristics (adjectives), the actions performed (verbs), the manner of such actions (adverbs), and changing spatial relations between participants (prepositions), in the form of whole-sentence descriptions mediated by a grammar, guides the activity-recognition process. Further, the utility and expressiveness of our framework is demonstrated by performing three separate tasks in the domain of multi-activity video: sentence-guided focus of attention, generation of sentential description, and query-based search, simply by leveraging the framework in different manners.
引用
收藏
页码:732 / 739
页数:8
相关论文
共 22 条
  • [1] The 'Barge (iii) What you're told'
    Williams, JH
    AGENDA, 1999, 37 (01): : 6 - 7
  • [2] Beauty of seeing what you're looking at
    Polygr Int, 1 (38):
  • [3] Doing What You're Told: It's Not That Simple
    Corless, Inge B.
    JANAC-JOURNAL OF THE ASSOCIATION OF NURSES IN AIDS CARE, 2016, 27 (02): : 117 - 120
  • [4] Believing What You're Told: Politeness and Scalar Inferences
    Mazzarella, Diana
    Trouche, Emmanuel
    Mercier, Hugo
    Noveck, Ira
    FRONTIERS IN PSYCHOLOGY, 2018, 9
  • [5] What to do when you're a midwife - and told you should choose another career
    Williams, Kara
    WOMEN AND BIRTH, 2023, 36 : S4 - S5
  • [6] Trusting What You're Told: How Children Learn From Others
    Litkowski, Ellen
    Renken, Maggie
    SCIENCE EDUCATION, 2013, 97 (05) : 797 - 799
  • [7] Trusting What You're Told: How Children Learn from Others
    Dore, Rebecca A.
    Lillard, Angeline S.
    Jaswal, Vikram K.
    JOURNAL OF COGNITION AND DEVELOPMENT, 2014, 15 (03) : 520 - 523
  • [8] Seeing what you hear: Visual feedback improves pitch recognition
    Eldridge, Marcus
    Saltzman, Elliot
    Lahav, Amir
    EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 2010, 22 (07): : 1078 - 1091
  • [9] Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert
    Wang, Jiadong
    Qian, Xinyuan
    Zhang, Malu
    Tan, Robby T.
    Li, Haizhou
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14653 - 14662
  • [10] Testing What You're Told: Young Children's Empirical Investigation of a Surprising Claim
    Ronfard, Samuel
    Chen, Eva E.
    Harris, Paul L.
    JOURNAL OF COGNITION AND DEVELOPMENT, 2021, 22 (03) : 426 - 447