Automatic Video Descriptor for Human Action Recognition

Authors
Perera, Minoli [1]
Farook, Cassim [1]
Madurapperuma, A. P. [2]
Affiliations
[1] Informat Inst Technol, Dept Comp, Colombo, Sri Lanka
[2] Open Univ Sri Lanka, Dept Elect & Comp Engn, Colombo, Sri Lanka
Keywords
Human Action Recognition; Classification; Support Vector Machine; Machine Learning; Feature Extraction; Action Detection; Tracking
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Assistive software such as screen readers is unable to describe images or videos for visually impaired people. Although recent research has found ways to describe an image automatically, describing the content of a video remains an open problem. Visually impaired people find it difficult to understand video content when the sound gives no indication of what is happening. Video description is currently provided only through digital television, and only for selected programs and movies. As an initiative to describe video content for visually impaired people, the proposed solution acts as a video player that automatically recognizes the human action taking place on screen, associates a textual description with it, and narrates that description to the blind user. Because the human actions in the video must be recognized in real time, fast and reliable feature extraction and classification methods are required. A feature set is extracted for each frame from the projection histograms of the foreground mask. The number of moving pixels in each row and column of the frame is used to identify the instantaneous position of a person. A Support Vector Machine (SVM) classifies the extracted features of each frame. The final classification is obtained by analyzing frames in segments. The classified actions are then converted from text to speech.
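The per-frame features and the segment-level decision described in the abstract can be illustrated with a minimal sketch. It assumes the foreground mask is already available as a binary NumPy array, and it uses a simple majority vote over fixed-length, non-overlapping segments as one plausible reading of "analyzing frames in segments"; the abstract does not state the actual segment rule. The function names (`projection_histograms`, `bounding_box`, `segment_labels`) are hypothetical, and the SVM step is omitted: the per-frame labels are taken as given.

```python
from collections import Counter

import numpy as np


def projection_histograms(mask):
    """Count moving (non-zero) pixels in each row and each column of a mask."""
    mask = np.asarray(mask) > 0
    row_counts = mask.sum(axis=1)  # one count per row
    col_counts = mask.sum(axis=0)  # one count per column
    return row_counts, col_counts


def bounding_box(mask):
    """Estimate the person's position as (top, bottom, left, right).

    Returns None when the mask contains no moving pixels.
    """
    row_counts, col_counts = projection_histograms(mask)
    rows = np.flatnonzero(row_counts)
    cols = np.flatnonzero(col_counts)
    if rows.size == 0:
        return None
    return int(rows[0]), int(rows[-1]), int(cols[0]), int(cols[-1])


def segment_labels(frame_labels, segment_length):
    """Majority vote over consecutive, non-overlapping segments (an assumption)."""
    decisions = []
    for start in range(0, len(frame_labels), segment_length):
        segment = frame_labels[start:start + segment_length]
        decisions.append(Counter(segment).most_common(1)[0][0])
    return decisions
```

In a full pipeline the row and column counts would typically be normalized or resampled to a fixed length before being passed to a classifier, since frame sizes and subject scale vary; that step is likewise not specified in the abstract.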
Pages: 61-66
Page count: 6