HIGH-LEVEL SEMANTIC PHOTOGRAPHIC COMPOSITION ANALYSIS AND UNDERSTANDING WITH DEEP NEURAL NETWORKS

被引:0
|
作者
Wu, Min-Tzu [1 ]
Pan, Tse-Yu [1 ]
Tsai, Wan-Lun [1 ]
Kuo, Hsu-Chan [2 ]
Hu, Min-Chun [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan
[2] Natl Cheng Kung Univ, Inst Educ, Tainan, Taiwan
关键词
Photographic Composition; Visual Art; Deep Learning; High-level Semantic Feature;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In order to take better photos, it is a fundamental step for the beginners of photography to learn basic photo composition rules. However, there are no tools developed to help beginners analyze the composition rules in given photos. Thus, in this study we developed a system with the capability to identify 12 common composition rules in a photo. It should be noted that some of the 12 common composition rules have not been considered by the previous studies, and this deficit gives this study its significance and appropriateness. In particular, we utilized deep neural networks (DNN) to extract high-level semantic features for facilitating the further analysis of photo composition rules. In order to train the DNN model, our research team constructed a dataset, which is collected from some famous photo websites, such as DPChallenge, Flicker, and Unsplash. All the collected photos were later labelled with 12 composition rules by a wide range of raters recruited from Amazon Mechanical Turk (AMT). Two DNN architectures (Alex Net and GoogLeNet) were then employed to build our system based on the collected dataset. The representative features of each composition rule were further visualized in our system. The results showed the feasibility of the proposed system and revealed the possibility of using this system to assist potential users to improve their photographical skills and expertise.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Semantic language models with deep neural networks
    Bayer, Ali Orkan
    Riccardi, Giuseppe
    COMPUTER SPEECH AND LANGUAGE, 2016, 40 : 1 - 22
  • [42] A high-level tool for the development of FPLD-based stochastic neural networks
    Maunder, B
    Salcic, Z
    Coghill, G
    ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 684 - 688
  • [43] Capturing High-Level Semantic Correlations via Graph for Multimodal Sentiment Analysis
    Qian, Fan
    Han, Jiqing
    Guan, Yadong
    Song, Wenjie
    He, Yongjun
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 561 - 565
  • [44] High-Level Feature Guided Decoding for Semantic Segmentation
    Huang, Ye
    Kang, Di
    Gao, Shenghua
    Li, Wen
    Duan, Lixin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8281 - 8291
  • [45] Indoor Image Representation by High-Level Semantic Features
    Sitaula, Chiranjibi
    Xiang, Yong
    Zhang, Yushu
    Lu, Xuequan
    Aryal, Sunil
    IEEE ACCESS, 2019, 7 : 84967 - 84979
  • [46] Specifying and enforcing high-level semantic obligation policies
    Liu, Zhen
    Ranganathan, Anand
    Riabov, Anton
    JOURNAL OF WEB SEMANTICS, 2009, 7 (01): : 28 - 39
  • [47] Scene Segmentation and Semantic Representation for High-Level Retrieval
    Zhu, Songhao
    Liu, Yuncai
    IEEE SIGNAL PROCESSING LETTERS, 2008, 15 : 713 - 716
  • [48] Using high-level semantic features in video retrieval
    Zheng, Wujie
    Li, Jianmin
    Si, Zhangzhang
    Lin, Fuzong
    Zhang, Bo
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 370 - 379
  • [49] An Intelligent Assistant for High-Level Task Understanding
    Sun, Ming
    Chen, Yun-Nung
    Rudnicky, Alexander I.
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES (IUI'16), 2016, : 169 - 174
  • [50] Specifying and enforcing high-level semantic obligation policies
    Liu, Zhen
    Ranganathan, Anand
    Riabov, Anton
    EIGHTH IEEE INTERNATIONAL WORKSHOP ON POLICIES FOR DISTRIBUTED SYSTEMS AND NETWORKS - PROCEEDINGS, 2007, : 119 - +