HIGH-LEVEL SEMANTIC PHOTOGRAPHIC COMPOSITION ANALYSIS AND UNDERSTANDING WITH DEEP NEURAL NETWORKS

被引：0

作者：

Wu, Min-Tzu ^{[1
]}

Pan, Tse-Yu ^{[1
]}

Tsai, Wan-Lun ^{[1
]}

Kuo, Hsu-Chan ^{[2
]}

Hu, Min-Chun ^{[1
]}

机构：

[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan

[2] Natl Cheng Kung Univ, Inst Educ, Tainan, Taiwan

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW) | 2017年

关键词：

Photographic Composition; Visual Art; Deep Learning; High-level Semantic Feature;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In order to take better photos, it is a fundamental step for the beginners of photography to learn basic photo composition rules. However, there are no tools developed to help beginners analyze the composition rules in given photos. Thus, in this study we developed a system with the capability to identify 12 common composition rules in a photo. It should be noted that some of the 12 common composition rules have not been considered by the previous studies, and this deficit gives this study its significance and appropriateness. In particular, we utilized deep neural networks (DNN) to extract high-level semantic features for facilitating the further analysis of photo composition rules. In order to train the DNN model, our research team constructed a dataset, which is collected from some famous photo websites, such as DPChallenge, Flicker, and Unsplash. All the collected photos were later labelled with 12 composition rules by a wide range of raters recruited from Amazon Mechanical Turk (AMT). Two DNN architectures (Alex Net and GoogLeNet) were then employed to build our system based on the collected dataset. The representative features of each composition rule were further visualized in our system. The results showed the feasibility of the proposed system and revealed the possibility of using this system to assist potential users to improve their photographical skills and expertise.

引用

页数：6

共 50 条

[41] Semantic language models with deep neural networks
Bayer, Ali Orkan
Riccardi, Giuseppe
COMPUTER SPEECH AND LANGUAGE, 2016, 40 : 1 - 22
[42] A high-level tool for the development of FPLD-based stochastic neural networks
Maunder, B
Salcic, Z
Coghill, G
ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 684 - 688
[43] Capturing High-Level Semantic Correlations via Graph for Multimodal Sentiment Analysis
Qian, Fan
Han, Jiqing
Guan, Yadong
Song, Wenjie
He, Yongjun
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 561 - 565
[44] High-Level Feature Guided Decoding for Semantic Segmentation
Huang, Ye
Kang, Di
Gao, Shenghua
Li, Wen
Duan, Lixin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8281 - 8291
[45] Indoor Image Representation by High-Level Semantic Features
Sitaula, Chiranjibi
Xiang, Yong
Zhang, Yushu
Lu, Xuequan
Aryal, Sunil
IEEE ACCESS, 2019, 7 : 84967 - 84979
[46] Specifying and enforcing high-level semantic obligation policies
Liu, Zhen
Ranganathan, Anand
Riabov, Anton
JOURNAL OF WEB SEMANTICS, 2009, 7 (01): : 28 - 39
[47] Scene Segmentation and Semantic Representation for High-Level Retrieval
Zhu, Songhao
Liu, Yuncai
IEEE SIGNAL PROCESSING LETTERS, 2008, 15 : 713 - 716
[48] Using high-level semantic features in video retrieval
Zheng, Wujie
Li, Jianmin
Si, Zhangzhang
Lin, Fuzong
Zhang, Bo
IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 370 - 379
[49] An Intelligent Assistant for High-Level Task Understanding
Sun, Ming
Chen, Yun-Nung
Rudnicky, Alexander I.
PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES (IUI'16), 2016, : 169 - 174
[50] Specifying and enforcing high-level semantic obligation policies
Liu, Zhen
Ranganathan, Anand
Riabov, Anton
EIGHTH IEEE INTERNATIONAL WORKSHOP ON POLICIES FOR DISTRIBUTED SYSTEMS AND NETWORKS - PROCEEDINGS, 2007, : 119 - +

← 1 2 3 4 5 →