making machine understand beauty: a photography · disclaimer this presentation contain images from...

54
Making Machine Understand Beauty: A Photography Perspective Saimunur Rahman Advanced Robotic Vision © IEEE TMM (Lu et al., 2015) Technical Sharing

Upload: others

Post on 21-Mar-2020

19 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Making Machine Understand

Beauty: A Photography Perspective

Saimunur RahmanAdvanced Robotic Vision

© IEEE TMM (Lu et al., 2015)

Technical Sharing

Page 2: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Disclaimer

● This presentation contain images from various sources collected using Google Image search. If any of slides left source description, please feel free to consider Google Search as the source.

● This presentation contain materials and results which are private and confidential, and currently under publication and patent filing process. Please maintain the Vitrox honor code and do not disclose outside.

Page 3: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Video URL: https://www.youtube.com/watch?v=DyxgDM8O8OM

Page 4: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source
Page 5: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

One picture is worth a thousand words.

~ chinese proverb

Page 6: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

6Video URL: https://www.youtube.com/watch?v=YX8vvvvMLHI

Page 7: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Beautiful Capture “Beauty is in the eye of the beholder”

A SUBJECTIVE CONJECTURE

● Everyone has different taste!

● Universal acceptance of beauty

○ Sunrise and sunset

○ Blue ocean, Mount Everest

Amatuer

Professional

7

Page 8: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Beauty affecting photographic factors

Beautiful Capture

IMAGE CAPTURING TECHNIQUES

a. http://www.brandon-schaefer.com/wp-content/uploads/2013/07/composition-03.jpgb. https://s-media-cache-ak0.pinimg.com/originals/4c/25/2d/4c252dcad46ae590e6298caf61e5fc5a.jpg

a

b

Rule of thirds Rule of thirds+POI

Normal image Lighting+effects

Lighting ContrastImage

Compos.

8

Page 9: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Vs.

Good Capture Bad Capture

Source: Wang, Yeqing, Yi Li and Fatih Murat Porikli. “Fine-tuning Convolutional Neural Networks for visual aesthetics.” ICPR (2016). 9

Page 10: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

In photography, beauty = aesthetics !

10

Page 11: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

1815-1825

Greek aisthētikós:

- aisthēt (ḗs) - a person who affects great love of art, music, poetry, etc.,

- ikos or -ic - occurs in nouns that represent a substantive use of adjectives

11Image source: http://www.dictionary.com/browse/aesthetic

Page 12: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

12

Page 13: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Okay, What makes a higher

aesthetic image?

Image source: a. http://www.ideachampions.com/weblogs/Confused.jpg, b. https://expertfile.com/speakers/img/ExpertCroppedHomePage.png

Well, judged by experts based on common photo-graphic rules

a b13

Page 14: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

14Video URL: https://www.youtube.com/watch?v=PPSDexnuZKs

Page 15: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

“2” dominating factors for comps. rules

Lighting Contrast&

15

Page 16: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

How to make machine understand the sense of beauty a.k.a. aesthetics?

Image Source: http://www.quickmeme.com/too-much-feel16

Page 17: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

higher aesthetic image = more likes!

Facebook posts with images see 2.3X more engagement

Source: https://blog.hubspot.com/marketing/visual-content-marketing-strategy17

Page 18: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Possible applications

Image gallery

a. http://ezoui.com/gallery/img/03_Grid.pngb. https://s-media-cache-ak0.pinimg.com/736x/df/57/95/df5795c199c0d28d3a1eb476c349739a.jpgc. https://screenshots.en.sftcdn.net/en/scrn/79000/79347/video-thumbnails-maker-28.jpg

Video thumbnailMultimedia archives

a b c 18

Page 19: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Case study: EyeEm

19Video URL: https://player.vimeo.com/video/154364175

Page 20: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Outline of talk

● Learning for image aesthetics evaluations

● Recent progress on image aesthetics evaluations

● My work on image aesthetics evaluations

● Image popularity vs. image aesthetics

● Image quality vs. image aesthetics

● Demo for image aesthetics evaluations

● Sharing by two aesthetics evaluation researchers

20

Page 21: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Machine learning for aesthetics prediction

21

Page 22: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Image aesthetics ranking

Good Bad

Binary ranking Fuzzy ranking

22

[1 ………….. 0]

Page 23: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Common binary ranking pipeline

23

Higher ranked images

Lower ranked images

Feature Extraction

Train Classifier

Good/Bad

Feature Extraction

Test ImageTrained

Classifier

Training Phase

Testing Phase

Good/Bad

Page 24: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Common fuzzy ranking pipeline

24

Higher to lower ranked images

Feature Extraction

Train Regressor

[0 ... 1]

Feature Extraction

Test ImageTrained

Regressor

Training Phase

Testing Phase

[0 ... 1]

Page 25: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Aesthetics inference

25

Datasets

Image source: http://vision.stanford.edu/Datasets/collage_s.png

Page 26: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

26

Photo.net DPChallenge AVA

~20k10 rating/imgRating: [0-7]

~17kBinary

ranking/img

~250k78-549 rating/img

Rating: [0-10]

Page 27: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

27

Guidelines for judging images

it looks good

attracts/holds attention

has an interesting

composition

has great use of color

contains drama, humor, impact

(if sports) peak

moment, struggle

of athlete

Raking in Photo.net

Images were collected from numerous sources through Google search

Page 28: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Aesthetic Feature Extraction

28Image source: http://www.kdnuggets.com/wp-content/uploads/feature-extraction.jpg

Page 29: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

29

Generic image features

Low level High level

Edge distribution Rule of third

Color histogram Golden ratio

Slide courtesy: Magzhan Kiranbay

Page 30: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Slide courtesy: Magzhan Kiranbay

Page 31: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Generic image feature types

31

Global Local

example: GIST example: SIFT

Page 32: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Global Features: GIST

32Image courtesy: Nguyen, Quang-Khue, Thi-Lan Le, and Ngoc-Hai Pham. "Leaf based plant identification system for android using surf features in combination with bag of words model and supervised learning." In Advanced Technologies for Communications (ATC), 2013 International Conference on, pp. 404-407. IEEE, 2013.

GIST Estimation

Page 33: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Local Feature: SIFT

33Image source: https://www.codeproject.com/KB/recipes/619039/SIFT.JPG

DoG in scales

Page 34: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Video URL: https://www.youtube.com/watch?v=6G8QdOID3EQ

Page 35: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Additional step for local features:

global feature transformation or holisticization

35

Input Image Feature extraction

Expected outcome

General expectation from a descriptor

Input Image Feature extraction … … …

… … …

Local feature outcome

Holistic transformer known as feature encoder such as Bag-of-Features, Sparse Coding, LLC etc.

Page 36: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Generic/Task-specific deep features

36

● Use popular pre-trained models on datasets such as ImageNet on aesthetics images● Fine-tune deep architecture weights with aesthetics images● Any intermediate till end of layers can be used as features!

Sample CNN architecture image source: https://upload.wikimedia.org/wikipedia/commons/6/63/Typical_cnn.png

Page 37: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Classifier/Regressor

Classification: if “y” is discrete/categorical variable

Regression: if “y” is real number/continuous

37

f:x y

Page 38: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Recent progress on Aesthetic Evaluations

38

Page 39: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

39

Typical aestheticevaluation pipeline

Image reproduced from Deng, Yubin, Chen Change Loy, and Xiaoou Tang. "Image Aesthetic Assessment: An Experimental Survey." arXiv preprint arXiv:1610.00838 (2016).

Page 40: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Hand-engineered features

● Simple Image Features: color, brightness, contrast, texture, simplicity,composition geometry … [global features]

● Image Composition Features: salient regions, composition regions [global+local features]

● General-Purpose Features: SIFT, HOG, GIST etc. [mostly local features]

● Task-Specific Features: known image nature [global+local features]

40Source: Deng, Yubin, Chen Change Loy, and Xiaoou Tang. "Image Aesthetic Assessment: An Experimental Survey." arXiv preprint arXiv:1610.00838 (2016).

Page 41: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Deep learned features

● Generic Deep Features

○ Single stream architecture

○ Multi-stream architecture

● Pretrained/tuned features

● Mostly CNNs

41

Single stream architecture

Multi-stream architecture

Image reproduced from Deng, Yubin, Chen Change Loy, and Xiaoou Tang. "Image Aesthetic Assessment: An Experimental Survey." arXiv preprint arXiv:1610.00838 (2016).

Page 42: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Some results: DPChallenge (CUHK-PQ) dataset

42

CHUK-PQ Dataset: ~4.5k positive and ~13k negative images

Source: Deng, Yubin, Chen Change Loy, and Xiaoou Tang. "Image Aesthetic Assessment: An Experimental Survey." arXiv preprint arXiv:1610.00838 (2016).

Page 43: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Some results: AVA dataset

43

AVA Dataset: ~176k positive and ~74k negative images

Source: Deng, Yubin, Chen Change Loy, and Xiaoou Tang. "Image Aesthetic Assessment: An Experimental Survey." arXiv preprint arXiv:1610.00838 (2016).

Page 44: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

CNN Transfer Learning for Image Aesthetics Evaluations

44

Slides removed due to publication issues!

Page 45: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Real-time Aesthetics Evaluation of South-Asian Selfies

45

Slides removed due to publication issues!

Page 46: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Image Popularityvs.

Image Aesthetics

46

Page 47: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Image popularity vs. aesthetics

47Image courtesy: Khosla, Aditya, Atish Das Sarma, and Raffay Hamid. "What makes an image popular?." In Proceedings of the 23rd international conference on World wide web, pp. 867-876. ACM, 2014.

Page 48: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

What makes an image popular? (Khosla et al. 2014)

48Slide courtesy: Aditya Khosla

Page 49: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Image qualityvs.

Image Aesthetics

49

Page 50: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Image quality vs. image aesthetics

● Image quality measure (IQM) used for objective quality assessment

● Typically used for image restoration

○ Superresolution

○ De-blur

○ de-artifacts

● Usually have a reference image for comparison

● IQM is not designed to measure the subjective nature of human perceived aesthetic quality

○ May generate misleading results

50

Page 51: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Interesting Observation

51

Aesthetics Score: 0.23

Page 52: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

Interesting Observation (2)

52

Aesthetics Score: 0.76

Page 53: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

53

Thank you

Page 54: Making Machine Understand Beauty: A Photography · Disclaimer This presentation contain images from various sources collected using Google Image search. If any of slides left source

54

Got Any Question?