enhancenet - webdav.tuebingen.mpg.dewebdav.tuebingen.mpg.de/pixel/enhancenet/files/... · ¤ drcn,...
TRANSCRIPT
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
Single Image Super-ResolutionThrough Automated Texture Synthesis
EnhanceNet
Max Planck Institutefor Intelligent Systems
state of the art by PSNR our result
Mehdi S. M. Sajjadi
BernhardSchölkopf
Michael Hirsch
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
Single Image Super-Resolution
2
Low-resolution input
?
Original high-res image
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
Method: Classical approach
¤ Minimize mean-squared error¤ MSE = %
&'∑ ∑ 𝐼*+, − 𝐼+,
./,
0+
¤ Evaluation by Peak signal-to-noise ratio (PSNR)¤ PSNR = −10 log%6 𝑀𝑆𝐸
¤ The field is dominated by convolutional neural nets¤ SRCNN (Dong et al., ECCV 2014)¤ DRCN, VDSR (Kim et al., CVPR 2016)¤ NTIRE challenge on image super-resolution (CVPR 2017)
3
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
State of the art by PSNR
4
CNN
Low-resolution input High-resolution output
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
Is PSNR the right metric?
5
Low-resolution input Original imageGenerated image
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
EnhanceNet’s output
6
Low-resolution input Original imageOur result
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
PSNR vs. visual similarity
7
High-resolutionimage
Low-resolutionimage
Optimal PSNR Realistic imageLow PSNR
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
Method: residual CNN with loss
¤ Euclidean distance / mean squared error (MSE)
¤ 𝐼* − 𝐼.
.= %
:;<∑ 𝐼*+, − 𝐼*+,
.�+,
¤ Perceptual loss (Dosovitskiy and Brox 2016, Johnson et al. 2016)
¤ 𝜙 𝐼* − 𝜙 𝐼.
.MSE in VGG feature space
¤ Texture loss / style transfer (Gatys et al. 2015)
¤ 𝐺 𝜙 𝐼* − 𝐺 𝜙 𝐼.
.MSE of correlation in VGG feature space
¤ Adversarial loss / GAN (Goodfellow et al. 2014)
¤ 𝐷 𝐼* , 𝐷 𝐼 ∈ 0, 1 discriminator rates realism of image patches
8
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
Best result by PSNR vs. our best
9
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
Evaluation
¤ PSNR/SSIM/IFC: ENet-E SOTA; ENet-PAT low scores
¤ Survey: ENet-PAT is preferred over ENet-E in 91.0% images
¤ Object recognition image quality benchmark¤ Feed ImageNet through super-resolution models¤ Run pre-trained object recognition network on results¤ ENet-PAT leads to lowest error
10
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
Comparison w/ other methods
11
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
A closer look at ENet-PAT
12
EnhanceNet: Single-Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch msajjadi.com
Conclusion
¤ Introduce a novel combination of loss functions for single image super-resolution
¤ State of the art in quantitative + qualitative benchmarks
¤ Propose object detection image quality benchmark¤ Let’s see if it works in other domains as well
¤ Outlook: lots of room for improvements¤ Perceptual evaluation still an unsolved problem
13