deep learning - marc-peter schambach · deep learning from script and address recognition to...

69
Deep Learning From Script and Address Recognition to General Decisions and Solutions

Upload: others

Post on 25-May-2020

13 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Deep Learning

From

Script and Address Recognition

to

General Decisions and Solutions

Page 2: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Marc-Peter Schambach*1971

Dipl. PhysicsPhD Computer Science

Siemens Konstanz, GermanyPostal, Parcel & Airport LogisticsHandwriting Recognition

Page 3: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 4: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 5: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Overview

● Postal Automation● Retinal Stimulus Reconstruction● Handwriting Recognition

– HMM

– RNN● Deep Learning

– Geolocation of Images

– Game of Go

– Playing Atari

Page 6: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

I

Postal Automation

Page 7: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Siemens Postal, Parcel & Airport Logistics GmbH

Page 8: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 9: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 10: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 11: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 12: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 13: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 14: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 15: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

II

Neurobiology

Retinal Stimulus Reconstruction

Page 16: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Retinal Stimulus Reconstruction

Page 17: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Retinal Stimulus Reconstruction

Page 18: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Retinal Stimulus Reconstruction

Page 19: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 20: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 21: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 22: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

III

Handwriting Recognition

Hidden Markov Models

Page 23: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 24: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Cursive Script

Page 25: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Preprocessing

Contour Approximation

Slant Normalization

Base Line Recognition

Skeletization

Feature Generation

Line Approximation

Page 26: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Character Models

Linear TransformationLinear Transformation

Vector QuantizationVector Quantization

Viterbi AlgorithmViterbi Algorithm

Word List New London

lower

upper

lower

upper

joker

print

script

Page 27: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Lexicon-free Recognition

Recurrent HMM

Postprocessing with language models:• N-grams• Lexika• Rules

Page 28: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Viterbi-Decoding

Word models

Prefix-Trie

• Merge word beginnings

• Maximize probability at word endings

Beam-Search

• Calculate only activated nodes

• Deactivate nodes with low likelihood

Page 29: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 30: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

III

Handwriting Recognition

Recurrent Neural Networks

Page 31: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 32: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Recurrent Neural Networks

Page 33: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 34: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Long short term memory

Page 35: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Vanishing Gradient

Page 36: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Multi-dimensional Recurrent Neural Networks

Page 37: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

RNN Training

- Backpropagation

- CTC-Layer: Forward-Backward-Alignmentdetermines error function

- Similar to Expectation-Maximization

- Stabilizing by fixed alignment during first training epochs

Page 38: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Recurrent Neural Networks

Jacobi-Matrix: Shows the input range relevant for classification

Page 39: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Handwriting recognition with RNN

+ WRR cursive handwriting 95% (lexicon 10k)

+ CRR machine print 99,6%

+ Robust

+ Trains language models

- Slower than HMM (training and recognition)

Page 40: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 41: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

III

Reading & InterpretationVision & Understanding

Future Directions

Page 42: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Vision & Understanding: Operation Pipeline

Image Processing Deep Learning Databases, Rules

Objects of Interest Recognition Interpretation

Deep Learning

Deep Learning Databases, Rules

Page 43: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 44: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 45: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

IV

Deep Learning

Recent Applications

Page 46: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Photo Geolocation

with

Convolutional Neural Networks

Page 47: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 48: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 49: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Convolutional Neural Networks

Page 50: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 51: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 52: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 53: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Geolocation Error: PlaNet vs. Humans

Page 54: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

More examples

GeoGuessr Panorama

Ground truth

Human guess

PlaNet guess

Page 55: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Mastering the Game of Go

with

Deep Neural Networks

and Tree Search

Page 56: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

March 2016: AlphaGo defeats Lee Sedol 4–1

December 2015: AlphaGo defeats Fan Hui 5–0

Page 57: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Neural Network Architecture

Page 58: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Neural Network Training Pipeline

Page 59: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Monte Carlo Tree Search

Page 60: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Selecting Moves

Page 61: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

AlphaGo: Results

Page 62: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Playing Atari

with

Deep Reinforcement Learning

Page 63: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Pong Breakout

SpaceInvaders

Beam RiderSeaquest

Page 64: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

BA C

Page 65: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Playing Atari: Results

Page 66: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Round-Up

● Neurobiological background● Script Recognition● Image Classification● Strategy Games● Video Games

What are your problems to be solved?

Page 67: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of
Page 68: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Leonardo da Vinci, 1494–1498

Page 69: Deep Learning - Marc-Peter Schambach · Deep Learning From Script and Address Recognition to General Decisions and Solutions. Marc-Peter Schambach *1971 ... Mastering the Game of

Marc-Peter Schambach*1971

Dipl. PhysicsPhD Computer Science

Siemens Konstanz, GermanyPostal, Parcel & Airport LogisticsHandwriting Recognition