an introduction to deep learning · deep learning an introduction to ... •deep learning •feed...

38
Deep Learning An Introduction to By Rahil Mahdian December 12-13, 2017 1

Upload: others

Post on 01-Sep-2020

37 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Deep Learning

An Introduction to

By Rahil Mahdian – December 12-13, 2017

1

Page 2: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Outline • Machine Learning

• Learning Strategies

• Neural Network Learning

• Deep Learning

• Feed Forward Network

• Problems of Deep Learning

• AutoEncoders

• Restricted Boltzmann Machines

• Convolutional Neural Networks

• Recurrent Neural Networks (RNNs, LSTM)

• Deep Learning Applications

2

Page 3: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Scope of Machine Learning

3

Nando de Freitas, Oxford

Page 4: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

When to Apply Machine Learning

4

Nando de Freitas, Oxford

Page 5: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Machine Learning Pioneering ( C. Shannon 1961)

5

Page 6: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Learning Types

6

Supervised Learning UnSupervised Learning

Semi-Supervised Learning

Page 7: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Machine Learning vs Deep Learning

7

Page 8: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Perceptron – Single Neuron element, Rosenblatt 1958

8

e.g. Sigmoidal function

Page 9: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Neural Networks (MLP)

9

Page 10: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Training schemes (SGD, Batch, MiniBatch)

10

SGD Batch Mini-Batch

Page 11: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

MLP - Function Approximation

11

Page 12: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Deep Motivation- Different Layers of Abstraction

12

Page 13: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Deep Neural Networks

13

Page 14: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Feed Forward Neural Networks

14

Page 15: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Training DNNs - Backpropagation

15

Page 16: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Deep NN - Training Problem

16

The back-propagation encounters the three following difficulties in the training process of deep neural networks:

Vanishing Gradient- output error fails to reach the farther back nodes Overfitting Computational Load

Page 17: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Vanishing Gradient Solutions

17

Page 18: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Overfitting – Generalization Problem

18

Bishop

Page 19: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Model Complexity

19

Page 20: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Data Matters

20

Among competing hypotheses, the one with the fewest assumptions should be selected.

In the related concept of overfitting, excessively complex models are affected by statistical

noise (a problem also known as the bias-variance trade-off), whereas simpler models may

capture the underlying structure better and may thus have better predictive performance.

Hoeffding’s inequalities:

Failure rate

Empirical error rate

Occam's razor: William of Ockham (c. 1287–1347)

# of model parameters

True error rate

Number of sufficient samples

A 32-bits floating point computer

Page 21: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Overfitting Solutions – Dropout & Regularization

21

50% of hidden layers, and 25% for the input layer

Rule of thumb:

Regularization:

Drop out:

Add a norm of the weights to the cost function. (l1-norm, l2-norm)

Data Augmentation is also a way to avoid overfitting; i.e., adding noise, translating data, etc.

Page 22: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

AutoEncoder- Nonlinear dimensionality reduction

22

Page 23: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Restricted Boltzmann Machines (RBM)

23

Hugo Larochelle

Page 24: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Unsupervised Pre-training – another solution

24

Page 25: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Convolution Neural Network (Lecun et al. 1993, LeNet)

25

Page 26: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Convolution NN - Architecture

26

Photo: Phil Kim

Page 27: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

ConvNet – How it works?

27

Vincent Vanhoucke- Google

Page 28: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Convolutional NN – (LeCun, Fukushima)

28

AlexNet - 2012

Page 29: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

ConvNet for Speech

29

Page 30: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

CNN Structures

30

Page 31: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

LSTM and RNNs – sequential data

31

Page 32: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

LSTM Training

32

Page 33: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

RNNs & Multi-Hypothesis Tracking- BeamSearch

33

Vincent Vanhoucke- Google

Page 34: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Captioning & Translation - RNNs

34

Vincent Vanhoucke- Google

Page 35: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Deep Learning Applications: Computer Vision

35

Page 36: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

DNN Application - Caption Generation

36

Page 37: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

Wrap Up

37

Machine Learning Influence Learning Neural Networks Deep Learning Motivation, Problems, Solutions Unsupervised Neural Networks: AutoEncoders, Restricted Boltzmann Machines Deep Learning Training Solutions Feed Forward Neural Networks as MLPs Convolutional Neural Networks Recurrent Neural Networks for sequential data LSTMs as a generalization of RNNs Applications of DNNs

Page 38: An Introduction to Deep Learning · Deep Learning An Introduction to ... •Deep Learning •Feed Forward Network •Problems of Deep Learning •AutoEncoders •Restricted Boltzmann

38

Thanks for attending the Talk.

Questions?