basic deep architectures (d1l4 deep learning for speech and language)

40
[course site] Day 1 Lecture 4 Basic Deep Architectures Xavier Giró-i-Nieto

Upload: xavier-giro

Post on 13-Apr-2017

225 views

Category:

Data & Analytics


4 download

TRANSCRIPT

Page 2: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

2

The Full Story

F. Van Veen, “The Neural Network Zoo” (2016)

Page 3: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

3

Previously… A Perceptron

J. Alammar, “A visual and interactive guide to the Basics of Neural Networks” (2016)

Page 4: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

4

Previously… A Perceptron

J. Alammar, “A visual and interactive guide to the Basics of Neural Networks” (2016)F. Van Veen, “The Neural Network Zoo” (2016)

Page 5: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

5

Two Perceptrons + Softmax classifier

J. Alammar, “A visual and interactive guide to the Basics of Neural Networks” (2016)

Page 6: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

6

Three perceptrons + Softmax classifier

TensorFlow, “MNIST for ML beginners”

Page 7: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

7TensorFlow, “MNIST for ML beginners”

Three perceptrons + Softmax classifier

Page 8: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

8TensorFlow, “MNIST for ML beginners”

Three perceptrons + Softmax classifier

Page 9: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

9

Neural Network = Multi Layer Perceptron

2 layers of perceptronsF. Van Veen, “The Neural Network Zoo” (2016)

Page 10: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

10

Deep Neural Network (DNN)

F. Van Veen, “The Neural Network Zoo” (2016)

Page 11: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

11

Recurrent Neural Network (RNN)

Alex Graves, “Supervised Sequence Labelling with Recurrent Neural Networks”

The hidden layers and the output depend from previous

states of the hidden layers

Page 12: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

12Alex Graves, “Supervised Sequence Labelling with Recurrent Neural Networks”

The hidden layers and the output depend from previous

states of the hidden layers

Recurrent Neural Network (RNN)

Page 13: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

13

time

time

Rotation 90o

Front View Side View

Rotation 90o

Recurrent Neural Network (RNN)

Page 14: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

14

Recurrent Neural Network (RNN)

More details:D2L2, “Recurrent Neural Networks”

F. Van Veen, “The Neural Network Zoo” (2016)

Page 15: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

15

Recurrent Neural Network (RNN)

More details:D2L2, “Recurrent Neural Networks”

F. Van Veen, “The Neural Network Zoo” (2016)

Page 16: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

16

Recurrent Neural Network (RNN)

Deep Learning TV, “Recurrent Neural Networks - Ep. 9”

Page 18: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

18

Convolutional Neural Network (CNN)

F. Van Veen, “The Neural Network Zoo” (2016)

More details:D1L3, “Convolutional Neural Networks”

Page 19: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

19

Convolutional Neural Network (CNN)

Deep Learning TV, “Convolutional Neural Networks - Ep. 8”

Page 20: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

20

Deconvolutional Neural Network (Deconv)

Junting Pan, SalGAN (2017)F. Van Veen, “The Neural Network Zoo” (2016)

Page 21: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

21

Autoencoder (AE)

“Deep Learning Tutorial”, Dept. Computer Science, Stanford

Autoencoders:● Predict at the output the

same input data.● Do not need labels:

Unsupervisedlearning

Page 22: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

22

Autoencoder (AE)

“Deep Learning Tutorial”, Dept. Computer Science, Stanford

Dimensionality reduction:● Use hidden layer as

a feature extractor of the desired size.

Unsupervisedlearning

Page 23: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

23

Autoencoder (AE)

F. Van Veen, “The Neural Network Zoo” (2016)

Page 24: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

24

Autoencoder (AE)

Deep Learning TV, “Autoencoders - Ep. 10”

Page 25: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

25

Variational Autoencoder (VAE)

Kevin Frans, “Variational Autoencoders explained” (2016)

The latent vector learned in the hidden layer of the basic autoencoder (in green)...

Page 26: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

26

Variational Autoencoder (VAE)

Kevin Frans, “Variational Autoencoders explained” (2016)

...is forced to follow a unit Gaussian distribution in VAEs.

Page 27: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

27

Variantional Autoencoder (VAE)

“Deep Learning Tutorial”, Dept. Computer Science, Stanford

Generative model:● Create new samples

by drawing from a Gaussian distribution.

Unsupervisedlearning

Page 28: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

28

Variantional Autoencoder (VAE)

Alec Radford, “Face manifold from conv/deconv variational autoencoder” (2015)

Page 29: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

29

Variational Autoencoder (VAE)

F. Van Veen, “The Neural Network Zoo” (2016)

Page 30: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

30

Restricted Boltzmann Machine (RBM)

Figure: Geoffrey Hinton (2013)

Salakhutdinov, Ruslan, Andriy Mnih, and Geoffrey Hinton. "Restricted Boltzmann machines for collaborative filtering." Proceedings of the 24th international conference on Machine learning. ACM, 2007.

● Shallow two-layer net.● Restricted=No two nodes in a layer share a

connection● Bipartite graph.● Bidirectional graph

○ Shared weights.○ Different biases.

Page 31: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

31

Restricted Boltzmann Machine (RBM)

Figure: Geoffrey Hinton (2013)

Salakhutdinov, Ruslan, Andriy Mnih, and Geoffrey Hinton. "Restricted Boltzmann machines for collaborative filtering." Proceedings of the 24th international conference on Machine learning. ACM, 2007.

Page 32: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

32

Restricted Boltzmann Machine (RBM)

DeepLearning4j, “A Beginner’s Tutorial for Restricted Boltzmann Machines”.

RBMs are a specific type of autoencoder.

Unsupervisedlearning

Page 33: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

33

Deep Belief Networks (DBN)

Hinton, Geoffrey E., Simon Osindero, and Yee-Whye Teh. "A fast learning algorithm for deep belief nets." Neural computation 18, no. 7 (2006): 1527-1554.

● Architecture like an MLP.● Training as a stack of

RBMs…● ...so they do not need

labels:

Unsupervisedlearning

Page 34: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

34

Deep Belief Networks (DBN)

Hinton, Geoffrey E., Simon Osindero, and Yee-Whye Teh. "A fast learning algorithm for deep belief nets." Neural computation 18, no. 7 (2006): 1527-1554.

After the DBN is trained, it can be fine-tuned with a reduced amount of labels to solve a supervised task with superior performance.

Supervisedlearning

Softm

ax

Page 35: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

35

Deep Belief Networks (DBN)

F. Van Veen, “The Neural Network Zoo” (2016)

More details:D2L1,”Deep Belief Networks”

Page 36: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

36

Restricted Boltzmann Machine (RBM)

Deep Learning TV, “Restricted Boltzmann Machines”.

Page 37: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

37

Deep Belief Networks (DBN)

Deep Learning TV, “Deep Belief Nets”

Page 38: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

38

Deep Belief Networks (DBN)

Geoffrey Hinton, "Introduction to Deep Learning & Deep Belief Nets” (2012)Geoorey Hinton, “Tutorial on Deep Belief Networks”. NIPS 2007.

Page 39: Basic Deep Architectures (D1L4 Deep Learning for Speech and Language)

39

The Full Story

F. Van Veen, “The Neural Network Zoo” (2016)