deep belief networks (d2l1 deep learning for speech and language upc 2017)

Day 2 Lecture 1

Deep Belief Networks (DBN)

Elisa Sayrol

https://imatge.upc.edu/web/people/elisa-sayrol


Overview

● Restricted Boltzmann Machine (RBM)
● Training. Contrastive Divergence (CD)


Restricted Boltzmann Machine (RBM)

Figure: Geoffrey Hinton (2013)

Salakhutdinov, Ruslan, Andriy Mnih, and Geoffrey Hinton. "Restricted Boltzmann machines for collaborative filtering." Proceedings of the 24th international conference on Machine learning. ACM, 2007.

● Shallow two-layer net.
● Restricted = no two nodes in a layer share a connection.
● Bipartite graph.
● Bidirectional graph.
  ○ Shared weights.
  ○ Different biases.

http://machinelearning.wustl.edu/mlpapers/paper_files/icml2007_SalakhutdinovMH07.pdf
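The structure listed above (two layers, one shared weight matrix, separate biases per layer) can be sketched in NumPy. This is a minimal illustration, not code from the lecture: the layer sizes, the variable names `W`, `b`, `c`, and the use of the standard binary-RBM energy function are assumptions made here.

```python
import numpy as np

rng = np.random.default_rng(0)
n_visible, n_hidden = 6, 3  # hypothetical layer sizes

# One weight matrix, used in both directions (shared weights).
W = rng.normal(0.0, 0.01, size=(n_visible, n_hidden))
b = np.zeros(n_visible)  # visible-layer biases
c = np.zeros(n_hidden)   # hidden-layer biases (different from the visible ones)

def energy(v, h):
    """Standard binary-RBM energy: E(v, h) = -b.v - c.h - v.W.h"""
    return -(b @ v) - (c @ h) - (v @ W @ h)

# There are no visible-visible or hidden-hidden terms in the energy:
# this is what makes the graph bipartite ("restricted").
v = rng.integers(0, 2, size=n_visible).astype(float)
h = rng.integers(0, 2, size=n_hidden).astype(float)
print(energy(v, h))
```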









DeepLearning4j, “A Beginner’s Tutorial for Restricted Boltzmann Machines”.

Forward pass

https://deeplearning4j.org/restrictedboltzmannmachine
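A forward pass can be sketched as follows, assuming binary units with sigmoid activations (the usual choice for RBMs, though the slide does not state it). The names `forward_pass`, `W` and `c` are hypothetical and chosen for this illustration only.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def forward_pass(v, W, c, rng):
    """Visible -> hidden: activation probabilities, then a binary sample."""
    p_h = sigmoid(v @ W + c)                         # p(h_j = 1 | v)
    h = (rng.random(p_h.shape) < p_h).astype(float)  # stochastic binary states
    return p_h, h

rng = np.random.default_rng(0)
W = rng.normal(0.0, 0.01, size=(6, 3))  # hypothetical 6 visible, 3 hidden units
c = np.zeros(3)                         # hidden biases
v = np.array([1.0, 0.0, 1.0, 1.0, 0.0, 0.0])

p_h, h = forward_pass(v, W, c, rng)
print(p_h, h)
```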




Backward pass

The reconstructed values at the visible layer are compared with the actual ones with the KL Divergence.


https://en.wikipedia.org/wiki/Kullback%E2%80%93Leibler_divergence
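The backward pass and the comparison with the input can be sketched as below. Two assumptions are made here and are not taken from the slides: the units are binary with sigmoid activations, and the KL divergence is computed per visible unit between Bernoulli distributions and then summed. The function names are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def backward_pass(h, W, b):
    """Hidden -> visible: same weights, transposed, with the visible biases."""
    return sigmoid(h @ W.T + b)

def bernoulli_kl(p, q, eps=1e-7):
    """Sum over units of KL(Bernoulli(p_i) || Bernoulli(q_i)); clipped for stability."""
    p = np.clip(p, eps, 1.0 - eps)
    q = np.clip(q, eps, 1.0 - eps)
    return float(np.sum(p * np.log(p / q) + (1.0 - p) * np.log((1.0 - p) / (1.0 - q))))

rng = np.random.default_rng(0)
W = rng.normal(0.0, 0.01, size=(6, 3))  # hypothetical sizes
b = np.zeros(6)                         # visible biases
c = np.zeros(3)                         # hidden biases
v = np.array([1.0, 0.0, 1.0, 1.0, 0.0, 0.0])

h = (rng.random(3) < sigmoid(v @ W + c)).astype(float)  # forward pass sample
v_recon = backward_pass(h, W, b)                        # reconstruction
kl = bernoulli_kl(v, v_recon)
print(kl)
```

With untrained weights the reconstruction is close to 0.5 everywhere, so the divergence is large; training aims to reduce it.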




What are the Maths behind RBMs? (Estimation of the parameters)

Geoffrey Hinton, “Introduction to Deep Learning & Deep Belief Nets” (2012).
Geoffrey Hinton, “Tutorial on Deep Belief Networks”. NIPS 2007.

https://www.youtube.com/watch?v=GJdWESd543Y

https://www.cs.toronto.edu/~hinton/nipstutorial/nipstut3.pdf


What are the Maths behind RBMs?

Other references:

Deeplearning.net: Restricted Boltzmann Machines (with Theano functions and concepts)
Hugo Larochelle: Course on NN

Let’s take a look at some of his slides on RBM...

http://deeplearning.net/tutorial/rbm.html#id1

http://info.usherbrooke.ca/hlarochelle/neural_networks/content.html


What are the Maths behind RBMs?

Hugo Larochelle Slides
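As a rough companion to the parameter-estimation material, one step of Contrastive Divergence with a single Gibbs step (CD-1) can be sketched in NumPy. This is a minimal sketch under assumptions: binary units, a mini-batch of shape (n, n_visible), and hypothetical names (`cd1_update`, `lr`). It is not the code behind the referenced slides.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_update(v0, W, b, c, rng, lr=0.1):
    """One CD-1 update on a batch v0; returns new (W, b, c)."""
    n = v0.shape[0]
    # Positive phase: hidden probabilities and samples given the data.
    p_h0 = sigmoid(v0 @ W + c)
    h0 = (rng.random(p_h0.shape) < p_h0).astype(float)
    # Negative phase: one Gibbs step (reconstruct visible, recompute hidden).
    p_v1 = sigmoid(h0 @ W.T + b)
    v1 = (rng.random(p_v1.shape) < p_v1).astype(float)
    p_h1 = sigmoid(v1 @ W + c)
    # Difference between data-driven and reconstruction-driven statistics.
    W_new = W + lr * (v0.T @ p_h0 - v1.T @ p_h1) / n
    b_new = b + lr * (v0 - v1).mean(axis=0)
    c_new = c + lr * (p_h0 - p_h1).mean(axis=0)
    return W_new, b_new, c_new

rng = np.random.default_rng(0)
data = rng.integers(0, 2, size=(20, 6)).astype(float)  # toy binary data
W = rng.normal(0.0, 0.01, size=(6, 3))
b = np.zeros(6)
c = np.zeros(3)
for _ in range(10):
    W, b, c = cd1_update(data, W, b, c, rng)
```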



Hinton, Geoffrey E., Simon Osindero, and Yee-Whye Teh. "A fast learning algorithm for deep belief nets." Neural computation 18, no. 7 (2006): 1527-1554.

● Architecture like an MLP.
● Training as a stack of RBMs.

http://www.mitpressjournals.org/doi/pdfplus/10.1162/neco.2006.18.7.1527







● The stacked RBMs do not need labels: unsupervised learning.
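Greedy layer-wise pretraining of a stack of RBMs can be sketched as below. This is a simplified illustration under assumptions: each RBM is trained with full-batch CD-1 using probabilities in the negative phase, the hidden probabilities of one RBM are fed as input to the next, and all names (`train_rbm`, `pretrain_dbn`) and sizes are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_rbm(data, n_hidden, rng, epochs=20, lr=0.1):
    """Train one RBM with CD-1 on unlabeled data; returns (W, b, c)."""
    n, n_visible = data.shape
    W = rng.normal(0.0, 0.01, size=(n_visible, n_hidden))
    b = np.zeros(n_visible)
    c = np.zeros(n_hidden)
    for _ in range(epochs):
        p_h0 = sigmoid(data @ W + c)
        h0 = (rng.random(p_h0.shape) < p_h0).astype(float)
        p_v1 = sigmoid(h0 @ W.T + b)
        p_h1 = sigmoid(p_v1 @ W + c)
        W += lr * (data.T @ p_h0 - p_v1.T @ p_h1) / n
        b += lr * (data - p_v1).mean(axis=0)
        c += lr * (p_h0 - p_h1).mean(axis=0)
    return W, b, c

def pretrain_dbn(data, layer_sizes, rng):
    """Train RBMs one at a time; no labels are used anywhere."""
    layers = []
    x = data
    for n_hidden in layer_sizes:
        W, b, c = train_rbm(x, n_hidden, rng)
        layers.append((W, b, c))
        x = sigmoid(x @ W + c)  # hidden probabilities feed the next RBM
    return layers

rng = np.random.default_rng(0)
data = rng.integers(0, 2, size=(32, 8)).astype(float)  # toy unlabeled data
layers = pretrain_dbn(data, [6, 4], rng)
print([W.shape for W, _, _ in layers])
```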







After the DBN is trained, it can be fine-tuned with a small number of labels to solve a supervised task with improved performance.

Supervised learning

Softmax
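Supervised fine-tuning with a softmax layer on top can be sketched as follows. This is only a sketch under assumptions: the "pretrained" weights are random stand-ins (in practice they would come from the RBM stack), the toy labels are invented for the example, and training uses plain full-batch gradient descent with backpropagation. All names are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def fine_tune_step(x, y_onehot, weights, biases, W_out, b_out, lr=0.1):
    """One backprop step through the sigmoid stack plus softmax; updates in place."""
    acts = [x]
    for W, c in zip(weights, biases):
        acts.append(sigmoid(acts[-1] @ W + c))
    probs = softmax(acts[-1] @ W_out + b_out)
    loss = -np.mean(np.sum(y_onehot * np.log(probs + 1e-12), axis=1))
    # Gradient of mean cross-entropy with respect to the softmax logits.
    delta = (probs - y_onehot) / x.shape[0]
    grad_W_out = acts[-1].T @ delta
    grad_b_out = delta.sum(axis=0)
    delta = (delta @ W_out.T) * acts[-1] * (1.0 - acts[-1])
    W_out -= lr * grad_W_out
    b_out -= lr * grad_b_out
    for i in reversed(range(len(weights))):
        grad_W = acts[i].T @ delta
        grad_c = delta.sum(axis=0)
        if i > 0:  # propagate before this layer's weights are changed
            delta = (delta @ weights[i].T) * acts[i] * (1.0 - acts[i])
        weights[i] -= lr * grad_W
        biases[i] -= lr * grad_c
    return loss

rng = np.random.default_rng(0)
x = rng.integers(0, 2, size=(32, 8)).astype(float)
labels = (x[:, 0] + x[:, 1]).astype(int)  # toy labels with 3 classes
y_onehot = np.eye(3)[labels]

# Stand-ins for weights pretrained as a stack of RBMs (assumption).
weights = [rng.normal(0.0, 0.1, size=(8, 6)), rng.normal(0.0, 0.1, size=(6, 4))]
biases = [np.zeros(6), np.zeros(4)]
W_out = rng.normal(0.0, 0.1, size=(4, 3))  # new softmax layer
b_out = np.zeros(3)

losses = [fine_tune_step(x, y_onehot, weights, biases, W_out, b_out) for _ in range(50)]
print(losses[0], losses[-1])
```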




Thank You!
