
Deep Learning - Theory and Practice

20-02-2020: Linear Regression, Least Squares Classification and Logistic Regression

http://leap.ee.iisc.ac.in/sriram/teaching/DL20/

deeplearning.cce2020@gmail.com

Least Squares for Classification

Bishop - PRML book (Chap 4)

❖ K-class classification problem

❖ With 1-of-K hot encoding, and least squares regression
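A minimal sketch (not from the slides) of least-squares classification with 1-of-K target encoding, solving W = (Phi^T Phi)^{-1} Phi^T T via the pseudo-inverse; the toy data and variable names are illustrative only.

import numpy as np

# Toy data: 6 points in 2-D, K = 3 classes (values are made up).
X = np.array([[0.0, 0.1], [0.2, 0.0],
              [1.0, 1.1], [1.2, 0.9],
              [2.0, 2.1], [2.2, 1.9]])
labels = np.array([0, 0, 1, 1, 2, 2])
K = 3

# 1-of-K (one-hot) target encoding.
T = np.eye(K)[labels]                              # shape (N, K)

# Augment inputs with a constant bias feature.
Phi = np.hstack([np.ones((X.shape[0], 1)), X])     # shape (N, D+1)

# Least-squares solution W = (Phi^T Phi)^{-1} Phi^T T via the pseudo-inverse.
W = np.linalg.pinv(Phi) @ T                        # shape (D+1, K)

# Classify by taking the arg-max over the K linear outputs.
print(np.argmax(Phi @ W, axis=1))                  # expected: [0 0 1 1 2 2]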

Logistic Regression

Bishop - PRML book (Chap 4)

❖ 2-class logistic regression

❖ K-class logistic regression

❖ Maximum likelihood solution (for the 2-class and the K-class case)
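A hedged sketch of the 2-class maximum-likelihood fit by gradient descent on the negative log-likelihood (the slides may instead use Newton/IRLS); the toy data and the fixed step size are assumptions.

import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

# Toy 2-class data with binary targets (values are made up).
X = np.array([[0.2, 0.1], [0.4, 0.3], [1.8, 1.9], [2.1, 1.7]])
t = np.array([0.0, 0.0, 1.0, 1.0])

Phi = np.hstack([np.ones((X.shape[0], 1)), X])     # add a bias feature
w = np.zeros(Phi.shape[1])

# Gradient of the negative log-likelihood (cross-entropy): Phi^T (y - t).
for _ in range(2000):
    y = sigmoid(Phi @ w)
    w -= 0.1 * (Phi.T @ (y - t))                   # fixed step size (assumed)

print(np.round(sigmoid(Phi @ w), 2))               # probabilities close to the targets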

Typical Error Surfaces

Typical error surface as a function of the parameters (weights and biases)

Learning with Gradient Descent

Error surface close to a local optimum

Learning Using Gradient Descent

Parameter Learning

• Solving a non-convex optimization.
• Iterative solution.
• Depends on the initialization.
• Convergence to a local optimum.
• Judicious choice of learning rate (see the sketch below).
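A minimal gradient-descent sketch illustrating the points above; the 1-D non-convex error surface, the initializations and the learning rate are all made up for illustration.

import numpy as np

def error(w):
    # Illustrative 1-D non-convex error surface (not from the slides).
    return np.sin(3.0 * w) + 0.5 * w ** 2

def grad(w):
    return 3.0 * np.cos(3.0 * w) + w

def gradient_descent(w0, lr=0.05, steps=200):
    w = w0
    for _ in range(steps):
        w = w - lr * grad(w)          # iterative update against the gradient
    return w

# Different initializations can converge to different local optima.
for w0 in (-2.0, 0.0, 2.0):
    w = gradient_descent(w0)
    print(w0, "->", round(w, 3), "error", round(error(w), 3))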

Least Squares versus Logistic Regression

Bishop - PRML book (Chap 4)

Least Squares versus Logistic Regression

Bishop - PRML book (Chap 4)

Perceptron Algorithm

What if the data is not linearly separable?

Perceptron Model [McCulloch & Pitts, 1943; Rosenblatt, 1957]

Targets are binary classes {-1, +1}
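A sketch of the classic perceptron learning rule w <- w + eta * t_n * x_n applied to misclassified points, with targets in {-1, +1}; the separable toy data and eta = 1 are illustrative.

import numpy as np

# Linearly separable toy data with targets in {-1, +1}.
X = np.array([[0.0, 0.2], [0.3, 0.1], [2.0, 2.2], [2.3, 1.9]])
t = np.array([-1, -1, 1, 1])

Phi = np.hstack([np.ones((X.shape[0], 1)), X])   # bias feature
w = np.zeros(Phi.shape[1])
eta = 1.0

# Cycle through the data, updating only on misclassified points.
for _ in range(20):
    for x_n, t_n in zip(Phi, t):
        if np.sign(w @ x_n) != t_n:
            w = w + eta * t_n * x_n

print(np.sign(Phi @ w))                          # expected: [-1. -1.  1.  1.]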

Multi-layer Perceptron [Hopfield, 1982]

❖ The hard thresholding function is replaced by a non-linear activation function (tanh, sigmoid)

Neural Networks


• Useful for learning non-linear decision boundaries: non-linear class separation can be realized given enough data.
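A minimal forward-pass sketch of a one-hidden-layer perceptron with a tanh non-linearity, only to show the computation y = W2 * tanh(W1 * x + b1) + b2; the layer sizes and random weights are placeholders.

import numpy as np

rng = np.random.default_rng(0)

# One hidden layer with a tanh non-linearity; sizes are placeholders.
D, H, K = 2, 8, 3                              # input, hidden, output dimensions
W1, b1 = rng.normal(size=(H, D)), np.zeros(H)
W2, b2 = rng.normal(size=(K, H)), np.zeros(K)

def mlp_forward(x):
    h = np.tanh(W1 @ x + b1)                   # non-linear hidden activations
    return W2 @ h + b2                         # linear output layer

print(mlp_forward(np.array([0.5, -1.0])))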

Neural Networks

Types of Non-linearities: tanh, sigmoid, ReLU
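Standard definitions of the three non-linearities, written as a small sketch (the plots on the slide are not reproduced):

import numpy as np

def tanh(a):
    return np.tanh(a)                          # output in (-1, 1)

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))            # output in (0, 1)

def relu(a):
    return np.maximum(0.0, a)                  # zero for negative inputs, identity otherwise

a = np.array([-2.0, 0.0, 2.0])
print(tanh(a), sigmoid(a), relu(a))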

Cost-Function

❖ Mean Square Error: E = (1/2) Σ_n ||y_n − t_n||²

❖ Cross Entropy: E = − Σ_n Σ_k t_nk log(y_nk)

where t_n are the desired (target) outputs and y_n are the network outputs
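A sketch of the two cost functions in code, assuming y_n are network outputs and t_n one-hot targets; averaging over the batch is an assumed convention, not taken from the slides.

import numpy as np

def mean_square_error(y, t):
    # 0.5 * squared distance between outputs and targets, averaged over the batch.
    return 0.5 * np.mean(np.sum((y - t) ** 2, axis=1))

def cross_entropy(y, t, eps=1e-12):
    # -sum_k t_k log y_k, averaged over the batch; eps guards against log(0).
    return -np.mean(np.sum(t * np.log(y + eps), axis=1))

# Toy batch: 2 examples, 3 classes, one-hot targets (values are made up).
t = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
y = np.array([[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]])
print(mean_square_error(y, t), cross_entropy(y, t))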


Learning Posterior Probabilities with NNs

Choice of target function

• Softmax function for classification.
• Softmax produces positive values that sum to 1.
• Allows the interpretation of outputs as posterior probabilities (see the sketch below).
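A numerically stable softmax sketch illustrating the bullets above; the max-subtraction trick is a standard implementation detail, not stated on the slide.

import numpy as np

def softmax(a):
    # Subtracting the max is a stability trick; it does not change the result.
    e = np.exp(a - np.max(a))
    return e / np.sum(e)

scores = np.array([2.0, 1.0, 0.1])             # illustrative pre-softmax activations
p = softmax(scores)
print(p, p.sum())                              # positive values that sum to 1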
