Source: vicente/vision/2018/slides/... (2018)
CS4501: Introduction to Computer VisionNeural Networks +Backpropagation
Last Class
• Softmax Classifier
• Generalization / Overfitting
• PyTorch

Today's Class
• Global Features
• The perceptron model
• Neural Networks: the multilayer perceptron (MLP) model
• Backpropagation
Supervised Machine Learning Steps
Training: training images → image features → training (with training labels) → learned model.
Testing: test image → image features → learned model → prediction.
Slide credit: D. Hoiem
Supervised Learning – Softmax Classifier

Extract features:
x_i = [x_{i,1}, x_{i,2}, x_{i,3}, x_{i,4}]

Run features through the classifier:
f_1 = w_{1,1} x_{i,1} + w_{1,2} x_{i,2} + w_{1,3} x_{i,3} + w_{1,4} x_{i,4} + b_1
f_2 = w_{2,1} x_{i,1} + w_{2,2} x_{i,2} + w_{2,3} x_{i,3} + w_{2,4} x_{i,4} + b_2
f_3 = w_{3,1} x_{i,1} + w_{3,2} x_{i,2} + w_{3,3} x_{i,3} + w_{3,4} x_{i,4} + b_3

y_1 = e^{f_1} / (e^{f_1} + e^{f_2} + e^{f_3})
y_2 = e^{f_2} / (e^{f_1} + e^{f_2} + e^{f_3})
y_3 = e^{f_3} / (e^{f_1} + e^{f_2} + e^{f_3})

Get predictions: ŷ_i = [y_1, y_2, y_3]
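The softmax classifier above can be sketched directly in plain Python (no framework). The feature vector, weights, and biases below are made-up values for illustration, not numbers from the course:

```python
import math

def softmax_classifier(x, W, b):
    """Linear scores f_c = sum_i W[c][i] * x[i] + b[c], then softmax over the scores."""
    scores = [sum(w_ci * x_i for w_ci, x_i in zip(W_c, x)) + b_c
              for W_c, b_c in zip(W, b)]
    exps = [math.exp(f) for f in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical 4-dim feature and weights for 3 classes.
x = [0.5, -1.0, 2.0, 0.1]
W = [[0.2, -0.3, 0.1, 0.0],
     [-0.1, 0.4, 0.0, 0.2],
     [0.0, 0.1, -0.2, 0.3]]
b = [0.1, -0.1, 0.0]
y_hat = softmax_classifier(x, W, b)   # predicted class probabilities
```

The exponential/normalize step guarantees the outputs are positive and sum to 1, so they can be read as class probabilities.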
Last Class: (mini-batch) Stochastic Gradient Descent (SGD)

L(w, b) = Σ_{i∈B} −log f_{label_i}(x_i; w, b)
λ = 0.01

Initialize w and b randomly
for e = 0, num_epochs do
    for b = 0, num_batches do      // B is a small set of training examples
        Compute: ∂L(w, b)/∂w and ∂L(w, b)/∂b
        Update w: w = w − λ ∂L(w, b)/∂w
        Update b: b = b − λ ∂L(w, b)/∂b
        Print: L(w, b)             // useful to see if the loss is becoming smaller or not
    end
end
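The mini-batch SGD loop can be sketched on a toy problem. This is an illustrative stand-in, not the course's PyTorch code: it fits a single weight and bias to noiseless data y = 2x + 1 with a squared-error loss (the slide uses a log loss; the loop structure is the same):

```python
import random

random.seed(0)

# Toy data generated from y = 2x + 1, so SGD should drive w -> 2 and b -> 1.
data = [(x / 10.0, 2 * (x / 10.0) + 1) for x in range(11)]

def grads(w, b, batch):
    # Mean gradient of the squared error (w*x + b - y)^2 over the mini-batch B.
    dw = sum(2 * (w * x + b - y) * x for x, y in batch) / len(batch)
    db = sum(2 * (w * x + b - y) for x, y in batch) / len(batch)
    return dw, db

lr = 0.1                                      # learning rate (lambda on the slide)
w, b = random.uniform(-1, 1), random.uniform(-1, 1)   # initialize w and b randomly
for epoch in range(1000):
    random.shuffle(data)
    for s in range(0, len(data), 2):          # mini-batches of 2 examples
        dw, db = grads(w, b, data[s:s + 2])
        w -= lr * dw                          # w = w - lambda * dL/dw
        b -= lr * db                          # b = b - lambda * dL/db
```

Printing the loss once per epoch, as the slide suggests, is the simplest way to see whether training is actually making progress.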
Generalization
• Generalization refers to the ability to correctly classify never-before-seen examples.
• It can be controlled by turning "knobs" that affect the complexity of the model.
Training set (labels known) Test set (labels unknown)
Overfitting
!"## $ is high !"## $ is low !"## $ is zero!
OverfittingUnderfittingHigh Bias High Variance
% is linear % is cubic% is a polynomial of
degree 9
Pytorch: Project Assignment 4
• http://vicenteordonez.com/vision/
Image Features
• In your Project 4 (Nearest Neighbors + Softmax Classifier), the features are the raw pixels:
Image: 3x32x32 → Feature: 3072-dim vector
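The raw-pixel feature is just the 3x32x32 image unrolled into one long vector. A minimal sketch of that flattening in plain Python (the image here is a placeholder of zeros):

```python
# A 3x32x32 image (channels x height x width) flattened into a 3072-dim vector,
# mirroring the raw-pixel features used in the project.
image = [[[0.0 for _ in range(32)] for _ in range(32)] for _ in range(3)]
feature = [px for channel in image for row in channel for px in row]
```

In PyTorch the same operation is a single reshape of the image tensor to a 1-D vector.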
Image Features: Color
Photo by: marielito
slide by Tamara L. Berg
Image Features: Color
Image Features: Color
However, these are all images of people, yet the colors in each image are very different.
Image Features: HoG
Scikit-image implementation
Paper by Navneet Dalal & Bill Triggs presented at CVPR 2005 for detecting people.
Image Features: HoG
Paper by Navneet Dalal & Bill Triggs presented at CVPR 2005 for detecting people.
Figure from Zhuolin Jiang, Zhe Lin, Larry S. Davis, ICCV 2009 for human action recognition.
+ Block Normalization
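The core idea of HoG can be sketched for a single cell: histogram the gradient orientations, weighted by gradient magnitude, then normalize. This is a simplified illustration, not the full Dalal & Triggs pipeline (which uses overlapping blocks of cells and interpolated binning); scikit-image's `hog` implements the real thing:

```python
import math

def orientation_histogram(cell, n_bins=9):
    """Histogram of gradient orientations (0..180 degrees), weighted by gradient
    magnitude, over one cell of a grayscale image given as a 2-D list."""
    h, w = len(cell), len(cell[0])
    hist = [0.0] * n_bins
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = cell[y][x + 1] - cell[y][x - 1]      # central differences
            gy = cell[y + 1][x] - cell[y - 1][x]
            mag = math.hypot(gx, gy)
            ang = math.degrees(math.atan2(gy, gx)) % 180.0   # unsigned orientation
            hist[int(ang / 180.0 * n_bins) % n_bins] += mag
    # L2 normalization (in full HoG this is done over a block of cells).
    norm = math.sqrt(sum(v * v for v in hist)) or 1.0
    return [v / norm for v in hist]

# A vertical edge: left half dark, right half bright. The gradient is horizontal,
# so all the mass should land in the 0-degree bin.
cell = [[0.0] * 4 + [1.0] * 4 for _ in range(8)]
hist = orientation_histogram(cell)
```

Because the descriptor is built from normalized gradient statistics rather than raw colors, it is robust to the color variation shown in the previous slides.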
Image Features: GIST
The “gist” of a scene: Oliva & Torralba, 2001
Image Features: GIST
Oriented edge response at multiple scales (5 spatial scales, 6 edge orientations). Hays and Efros, SIGGRAPH 2007.
Image Features: GIST
Aggregated edge responses over 4x4 windows. Hays and Efros, SIGGRAPH 2007.
Image Features: Bag of (Visual) Words Representation
slide by Fei-fei Li
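The bag-of-visual-words representation assigns every local descriptor in an image to its nearest codeword and counts the assignments. A minimal sketch of that quantization step, with hypothetical 2-D "descriptors" and a tiny hand-written codebook (in practice the codebook is learned by k-means over many training descriptors):

```python
def bovw_histogram(descriptors, codebook):
    """Assign each local descriptor to its nearest codeword (squared Euclidean
    distance) and count assignments: the image becomes a histogram over the
    visual vocabulary."""
    def d2(p, q):
        return sum((a - b) ** 2 for a, b in zip(p, q))
    hist = [0] * len(codebook)
    for desc in descriptors:
        hist[min(range(len(codebook)), key=lambda k: d2(desc, codebook[k]))] += 1
    return hist

# Hypothetical codebook of 3 visual words and 4 descriptors from one image.
codebook = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
descriptors = [[0.1, 0.1], [0.9, 0.1], [0.2, 0.9], [0.1, 0.8]]
hist = bovw_histogram(descriptors, codebook)   # -> [1, 1, 2]
```

The resulting fixed-length histogram can be fed to any classifier, regardless of how many local descriptors the image produced.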
Summary: Image Features
• Many other features have been proposed:
  • LBP (Local Binary Patterns): useful for recognizing faces.
  • Dense SIFT: SIFT features computed on a grid, similar to the HoG features.
  • etc.
• Largely replaced by neural networks.
• Still useful to study for inspiration when designing neural networks that compute features.
Perceptron Model
Frank Rosenblatt (1957) - Cornell University
More: https://en.wikipedia.org/wiki/Perceptron

f(x) = 1, if Σ_{i=1}^{n} w_i x_i + b > 0
       0, otherwise

Inputs x_1 ... x_4 are weighted by w_1 ... w_4, summed with the bias b, and passed through the step activation function.
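Rosenblatt's unit is a one-line computation. A plain-Python sketch, with hypothetical weights chosen so the perceptron implements logical AND over binary inputs:

```python
def perceptron(x, w, b):
    """Output 1 if sum_i w_i * x_i + b > 0, else 0 (step activation)."""
    s = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if s > 0 else 0

# Hypothetical weights implementing logical AND: the sum exceeds the
# threshold only when both inputs are 1.
w, b = [1.0, 1.0], -1.5
out = [perceptron([x1, x2], w, b) for x1 in (0, 1) for x2 in (0, 1)]
# out == [0, 0, 0, 1]
```

A single perceptron can only represent linearly separable functions like AND; that limitation is what motivates stacking units into multilayer networks later in the lecture.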
Activation Functions
• Step(x)
• Sigmoid(x) = 1 / (1 + e^{−x})
• Tanh(x)
• ReLU(x) = max(0, x)
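The four activation functions on the slide are each a one-liner in plain Python:

```python
import math

def step(x):
    """Hard threshold used by the original perceptron."""
    return 1.0 if x > 0 else 0.0

def sigmoid(x):
    """Squashes any real input into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def tanh(x):
    """Squashes any real input into (-1, 1)."""
    return math.tanh(x)

def relu(x):
    """Passes positive inputs through, zeroes out negatives."""
    return max(0.0, x)
```

Unlike the step function, sigmoid, tanh, and ReLU have useful derivatives (almost) everywhere, which is what makes gradient-based training of multilayer networks possible.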
Pytorch - Perceptron
Two-layer Multi-layer Perceptron (MLP)

Inputs x_1 ... x_4 feed a "hidden" layer of units a_1 ... a_4; the hidden activations feed an output unit producing ŷ_1, which is compared against the target y_1 by a loss / criterion.
Forward pass

z_j = Σ_i w(1)_{j,i} x_i + b(1)_j
a_j = Sigmoid(z_j)
u = Σ_j w(2)_j a_j + b(2)
ŷ_1 = Sigmoid(u)
Loss = L(ŷ_1, y_1)
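The forward pass above translates line for line into plain Python. This is a framework-free sketch (PyTorch does the same with `nn.Linear` and `torch.sigmoid`); the input and weight values are made up for illustration:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward(x, W1, b1, w2, b2):
    """Two-layer MLP forward pass:
    z_j = sum_i W1[j][i] * x[i] + b1[j];  a_j = sigmoid(z_j)
    u   = sum_j w2[j] * a_j + b2;         y_hat = sigmoid(u)"""
    z = [sum(wji * xi for wji, xi in zip(W1_j, x)) + b1_j
         for W1_j, b1_j in zip(W1, b1)]
    a = [sigmoid(zj) for zj in z]
    u = sum(w2j * aj for w2j, aj in zip(w2, a)) + b2
    return sigmoid(u)

# Hypothetical small network: 4 inputs, 2 hidden units, 1 output.
x  = [1.0, -2.0, 0.5, 0.0]
W1 = [[0.1, 0.2, -0.1, 0.0], [-0.2, 0.1, 0.3, 0.1]]
b1 = [0.0, 0.1]
w2 = [0.5, -0.5]
b2 = 0.1
y_hat = forward(x, W1, b1, w2, b2)
```

Every intermediate value (z, a, u) computed here is exactly what the backward pass will need to reuse, which is why implementations cache them.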
Backward pass

∂L/∂ŷ_1 = ∂L(ŷ_1, y_1)/∂ŷ_1
∂L/∂u = Sigmoid′(u) · ∂L/∂ŷ_1
∂L/∂w(2)_j = a_j · ∂L/∂u          (GradParams)
∂L/∂a_j = w(2)_j · ∂L/∂u
∂L/∂z_j = Sigmoid′(z_j) · ∂L/∂a_j
∂L/∂w(1)_{j,i} = x_i · ∂L/∂z_j     (GradParams)
∂L/∂x_i = Σ_j w(1)_{j,i} · ∂L/∂z_j  (GradInputs)
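The backward-pass equations can be checked concretely. The sketch below runs the forward pass, then applies the chain rule exactly as on the slide, using Sigmoid′(z) = σ(z)(1 − σ(z)). It assumes a squared-error criterion L = ½(ŷ − y)², which is one of several losses the slides use; the numbers are made up:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward_backward(x, W1, b1, w2, b2, y):
    # Forward pass, caching intermediates for the backward pass.
    z = [sum(w * xi for w, xi in zip(row, x)) + bj for row, bj in zip(W1, b1)]
    a = [sigmoid(zj) for zj in z]
    u = sum(w * aj for w, aj in zip(w2, a)) + b2
    y_hat = sigmoid(u)
    loss = 0.5 * (y_hat - y) ** 2                  # squared-error criterion (assumed)
    # Backward pass: chain rule, as on the slide.
    dL_dyhat = y_hat - y
    dL_du = dL_dyhat * y_hat * (1 - y_hat)         # Sigmoid'(u) = y_hat * (1 - y_hat)
    dL_dw2 = [aj * dL_du for aj in a]              # GradParams, layer 2
    dL_da = [w * dL_du for w in w2]
    dL_dz = [daj * aj * (1 - aj) for daj, aj in zip(dL_da, a)]
    dL_dW1 = [[xi * dzj for xi in x] for dzj in dL_dz]          # GradParams, layer 1
    dL_dx = [sum(W1[j][i] * dL_dz[j] for j in range(len(W1)))
             for i in range(len(x))]               # GradInputs
    return loss, dL_dw2, dL_dW1, dL_dx

# Hypothetical values: 4 inputs, 2 hidden units, target y = 1.
x, y = [1.0, -2.0, 0.5, 0.0], 1.0
W1 = [[0.1, 0.2, -0.1, 0.0], [-0.2, 0.1, 0.3, 0.1]]
b1, w2, b2 = [0.0, 0.1], [0.5, -0.5], 0.1
loss, dL_dw2, dL_dW1, dL_dx = forward_backward(x, W1, b1, w2, b2, y)
```

A standard sanity check is to compare each analytic gradient against a finite-difference estimate; they should agree to many decimal places, which is how hand-written backprop code is usually debugged.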
Pytorch – Two-layer MLP + Regression
Pytorch – Two-layer MLP + LogSoftmax
# of Hidden Units
Pytorch – Two-layer MLP + LogSoftmax
LogSoftmax + Negative Likelihood Loss
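What PyTorch's `nn.LogSoftmax` followed by `nn.NLLLoss` computes can be written out in a few lines of framework-free Python (the combination is equivalent to cross-entropy loss on the raw scores; the example scores are made up):

```python
import math

def log_softmax(scores):
    """log_softmax(f)_c = f_c - log(sum_k exp(f_k)), computed stably by
    subtracting the max score before exponentiating."""
    m = max(scores)
    lse = m + math.log(sum(math.exp(f - m) for f in scores))
    return [f - lse for f in scores]

def nll_loss(log_probs, target):
    """Negative log-likelihood of the target class."""
    return -log_probs[target]

scores = [2.0, 1.0, 0.1]      # hypothetical classifier scores for 3 classes
lp = log_softmax(scores)
loss = nll_loss(lp, 0)        # loss when the true class is class 0
```

Working in log space avoids the overflow/underflow problems of exponentiating large scores, which is why the log-softmax + NLL pairing is preferred over computing softmax probabilities and then taking a log.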
Bookmark Opportunity!
Questions?