a bayesian model of visual attention - mit9.520/spring09/classes/06may2009... · 2010-01-22 ·...

24
A Bayesian Model of Visual Attention Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and Computational Learning, MIT

Upload: others

Post on 01-Aug-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

A Bayesian Model of Visual  Attention

Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio

Center for Biological and Computational Learning, MIT

Page 2: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Outline

• IntroductionLimitations of feed‐forward processing

Role of attention

• A computational model of attention

• ApplicationsModeling human eye‐movements 

Object recognition under clutter

Page 3: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Outline

• IntroductionLimitations of feed‐forward processing

Role of attention

• A computational model of attention

• ApplicationsModeling human eye‐movements 

Object recognition under clutter

Page 4: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Riesenhuber & Poggio 1999, 2000; Serre Kouh Cadieu Knoblich Kreiman & Poggio 2005; Serre Oliva Poggio 2007

*Modified from (Gross, 1998)

Feed‐forward processing

Page 5: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

see also Broadbent 1952 1954; Treisman 1960; Treisman & Gelade 1see also Broadbent 1952 1954; Treisman 1960; Treisman & Gelade 1980; Duncan & Desimone 1995; Wolfe, 1997; Tsotsos and  many othe980; Duncan & Desimone 1995; Wolfe, 1997; Tsotsos and  many othersrs

Zoccolan Kouh Poggio DiCarlo 2007 Reynolds Chelazzi & 

Desimone 1999Serre Oliva Poggio 2007

Role of attention

Parallel processing  (No attention) Serial processing (With attention)

Page 6: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Biology of attention

Page 7: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Feed-fowardFeature-attention

Rao 2005; Lee & Mumford 2003Rao 2005; Lee & Mumford 2003

Attention as Bayesian inference

OO

FiFi

FliFli

II

LL

NN

LIP/FEFLIP/FEF

V2V2

V4V4

ITIT

PFCPFC

Spatial attention

•We use a Bayesian framework to model 

the interaction between the ventral 

stream and LIP/FEF

•Feed forward connections

within the 

ventral stream are modeled as bottom‐up

evidence.  Feedback

connections from 

higher areas are modeled as top‐down

priors.

•The posterior probability of location 

generates a task‐based saliency map.

Page 8: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Model description

Fi

Fil

L

N

“What”“Where”

Feature-maps

Image

Feature-maps

Page 9: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Model properties: invariance

Fi

Fil

L

N

“What”“Where”

Page 10: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Model properties: crowding

Fi

Fil

L

N

“What”“Where”

Page 11: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Model: spatial attention

Fi

Fil

L

N

What is at location X?

X

* * *

Page 12: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Model: feature‐based attention

Fi

Fil

L

N

Where is object X?

X

* * *

Page 13: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Outline

• IntroductionLimitations of feed‐forward processing

Role of attention

• A computational model of attention

• ApplicationsModeling human eye‐movements 

Object recognition under clutter

Page 14: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Matching human eye‐movements

Car search Pedestrian search

Dataset100 CBCL street-scenes images having cars & pedestrians20 images with neither objects

Experiment8 subjects where shown these 120 images in random order. Each image in the stimuli-set was presented twiceThe subjects were asked to count the number of cars/pedestrians For each of these block trials, the subject’s eye movements were recorded using an infra-red eye tracker.

(Psychophysics done by Cheston T )

Page 15: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Model instantiation

OOO

FiFFii

FliFFllii

III

LLL

NN

LIP/FEFLIP/FEF

V2V2

V4V4

ITIT

PFCPFC

Page 16: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Examples

Page 17: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Examples

Page 18: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Quantitative evaluation: ROC

Page 19: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

0

0.25

0.5

0.75

1

car pedestrian

Humans Bottom-upTop-down (feature-based) Feaure-based + contextual cues

RO

Car

ea

Integrating (local) featureIntegrating (local) feature--based + (global) contextbased + (global) context--based based cues accounts for cues accounts for 92%92% of interof inter--subject agreement!subject agreement!

Chikkerur ,Tan Serre & Poggio (SFN Chikkerur ,Tan Serre & Poggio (SFN ‘‘09,VSS 09,VSS ‘‘09)09)

Quantitative evaluation: ROC

Page 20: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Outline

• IntroductionLimitations of feed‐forward processing

Role of attention

• A computational model of attention

• ApplicationsModeling human eye‐movements

Object recognition under clutter

Page 21: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Effect of clutter on detection

recognition without attention

recognition under attention

Page 22: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Scale and location prediction

Page 23: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Performance improves under  attention

0

1

2

3

Model Humans

perfo

rman

ce (d

perfo

rman

ce (d

’’ ))

no attentionno attention one shift of one shift of attentionattention

Tan, Tan, ChikkerurChikkerur , , SerreSerre & & PoggioPoggio (VSS (VSS ‘‘09)09)

Page 24: A Bayesian Model of Visual Attention - MIT9.520/spring09/Classes/06May2009... · 2010-01-22 · Sharat Chikkerur, Thomas Serre, Cheston Tan & Tomaso Poggio Center for Biological and

Thank you!

Questions?