extreme, non-parametric object recognition

21
Extreme, Non-parametric Object Recognition 80 million tiny images (Torralba et al)

Upload: ewa

Post on 03-Feb-2016

23 views

Category:

Documents


0 download

DESCRIPTION

Extreme, Non-parametric Object Recognition. 80 million tiny images (Torralba et al). Our World is Boring…. Slide by Antonio Torralba. Lots Of Images. A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008. Lots Of Images. A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008. Lots Of Images. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Extreme, Non-parametric Object Recognition

Extreme, Non-parametric Object Recognition

80 million tiny images (Torralba et al)

Page 2: Extreme, Non-parametric Object Recognition

Our World is Boring…

Slide by Antonio Torralba

Page 3: Extreme, Non-parametric Object Recognition

Lots

Of

Images

A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008

Page 4: Extreme, Non-parametric Object Recognition

Lots

Of

Images

A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008

Page 5: Extreme, Non-parametric Object Recognition

Lots

Of

Images

Page 6: Extreme, Non-parametric Object Recognition

Automatic Colorization Result

Grayscale input High resolution

Colorization of input using average

A. Torralba, R. Fergus, W.T.Freeman. 2008

Page 7: Extreme, Non-parametric Object Recognition

Automatic Orientation

• Many images have ambiguous orientation

• Look at top 25% by confidence:

• Examples of high and low confidence images:

Slide by Antonio Torralba

Page 8: Extreme, Non-parametric Object Recognition

Automatic Orientation Examples

A. Torralba, R. Fergus, W.T.Freeman. 2008

Page 9: Extreme, Non-parametric Object Recognition

What If we have Labels…

Page 10: Extreme, Non-parametric Object Recognition
Page 11: Extreme, Non-parametric Object Recognition

Are 32x32 images enough?

Page 12: Extreme, Non-parametric Object Recognition

10% of the objects account for 90% of the data

~Zipf’s law

Caltech 101

Tiny images

LabelMe

Slide by Antonio Torralba

Page 13: Extreme, Non-parametric Object Recognition

Do people do this?

Page 14: Extreme, Non-parametric Object Recognition

What’s the Capacity of Visual Long Term Memory?

“Basically, my recollection is that we just separated the pictures into distinct thematic categories: e.g. cars, animals, single-person, 2-people, plants, etc.) Only a few slides were selected which fell into each category, and they were visually distinct.”

According to Standing

Standing (1973)

10,000 images

83% Recognition

What we know… What we don’t know…

Sparse Details

Dogs Dogs Playing CardsPlaying Cards

“Gist” Only Highly Detailed

… people can remember thousands

of images

… what people are remembering for each item?

High Fidelity Visual Memory is possible (Hollingworth 2004)

Slide by Aude Oliva

Page 15: Extreme, Non-parametric Object Recognition

Massive Memory I: Methods

... ......

Showed 14 observers 2500 categorically unique objects

1 at a time, 3 seconds each

800 ms blank between items

Study session lasted about 5.5 hours

Repeat Detection task to maintain focus

1-back

Followed by 300 2-alternative forced choice tests

1024-back

Slide by Aude Oliva

Page 16: Extreme, Non-parametric Object Recognition

Slide by Aude Oliva

Page 17: Extreme, Non-parametric Object Recognition

how far can we push the fidelity of visual LTM representation ?

Same object, different states

Slide by Aude Oliva

Page 18: Extreme, Non-parametric Object Recognition

Visual CognitionExpert Predictions

92%

Massive Memory I: Recognition Memory Results

Replication of Standing (1973)

Slide by Aude Oliva

Page 19: Extreme, Non-parametric Object Recognition

92% 88% 87%

Massive Memory I: Recognition Memory Results

Slide by Aude Oliva

Page 20: Extreme, Non-parametric Object Recognition

Extrapolation of Repeat Detection Data

Human performances for n = 1024

Power law(r2=.988)

Quadratic (r2=.988)

Brady, Konkle, Alvarez, Oliva (submitted) Slide by Aude Oliva

Page 21: Extreme, Non-parametric Object Recognition

Past and future of image datasets in computer vision

Lenaa dataset in one picture

1972

100

105

1010

1020

Number of

pictures

1015

Human Click Limit(all humanity takingone picture/secondduring 100 years)

Time 1996

40.000

COREL

2007

2 billion

2020?

Slide by Antonio Torralba