object recognition: overview and historylazebnik/spring11/lec18_recognition_intro.pdf ·...

53
Object Recognition: Overview and History Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Upload: others

Post on 21-Jul-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Object Recognition: Overview and History

Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Page 2: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

How many visual object categories are there?

Biederman 1987

Page 3: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception
Page 4: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

OBJECTS

ANIMALS INANIMATEPLANTS

MAN-MADENATURALVERTEBRATE…..

MAMMALS BIRDS

GROUSEBOARTAPIR CAMERA

Page 5: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Specific recognition tasks

Page 6: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Scene categorization• outdoor/indoor• outdoor/indoor• city/forest/factory/etc.

Page 7: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Image annotation/tagging

• street• people• building• mountainmountain• …

Page 8: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Object detection• find pedestrians• find pedestrians

Page 9: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Image parsingsky

mountainsky

t

building

buildingtree

bbanner

street lamp

market

street lamp

marketpeople

Page 10: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Image understanding?

Page 11: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Recognition is all about modeling variability

Variability: Camera positionIlluminationShape parametersShape parameters

Within-class variations?

Page 12: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Within-class variationsWithin class variations

Page 13: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

History of ideas in recognitionHistory of ideas in recognition

• 1960s – early 1990s: the geometric era1960s early 1990s: the geometric era

Page 14: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

θθ

Variability: Camera positionIllumination Alignment

Shape: assumed known

Roberts (1965); Lowe (1987); Faugeras & Hebert (1986); Grimson & Lozano-Perez (1986); Huttenlocher & Ullman (1987)

p

Page 15: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Recall: Alignment

• Alignment: fitting a model to a transformationAlignment: fitting a model to a transformation between pairs of features (matches) in two images

∑ ′

Find transformation Tthat minimizesT

xixi

'

∑ ′i

ii xxT )),((residual

Page 16: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Recognition as an alignment problem:Block worldBlock world

L. G. Roberts, Machine Perception of Three Dimensional Solids, Ph.D. hthesis, MIT Department of Electrical Engineering, 1963.

J. Mundy, Object Recognition in the Geometric Era: a Retrospective, 2006

Page 17: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Alignment: Huttenlocher & Ullman (1987)

Page 18: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Variability Camera positionInvariance to:y pIlluminationInternal parameters

Duda & Hart ( 1972); Weiss (1987); Mundy et al. (1992-94);Rothwell et al. (1992); Burns et al. (1993)

Page 19: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Example: invariant to similarity transformations computed from four Btransformations computed from four points C

D

A

D

Projective invariants (Rothwell et al., 1992):

General 3D objects do not admit monocular viewpoint invariants (Burns et al., 1993)

Page 20: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Representing and recognizing object categoriesi h dis harder...

ACRONYM (Brooks and Binford 1981)ACRONYM (Brooks and Binford, 1981)Binford (1971), Nevatia & Binford (1972), Marr & Nishihara (1978)

Page 21: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Recognition by components

P i iti ( ) Obj t

Biederman (1987)

Primitives (geons) Objects

http://en.wikipedia.org/wiki/Recognition_by_Components_Theory

Page 22: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

General shape primitives?

Generali ed c lindersGeneralized cylindersPonce et al. (1989)

Zisserman et al. (1995) Forsyth (2000)

Page 23: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

History of ideas in recognitionHistory of ideas in recognition

• 1960s – early 1990s: the geometric era1960s early 1990s: the geometric era• 1990s: appearance-based models

Page 24: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Empirical models of image variability

Appearance-based techniques

Turk & Pentland (1991); Murase & Nayar (1995); etc.

Page 25: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Eigenfaces (Turk & Pentland, 1991)

Page 26: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Color HistogramsColor Histograms

Swain and Ballard, Color Indexing, IJCV 1991.

Page 27: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Appearance manifoldspp

H. Murase and S. Nayar, Visual learning and recognition of 3-d objects from appearance, IJCV 1995

Page 28: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Limitations of global appearance d lmodels

• Requires global registration of patternsRequires global registration of patterns• Not robust to clutter, occlusion, geometric

transformations

Page 29: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

History of ideas in recognitionHistory of ideas in recognition

• 1960s – early 1990s: the geometric era1960s early 1990s: the geometric era• 1990s: appearance-based models• 1990s – present: sliding window approaches1990s present: sliding window approaches

Page 30: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Sliding window approachesg pp

Page 31: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Sliding window approachesSliding window approaches

• Turk and Pentland 1991Turk and Pentland, 1991• Belhumeur, Hespanha, &

Kriegman, 1997• Schneiderman & Kanade 2004Schneiderman & Kanade 2004• Viola and Jones, 2000

• Schneiderman & Kanade, 2004• Argawal and Roth, 2002• Poggio et al. 1993

Page 32: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

History of ideas in recognitionHistory of ideas in recognition

• 1960s – early 1990s: the geometric era1960s early 1990s: the geometric era• 1990s: appearance-based models• Mid-1990s: sliding window approachesMid-1990s: sliding window approaches• Late 1990s: local features

Page 33: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Local features for object instance recognition

D. Lowe (1999, 2004)

Page 34: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Large-scale image searchCombining local features, indexing, and spatial constraintsCombining local features, indexing, and spatial constraints

Image credit: K. Grauman and B. Leibe

Page 35: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Large-scale image searchCombining local features, indexing, and spatial constraintsCombining local features, indexing, and spatial constraints

Philbin et al. ‘07

Page 36: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Large-scale image searchCombining local features, indexing, and spatial constraintsCombining local features, indexing, and spatial constraints

Page 37: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

History of ideas in recognitionHistory of ideas in recognition

• 1960s – early 1990s: the geometric era1960s early 1990s: the geometric era• 1990s: appearance-based models• Mid-1990s: sliding window approachesMid-1990s: sliding window approaches• Late 1990s: local features• Early 2000s: parts and shape models• Early 2000s: parts-and-shape models

Page 38: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Parts-and-shape modelsParts and shape models• Model:

– Object as a set of parts– Relative locations between parts– Appearance of part

Figure from [Fischler & Elschlager 73]

Page 39: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Constellation models

Weber, Welling & Perona (2000), Fergus, Perona & Zisserman (2003)

Page 40: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Representing peopleRepresenting people

Page 41: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

History of ideas in recognitionHistory of ideas in recognition

• 1960s – early 1990s: the geometric era1960s early 1990s: the geometric era• 1990s: appearance-based models• Mid-1990s: sliding window approachesMid-1990s: sliding window approaches• Late 1990s: local features• Early 2000s: parts and shape models• Early 2000s: parts-and-shape models• Mid-2000s: bags of features

Page 42: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Bag-of-features models

Obj tObj t Bag of Bag of

g

ObjectObject gg‘words’‘words’

Page 43: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Objects as textureObjects as texture• All of these are treated as being the sameg

• No distinction between foreground and o d st ct o bet ee o eg ou d a dbackground: scene recognition?

Page 44: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

History of ideas in recognitionHistory of ideas in recognition

• 1960s – early 1990s: the geometric era1960s early 1990s: the geometric era• 1990s: appearance-based models• Mid-1990s: sliding window approachesMid-1990s: sliding window approaches• Late 1990s: local features• Early 2000s: parts and shape models• Early 2000s: parts-and-shape models• Mid-2000s: bags of features• Present trends: combination of local and global• Present trends: combination of local and global

methods, data-driven methods, context

Page 45: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Global scene descriptorsGlobal scene descriptors

• The “gist” of a scene: Oliva & Torralba (2001)The gist of a scene: Oliva & Torralba (2001)

http://people.csail.mit.edu/torralba/code/spatialenvelope/

Page 46: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Data-driven methods

J. Hays and A. Efros, Scene Completion using Millions of Photographs, SIGGRAPH 2007

Page 47: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Data-driven methods

J. Tighe and S. Lazebnik, ECCV 2010

Page 48: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Geometric context

D Hoiem A Efros and M Herbert Putting Objects inD. Hoiem, A. Efros, and M. Herbert. Putting Objects in Perspective. CVPR 2006.

Page 49: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

Discriminatively trained part-based models

P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, "Object Detection with Discriminatively Trained Part-Based Models," PAMI 2009

Page 50: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

What “works” todayWhat works today

• Reading license plates, zip codes, checksReading license plates, zip codes, checks

Page 51: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

What “works” todayWhat works today

• Reading license plates, zip codes, checksReading license plates, zip codes, checks• Fingerprint recognition

Page 52: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

What “works” todayWhat works today

• Reading license plates, zip codes, checksReading license plates, zip codes, checks• Fingerprint recognition• Face detectionFace detection

Page 53: Object Recognition: Overview and Historylazebnik/spring11/lec18_recognition_intro.pdf · Recognition as an alignment problem: Block worldBlock world L. G. Roberts, Machine Perception

What “works” todayWhat works today

• Reading license plates, zip codes, checksReading license plates, zip codes, checks• Fingerprint recognition• Face detectionFace detection• Recognition of flat textured objects (CD covers,

book covers etc )book covers, etc.)