vision sensing. multi-view stereo for community photo collections michael goesele, et al, iccv 2007...

37
Vision Sensing

Upload: emerald-pearson

Post on 20-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Vision Sensing

Page 2: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Multi-View Stereo for Community Photo CollectionsMichael Goesele, et al, ICCV 2007

Venus de Milo

Page 3: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

The Digital Michelangelo Project, Stanford

Page 4: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

How to sense 3D very accurately?

Page 5: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

How to sense 3D very accurately?

rangerangeacquisitionacquisition

contactcontact

transmissivetransmissive

reflectivereflectivenon-opticalnon-optical

opticaloptical

industrial CTindustrial CT

mechanical mechanical (CMM, jointed arm)(CMM, jointed arm)

radarradar

sonarsonar

MRIMRI

ultrasoundultrasound

Page 6: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

opticalopticalmethodsmethods

passivepassive

activeactive

shape from X:shape from X:stereostereomotionmotionshadingshadingtexturetexturefocusfocusdefocusdefocus

active variants of passive methodsactive variants of passive methodsStereo w. projected textureStereo w. projected textureActive depth from defocusActive depth from defocusPhotometric stereoPhotometric stereo

time of flighttime of flight

triangulationtriangulation

Page 7: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

CameraCameraCameraCamera

Triangulation

• Depth from ray-plane triangulation:• Intersect camera ray with light plane

LaserLaser

ObjectObject

Light Plane

Image Point

Page 8: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Example: Laser scanner

Cyberware® face and head scanner

+ very accurate < 0.01 mm − more than 10sec per scan

Page 9: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Digital Michelangelo Projecthttp://graphics.stanford.edu/projects/mich/

Example: Laser scanner

Page 10: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

XYZRGB

Page 11: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Shadow scanning

DeskLamp

Camera

Stick orpencil

Desk

http://www.vision.caltech.edu/bouguetj/ICCV98/

Page 12: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Basic idea

• Calibration issues:• where’s the camera wrt. ground plane?• where’s the shadow plane?

– depends on light source position, shadow edge

Page 13: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Two Plane Version

• Advantages• don’t need to pre-calibrate the light source• shadow plane determined from two shadow edges

Page 14: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Estimating shadow lines

Page 15: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Shadow scanning in action

Page 16: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

accuracy: 0.1mm over 10cm ~ 0.1% error

Results

Page 17: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Textured objects

Page 18: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

accuracy: 1mm over 50cm ~ 0.5% error

Scanning with the sun

Page 19: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

accuracy: 1cm over 2m~ 0.5% error

Scanning with the sun

Page 20: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Faster Acquisition?

• Project multiple stripes simultaneously• Correspondence problem: which stripe is which?

• Common types of patterns:

• Binary coded light striping

• Gray/color coded light striping

Page 21: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Binary Coding

Pattern 1

Pattern 2

Pattern 3

Projected over time

Example:

3 binary-encoded patterns which allows the measuring surface to be divided in 8 sub-regions

Faster:

stripes in images.12 n n

Page 22: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Binary Coding

• Assign each stripe a unique illumination codeover time [Posdamer 82]

SpaceSpace

TimeTime

Page 23: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Binary Coding

Pattern 1

Pattern 2

Pattern 3

Projected over time

Example: 7 binary patterns proposed by Posdamer &

Altschuler

Codeword of this píxel: 1010010 identifies the corresponding pattern stripe

Page 24: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

More complex patterns

Zhang et al

Works despite complex appearances

Works in real-time and on dynamic scenes

• Need very few images (one or two).• But needs a more complex correspondence algorithm

Page 25: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Continuum of Triangulation Methods

Slow, robustSlow, robust Fast, fragileFast, fragile

Multi-stripeMulti-stripeMulti-frameMulti-frame

Single-frameSingle-frameSingle-stripeSingle-stripe

Page 26: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Time-of-flight

+ No baseline, no parallax shadows+ Mechanical alignment is not as critical − Low depth accuracy− Single viewpoint capture

Miyagawa, R., Kanade, T., “CCD-Based Range Finding Sensor”, IEEE Transactions on Electron Devices, 1997

Working Volume: 1500mm - Accuracy: 7%Spatial Resolution: 1x32- Speed: ??

Page 27: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Comercial products

Canesta

64x64@30hzAccuracy 1-2cm

Not accurate enough for face modeling, but good enough for layer extraction.

Page 28: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Depth from Defocus

Page 29: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Depth from Defocus

Page 30: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Depth from Defocus

Nayar, S.K., Watanabe, M., Noguchi, M., “Real-Time Focus Range Sensor”, ICCV 1995

Working Volume: 300mm - Accuracy: 0.2%Spatial Resolution: 512x480 - Speed: 30Hz

+ Hi resolution and accuracy, real-time− Customized hardware− Single view capture?

Page 31: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Capturing and Modeling Appearance

Page 32: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Computer Vision

Computer Graphics

Satellite Imaging

Underwater Imaging

Medical Imaging

Appearance

Page 33: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Debevec, Siggraph 2002

Capture Face Appearance

Page 34: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Image-Based Rendering / Recognition

+

+

Schechner et. al. Multiplexed Illumination

Page 35: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Paul Debevec’s Light Stage 3

Page 36: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Light Stage DataLight Stage DataOriginal Resolution: 6432

Original Resolution: 6432

Lighting through image recombination: Haeberli ‘92, Nimeroff ‘94, Wong ‘97Lighting through image recombination: Haeberli ‘92, Nimeroff ‘94, Wong ‘97

Page 37: Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo

Georghiades, Belhumeur & KriegmanYale Face Database B

Shape Recovery

BRDF

Material Recognition

Human Vision

Rendering

Object / Face Recognition