cv: perceiving 3d from 2d

35
MSU CSE 803 Stockman 1 CV: Perceiving 3D from 2D Many cues from 2D images enable interpretation of the structure of the 3D world producing them

Upload: dakota

Post on 12-Feb-2016

34 views

Category:

Documents


0 download

DESCRIPTION

CV: Perceiving 3D from 2D. Many cues from 2D images enable interpretation of the structure of the 3D world producing them. Topic roadmap. Labeling 3D structure in a 2D image Labeling constraints on edge graphs Huffman-Clowes-Waltz labeling Other cues motion parallax - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 1

CV: Perceiving 3D from 2D

Many cues from 2D images enable

interpretation of the structure of the 3D

world producing them

Page 2: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 2

Topic roadmap Labeling 3D structure in a 2D image Labeling constraints on edge graphs Huffman-Clowes-Waltz labeling Other cues motion parallax shape from texture or shading stereo from two 2d images

Page 3: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 3

Topic roadmap: mathematical models Shape from shading Depth from stereo Depth from focus Perspective transformation

(review)

Page 4: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 4

Many 3D cues

How can humans and other machines reconstruct the 3D nature of a scene from 2D images?What other world knowledge needs to be added in the process?

Page 5: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 5

Vocabulary for image labeling

Interpret the local structure of the scene in the image space

Page 6: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 6

Some terms for local 3D structure

Crease: (convex + or concave -) formed by an abrupt change to a surface or the joining of two surfaces. Surface on both sides of the crease can be sensed

Blade: ( > ) as in the blade of a knife, where the normal to the occluding surface element. Occluding and occluded surfaces unrelated.

(left) intensity image of 3 blocks (right) result of 5x5 Prewitt operator

Page 7: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 7

Limb: smooth object contour

Limb: (>>) formed by viewing a smooth 3D object, such as an arm or a soup can: when approaching the contour, the surface normal becomes perpendicular to the line of sight. (Right side of arrow is the occluding surface.)

egg

Soup can

Page 8: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 8

Albedo and lighting mark: (M) surface mark or

change of “albedo” (reflectance) and not the 3d surface, creating an intensity contour in the image

shading: (S) illumination change due to a change in lighting or shadow on the surface, creating an intensity contour in the image

Page 9: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 9

Labeling image contours interprets the 3D scene structure

+An egg and a thin cup on a table top lighted from the top right

Logo on cup is a “mark” on the material

“shadow” relates to illumination, not material

Page 10: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 10

“Intrinsic Image” stores 3D info in “pixels” and not intensity.

For each point of the image, we want depth to the 3D surface point, surface normal at that point, albedo of the surface material, and illumination of that surface point.

Page 11: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 11

Practice labeling the contours

(left) an unopened can of Brand X soda is a solid blue can with a bright orange block ‘X’. (right) an empty box with all four flaps open so even the bottom of the box is visible.

Page 12: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 12

3D scene versus 2D image

Creases Corners Faces Occlusions (for

some viewpoint)

Edges Junctions Regions Blades, limbs, T’s

Page 13: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 13

Labeling of simple polyhedra

Labeling of a block floating in space. BJ and KI are convex creases. Blades AB, BC, CD, etc model the occlusion of the background. Junction K is a convex trihedral corner. Junction D is a T-junction modeling the occlusion of blade CD by blade JE.

Page 14: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 14

Trihedral Blocks World Image Junctions: only 16 cases!

Only 16 possible junctions in 2D formed by viewing 3D corners formed by 3 planes and viewed from a general viewpoint! From top to bottom: L-junctions, arrows, forks, and T-junctions.

Page 15: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 15

Challenge !

Create a scene shot of no more than 2 blocks that

creates all 16 junctions in the image

Page 16: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 16

How do we obtain the catalog? think about solid/empty

assignments to the 8 octants about the X-Y-Z-origin

think about non-accidental viewpoints

account for all possible topologies of junctions and edges

then handle T-junction occlusions

Page 17: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 17

Blocks world labeling

Left: block floating in space

Right: block glued to a wall at the back

Page 18: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 18

Try labeling these: interpret the 3D structure, then label parts

What does it mean if we can’t label them? If we can label them?

Page 19: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 19

Waltz filtering discards edge interpretations spanning junctions

Page 20: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 20

1975 researchers very excited very strong constraints on

interpretations several hundred in catalogue when

cracks and shadows allowed (Waltz): algorithm works very well with them

but, world is not made of blocks! later on, curved blocks world work

done but not as interesting

Page 21: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 21

Backtracking or interpretation tree

Page 22: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 22

Interpretation tree search

Page 23: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 23

“Necker cube” has multiple interpretations

A human staring at one of these cubes typically experiences changing interpretations. The interpretation of the two forks (G and H) flip-flops between “front corner” and “back corner”. What is the explanation?

Label the different interpretations

Page 24: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 24

Depth cues in 2D images

Page 25: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 25

“Interposition” cue

Def: Interposition occurs when one object occludes another object, thus indicating that the occluding object is closer to the viewer than the occluded object.

Page 26: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 26

interposition • T-junctions indicate occlusion: top is occluding edge while bar is the occluded edge• Bench occludes lamp post• leg occludes bench• lamp post occludes fence• railing occludes trees• trees occlude steeple

Page 27: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 27

• Perspective scaling: railing looks smaller at the left; bench looks smaller at the right; 2 steeples are far away• Forshortening: the bench is sharply angled relative to the viewpoint; image length is affected accordingly

Page 28: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 28

Texture gradient reveals surface orientation

Texture Gradient: change of image texture along some direction, often corresponding to a change in distance or orientation in the 3D world containing the objects creating the texture.

( In East Lansing, we call it “corn” not “maize’. )

Note also that the rows appear to converge in 2D

Page 29: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 29

3D Cues from Perspective

Page 30: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 30

3D Cues from perspective

Page 31: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 31

More 3D cues

Virtual lines Falsely perceived interposition

Page 32: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 32

More 3D cues

2D alignment usually means 3d alignment

2D image curves create perception of 3D surface

Page 33: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 33

“structured light” can enhance surfaces in industrial vision

Sculpted object Potatoes with light stripes

Page 34: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 34

Shape (normals) from shading

Cylinder with white paper and pen stripes

Intensities plotted as a surface

Clearly intensity encodes shape in this case

Page 35: CV: Perceiving 3D from 2D

MSU CSE 803 Stockman 35

Shape (normals) from shading

Plot of intensity of one image row reveals the 3D shape of these diffusely reflecting objects.