min sun: 3d object detection

24
3D object categorization, detection, and viewpoint classification Final Presentation Min Sun EECS, University of Michigan at Ann Arbor Mentor: Gary Bradski

Upload: willow-garage

Post on 12-Nov-2014

17.682 views

Category:

Documents


2 download

DESCRIPTION

Presentation by Min Sun on the work he has been doing this summer in recognizing common objects in the office, like staplers, cups, and mice.

TRANSCRIPT

Page 1: Min Sun: 3D Object Detection

3D object categorization, detection, and viewpoint classification

Final Presentation

Min SunEECS, University of Michigan at Ann Arbor

Mentor: Gary Bradski

Page 2: Min Sun: 3D Object Detection

2

Goal: Viewpoint ClassificationGoal: Detection and Categorization

Page 3: Min Sun: 3D Object Detection

Approaches

• Discriminative Codeword (Random Forest), using Random Forest

• Hough voting for each viewing region

Mouse, V1

Mouse, V2

Page 4: Min Sun: 3D Object Detection

First Step: System implementation

• Using OpenCV and octave to re-implement the system

• Old system in Matlab: slow and not open source

• New system is fast and open source

• Speed-up detection from ~2 minutes to ~5 seconds for single object class

• Create a ROS node (rf_detector) to recognize object online

Page 5: Min Sun: 3D Object Detection

Challenges

• Need good shape descriptor for objects with less texture

• Need to have a multi-class object detector to detect multiple object classes at the same time

Page 6: Min Sun: 3D Object Detection

Second Step: system upgrade

• Exploring different shape features:

1. Histogram Oriented Gradients (opencv)

2. Geometric Blur (geometric_blur in ROS)

3. Berkeley natural boundary (Nb) detector

Berkeley (Nb) OpenCV

Page 7: Min Sun: 3D Object Detection

Conclusion

• Hog has similar performance as Geometric blur+natural boundary detector(Nb)

• It takes 3 minutes to compute Natural boundary(Nb) for each image

• Hog is fast and almost the best

Recall Mouse Stapler

Gb+Nb 28% 37%

Hog+Nb 25% 45%

Hog 30% 35%

Page 8: Min Sun: 3D Object Detection

Second Step: system upgrade

• Multi-class Random Forest

Mouse

Stapler

Mug & Mouse

Mug

Page 9: Min Sun: 3D Object Detection

Third Step: 3d information

• Using stereo depth to sample image patches corresponding to fix physical size to avoid scale search

• Using Dan’s shape spectral and spin image descriptors in descriptor_3d (ROS pkg)

• Combine both Hog and 3d descriptors

Page 10: Min Sun: 3D Object Detection

Data collection

• Table top object classes: mice, staplers, and mugs

• Collect aligned images and dense stereo point clouds

Page 11: Min Sun: 3D Object Detection

Multiple Views

Page 12: Min Sun: 3D Object Detection

Third Step: Results

mouse

staplermug

mouse stapler mug

Page 13: Min Sun: 3D Object Detection

Mouse

Page 14: Min Sun: 3D Object Detection

Stapler

Page 15: Min Sun: 3D Object Detection

Mug

Page 16: Min Sun: 3D Object Detection

Third Step: Comparison

Average Precision

ClassificationAccuracy

spin 0.213 0.2

shape 0.138 0.4

hog 0.635 0.73

hog+spin 0.612 0.7

Hog+shape 0.67 0.72

Page 17: Min Sun: 3D Object Detection

Working system

• Texture_light_on_off node aligns images w/o texture light and dense stereo point clouds

• Table top object detector (t2obj) segments out the point clouds of table top objects

• Finally, rf_detector recognizes object locations, classes, and viewpoints.

Page 18: Min Sun: 3D Object Detection

Results: miceRecognition Table top segmentation

Page 19: Min Sun: 3D Object Detection

Results: mugsRecognition Table top segmentation

Page 20: Min Sun: 3D Object Detection

Results: staplersRecognition Table top segmentation

Page 21: Min Sun: 3D Object Detection

ResultsRecognition Table top segmentation

Page 22: Min Sun: 3D Object Detection

ResultsRecognition Table top segmentation

Page 23: Min Sun: 3D Object Detection

• Train on 3d+image, Test on image only

Use image patches of fix physical size to detect objects and infer the 3d position of the supporting image patches -> Object Pop-Up

• Vote for object center directly in 3d

Make the model fully rotational invariant and more compact

Future work

Page 24: Min Sun: 3D Object Detection

Thank you