thesis presentation

23
Semantically Integrating Laser and Vision in Pedestrian Detection Luciano Oliveira Advisors: Prof. Urbano Nunes Prof. Paulo Peixoto

Upload: luciano-oliveira

Post on 28-May-2015

837 views

Category:

Technology


1 download

TRANSCRIPT

Page 1: Thesis presentation

Semantically Integrating Laser and Vision in Pedestrian Detection

Luciano Oliveira

Advisors:Prof. Urbano NunesProf. Paulo Peixoto

Page 2: Thesis presentation

Motivation

SegmentationRecognition

TrackingSearching

Clustering methods

Kalman Filter

Efficient sub-window searching

(image)

Where is the pedestrianin the scene?

Page 3: Thesis presentation

Goals

Object detectionusing laser/vision

Proof-of-concept: pedestrian detection, but can be applied to several other objects

Recover objectlocalization

DO NOT entirelyrely on laser, as previous methods do

Perform the fusion in a context-aware mode

Page 4: Thesis presentation

Overview of the proposed method

Coarsesegmentation

Finesegmentation

Sensor registration

Parts-based ensemble detector

Semantic/contextualinterpretation

3D sliding windowsearching

Inference and decision outputs

Procrustesanalysis

Referenceshapes

Laser points

For each2D window

(label, confidence) for each fm

Images

For each3D window

(object, confidence)

Templating matching

Laser-image registration

Laser segmentation and labeling

MLN

HLSM-FINT

Ground MRF

{ }Nnnc 1=

{ }Mmmf 1=

for each cn

Page 5: Thesis presentation

Experimental setupPointgrey camera

Sick LMS200 laser

Odometry

Page 6: Thesis presentation

Sensor-driven detectors

Coarsesegmentation

Finesegmentation

Sensor registration

Parts-based ensemble detector

Semantic/contextualinterpretation

3D sliding windowsearching

Inference and decision outputs

Procrustesanalysis

Referenceshapes

Laser points

For each2D window

(label, confidence) for each fm

Images

For each3D window

(object, confidence)

Templating matching

Laser-image registration

Laser segmentation and labeling

MLN

HLSM-FINT

Ground MRF

{ }Nnnc 1=

{ }Mmmf 1=

for each cn

Page 7: Thesis presentation

Ensemble of classifiers HFI

Perimeter rate

Distance / max(w,w´)

Fuzzy System

C1 scaled score

C2 scaled score

FuzzySystem

Join

t co

nfid

ence

Inte

rsec

tion

rate

Fina

l con

fiden

ceFuzzy

System

Hierarchical Fuzzy IntegrationFuzzy inputs

C2

C1

Page 8: Thesis presentation

It suffers from exponential growing of rules and low overall performance over challenging situations

Drawbacks

Initially evaluated on Haar-like features / Adaboost and HOG / SVM classificationsystems

Page 9: Thesis presentation

Ensemble of classifiers HLSM-FINT

Page 10: Thesis presentation

HLSM-FINT – Rationale

• CNN – expert in background (BG) (60% of hit rate in NiSIS competition)• HOG/SVM – expert in objects (OB) (70% of hit rate in NiSIS competition)• Fuzzy integral (Sugeno) – providesa comprehensive framework andgreat synergism• 95.67% of hit rate in NiSIScompetition over 6125 croppedimages (ped + non-ped), usingHeuristic Majority Vote method• 96.4% of hit rate over fullDaimlerChrysler datasets : ~15.000 images

BG

BG BG

BG OB

OB

Page 11: Thesis presentation

Parts-based HLSM-FINT

Upper HLSM-FINT (shoulder + header)

Lower HLSM-FINT (waist)

Page 12: Thesis presentation

Laser detector

Page 13: Thesis presentation

Laser detector

arm armtorso

armtorso

torsoarm armpartial

segment

• Featureless approach

• Coarse-to-fine segmentation

• Relative Neighboorhood Graph (RNG) clustering + clustering index

• Procrustes Analysis (PA) labeling procedure

Page 14: Thesis presentation

Laser detector

Page 15: Thesis presentation

Laser detector

Occlusion problem:

• z-buffer analysis +

• angle between start and endpoint (proportional to laser angle resolution)

Page 16: Thesis presentation

Laser-image registration

Coarsesegmentation

Finesegmentation

Sensor registration

Parts-based ensemble detector

Semantic/contextualinterpretation

3D sliding windowsearching

Inference and decision outputs

Procrustesanalysis

Referenceshapes

Laser points

For each2D window

(label, confidence) for each fm

Images

For each3D window

(object, confidence)

Templating matching

Laser-image registration

Laser segmentation and labeling

MLN

HLSM-FINT

Ground MRF

{ }Nnnc 1=

{ }Mmmf 1=

for each cn

Page 17: Thesis presentation

Laser-image registration

Zhang and Pless’ calibration method(with an error of 6 mm in the calibration)

Page 18: Thesis presentation

Semantic Fusion

Coarsesegmentation

Finesegmentation

Sensor registration

Parts-based ensemble detector

Semantic/contextualinterpretation

3D sliding windowsearching

Inference and decision outputs

Procrustesanalysis

Referenceshapes

Laser points

For each2D window

(label, confidence) for each fm

Images

For each3D window

(object, confidence)

Templating matching

Laser-image registration

Laser segmentation and labeling

MLN

HLSM-FINT

Ground MRF

{ }Nnnc 1=

{ }Mmmf 1=

for each cn

Page 19: Thesis presentation

Semantic fusion

Page 20: Thesis presentation

Semantic fusion

MRF

Wi

• MRFs given by FOL formulas• Weights given by the MRF training (gradient ascent method over theconditonal log-likelihood)

Page 21: Thesis presentation

Semantic fusion – Examples

Page 22: Thesis presentation

ConclusionsHFI has achieved better performance than its components, but failedto get the gist of the fusionHLSM-FINT has succeeded to capture the aimed synergism of thefusion, but has had difficulties on hard situations (e.g. occlusion). Parts-based occlusion has improved this issue.The introduction of the laser sensor has brought significantimprovementThe proposed fusion method offers two main advantages:

Contextual and spatial relationship among the parts of theobject, dropping the false alarm rateIt is able to detect the object in spite of laser failing

The whole system is not able to run on-the-fly, although there is no code optimization. Nevertheless, parallel hardware can provideinteresting plataform to make the system faster. It will be subject offuture research.

Page 23: Thesis presentation

Publications and awardsJournals

OLIVEIRA, L.; NUNES, U.; PEIXOTO, P.; SILVA, M. and MOITA, F. SemanticFusion of Laser and Vision in Pedestrian Detection, Journal of PatternRecognition, Elsevier, accepted for publication (ISI impact factor: 3.279).OLIVEIRA, L.; NUNES, U. and PEIXOTO, P. On Exploration of ClassifierEnsemble Synergism in Pedestrian Detection, IEEE Transactions onIntelligent Transportation Systems, pp. 16-21, 2010 (ISI impact factor: 2.844).

Awards3rd place in Intel/GV Entrepreneurship and Venture Capital Competition(2008)1st place in NiSIS Competition - Best accuracy model over Daimler Chrysler image dataset. Scheme of Primate's Visual Cortex Cells for Pedestrian Recognition (2007)

5 international conferences