perspective multiscale detection and tracking of persons

Perspective Multiscale Detection and

Tracking of Persons

Marcos Nieto, Juan Diego Ortega, Andoni Cortés, and Seán Gaines

MMM 2014 – The 20th Anniversary International Conference on

Multimedia Modeling, Dublin (Ireland), 6,7,8-10th January 2014

1. Motivation

2. Perspective calibration

3. Approach

4. Results

Outline

5. Conclusions

Outline

1. Motivation

1. Object detection in images

2. Real-time application

3. Contextual information

2. Perspective calibration

3. Approach

4. Results

5. Conclusions

Motivation

• Object detection in images

Detection-by-classification

Supervised learning

Feature extraction

Binary or multiclass

Multiscale detection

Sliding window

Spans position & size

Bounding boxes

Open Open

Close Close

• Real-time applications

Motivation

Multiscale detection

Kind of brute-force

Too many evaluations

Some are absurd given the context

1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61

Levels

Parameters

Initial (smallest) size

Number of scales

Factor between scales

Offset (stride)

Therefore, some

knowledge about the

scene must be provided

• Contextual information

Motivation

Color, motion, depth

Low generality

Particular to each application

Perspective of the scene

High generality

Allows to maintain multiscale technique

Applicable in real-time

Two assumptions

There is a dominant ground plane

Objects lie on the plane, and their 3D size is app. known

Surveillance, ADAS

Vehicles, persons

Outline

1. Motivation

2. Perspective Calibration

1. Plane view calibration

2. GUI

3. Projection of objects

3. Approach

4. Results

5. Conclusions

Perspective calibration

• Plane view calibration

Homography calculation

4-points

2 metric references

Extrinsics from Homography

Rotation and translation of

camera

1 DoF Camera model

Focal length from homography

Refinement using Lev.-Marq.

Useful to calibrate videos

Quick (2-5 minutes)

Also lens distortion

correction

• Projection of objects

Farthest size of object

Closest size of object

Outline

1. Motivation

3. Approach

1. Overview

2. Perspective Multiscale

3. Perspective Grid

4. Results

5. Conclusions

• Define the perspective of the scene

• Define the 3D size of the object to search

Approach

Camera calibration

Intrinsic parameters

Camera pose

Extrinsic parameters

Homography

calibration

• Define the 3D size of the object to search

• A) Calculate the best parameters for multiscale

• B) Define a fixed grid of positions in the plane

Persons

1700 x 500 x 500Car

1500 x 1700 x 3500

• A) Perspective multiscale

• Rescale original

image so model size

fits farthest object

• Compute scale

factor so that model

size coincides with

Approach

Multiscale Perspective Multiscale

size coincides with

closest object at the

smallest image

• Filter out invalid

positions

Focused effort: less

number of levels are

required

• It is still necessary to filter out invalid positions-sizes

• The advantage of using this approach is that traditional multiscale

implementations can still be used with much less number of levels

Approach

1 6 11 16 21 26 31 36 41 46 51 56 61

Levels

Focused effort: less number

of levels are required

(typically 3 to 5)

• B) Grid of fixed positions

• Predefine feasible

locations of objects

• No need to filter

• Can not be used in

multiscale

Approach

Can not be used in

multiscale

implementations.

One evaluation per

candidate

Much more focused

effortBounding boxesProjected boxes

Outline

1. Motivation

3. Approach

4. Results

1. Case study: person detection

2. Case study: vehicle detection

5. Conclusions

• Case study: Person detection

– Full-body and Head & Shoulder SVM-HOG detector

– Perspective Multiscale

– Linear multiobject tracking

– Active Vision Group dataset (1920x1080, 4500 frames, 71460 persons

labeled)

Results

labeled)

Results

• Performance

– Reduction from

144880 to 46226

(68%) for similar

performance

Using 3 levels is

Multiscale Perspective Multiscale

-0,1 6E-16 0,1 0,2 0,3 0,4 0,5 0,6

Recall

DAFFiltering

TrackingLess FN

Less FP but also

missdetections

L=3, 5, 7

– Using 3 levels is

enough because

perspective effect is

• Case study: Vehicle detection

– Vehicle detection application for embedded vision system

– Road can be assumed as planar in the short distance

– Ground truth sequence 2 minutes

– Grid of fixed positions

Results

• Case study: Vehicle detection

– Detections are sparse and noisy

– Tracking is still necessary

Results

•1000x less evaluations

•7x speed in PC

•Same TP

•5 times less FP

Results

Type Processor RAM CPU OS Language

PC Intel Core

8 GB 3.0 GHz Windows 7

Ubuntu 12.04

Embedded

Cortex

512 MB 800 MHz Xilinx Zynq

30 - 40 ms in ARM Cortex30 - 40 ms in ARM Cortex

FastSlow

11 – 40 ms in PC11 – 40 ms in PC

25 fps real-time

Perspective

multiscale

Brute-force

multiscale

2 - 10 ms in PC 2 - 10 ms in PC

Conclusions

• Perspective is a contextual information available in many situations

• Assumptions: dominant ground plane and known object size

• Its computation is easy (K, R, t) using homographies

• It can be used for object detection to focus computational Twoways of applying it

ways of applying it

• A) Perspective Multiscale: Wrapping multiscale function (~60% reduction in typical surveillance scene)

• B) Grid of fixed positions: for even more reduction of complexity (x7 speed up in low perspective scenes like onboardvehicle detection)

Thank You!Dr. Marcos Nieto

Dr. Marcos Nieto

Researcher

mnieto@vicomtech.org

Offline process

Online process

perspective multiscale detection and tracking of persons

approach multiscale

depth perspective

images multiscale detection

multiscale b

motivation object detection

d size

closest object

window spans position

Technology

acquisition of land for the oil refinery: tracking progress...

harry potter’s marauder’s map: localizing and tracking...

machine learning for generalized multiscale modeling ·...

multiscale filtering and neural network classification for...

multiscale method

multiscale computational methods

the strategy of tracking travel history of infected ... ·...

breast density scoringwith multiscale denoising...

multiscale modeling

point-based multiscale surface...

multiscale retinex

semiclassical & multiscale aspects of wave propagation ·...

multiscale damage

harry potter's marauder's map: localizing and tracking...

multiscale materials modeling

simultaneous tracking of multiple body parts of...

acquisition of land for the oil refinery: tracking … of...

1 tracking persons-of-interest via unsupervised...

strategic multiscale

multiscale physical and structural particle mechanics ·...