computer vision systems programming: Übungs präsentation

24
Computer Vision Systems Programming: Übungs Präsentation Sergejs Stavro 1128054

Upload: others

Post on 09-May-2022

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Computer Vision Systems Programming: Übungs Präsentation

Computer Vision Systems Programming: Übungs Präsentation

Sergejs Stavro 1128054

Page 2: Computer Vision Systems Programming: Übungs Präsentation

Theme

● Person recognition based on depth data.

Page 3: Computer Vision Systems Programming: Übungs Präsentation

Development Environment

● Haxe NME

– Haxe is an open-source high-level multiplatform programming language and compiler that can produce applications and source code for many different platforms from a single code-base. [wiki]

– NME is a framework for building games and applications for mobile, desktop and web platforms. [nme.io]

Page 4: Computer Vision Systems Programming: Übungs Präsentation

Method

● Downsampling;

● Slicing;

● Removing the floor;

● Object separation;

● Finding the person;

Page 5: Computer Vision Systems Programming: Übungs Präsentation

Downsampling

● The original image contains a lot of redundant data;

+ will improve performance and reduce complexity, unimportant data will be discarded;

– valuable data can get lost;

● Used method:

Scaling to 12.5% without interpolation.

Page 6: Computer Vision Systems Programming: Übungs Präsentation

Downsampling (Image)

Page 7: Computer Vision Systems Programming: Übungs Präsentation

Slicing

● Simply getting a slice of data from a certain range, e.g. getting only the foreground;

+ improves precision, allows to use different parameters for different ranges;

● Used method:

Simple thresholding, with two values.

Page 8: Computer Vision Systems Programming: Übungs Präsentation

Slicing (Image)

Page 9: Computer Vision Systems Programming: Übungs Präsentation

Object separation

● Partially done by slicing;

● Remove the ground;

● Group the pixels and find group bounding boxes;

● Used method:

Posterization + scan line checking;

Page 10: Computer Vision Systems Programming: Übungs Präsentation

Object separation (Image)

Page 11: Computer Vision Systems Programming: Übungs Präsentation

Object separation (Image)(2)

Page 12: Computer Vision Systems Programming: Übungs Präsentation

Finding the person

● Idea:

For every found group let's treat the data inside the bounding box as a one dimensional function f(y) = x, where y is the index of the scanline and x is the number or non-zero pixels in the scanline.

Let's apply an edge filter to this function [-0.5, 0, 0.5].

This will give us the places with sharp changes in width. In a human body such a change will often identify the neck/shoulder area (shoulders are twice as wide as the head).

Having that in mind we can assume that everything above this point is the head and we can check the width to height ratio for that part of the group. I accept 2:3 – 1:1 as a valid ratio.

Page 13: Computer Vision Systems Programming: Übungs Präsentation

Finding the person (Image)

Page 14: Computer Vision Systems Programming: Übungs Präsentation

Result (Image)

Page 15: Computer Vision Systems Programming: Übungs Präsentation

Problems and Limitations

● Different distances require different sets of parameters.

● Doesn't handle far away objects very well;

● Will fail if the head is covered;

● Might have problems recognizing people form the side view;

● Won't recognize a person if he/she is too close to some other object.

● Person must be standing straight;

Page 16: Computer Vision Systems Programming: Übungs Präsentation

Images

Page 17: Computer Vision Systems Programming: Übungs Präsentation

Images (2)

Page 18: Computer Vision Systems Programming: Übungs Präsentation

Images (3)

Page 19: Computer Vision Systems Programming: Übungs Präsentation

Images (4)

Page 20: Computer Vision Systems Programming: Übungs Präsentation

Images (4)

Page 21: Computer Vision Systems Programming: Übungs Präsentation

Images (5)

Too far away, too small to predict.

Page 22: Computer Vision Systems Programming: Übungs Präsentation

Images (5)

Same as the previous case.

Page 23: Computer Vision Systems Programming: Übungs Präsentation

Images (5)

Head and body almost the same width.

Page 24: Computer Vision Systems Programming: Übungs Präsentation

Images (5)

Human validation method is not perfect.