microsoft kinect gesture recognition usingchang/231/y14/seminarqi.ppt · gesture recognition an...

11
Gesture Recognition usi Microsoft Kinect

Upload: others

Post on 19-Aug-2020

11 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

Gesture Recognition using Microsoft Kinect

Page 2: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

Outline:

1. Show an example of gesture recognition

2. Describe the framework of Kinect gesture recognition

3. Divide the framework into several stages and explain the techniques used in each stage

Page 3: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

Example for gesture recognition

XBOX 360

Dragon Ball Z for Kinect

Gestures can be used to both attack and defense

Page 4: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

Question:

Suppose we want to use “release” to attack and “hold” to defense, how can we recognize the gestures?

Page 5: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

General frame work for gesture recognition

1) identifying the pixels in the image that constitute the hand we’re interested in

2) extracting features from those identified pixels in order to classify the hand into one of a set of predefined poses

3) recognizing the occurrence of specific pose sequences as

gestures.

Page 6: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

IDENTIFICATION OF ‘HAND PIXELS’

1. Kinect collects color and depth infomation

2. differentiate between(‘hand pixels’) and (‘background pixels’).

solutions: 1) threshold based on depth2) label hand pixels using rgb data

Page 7: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

Identification based on rgb data

- RGB indicates a specific set of rgb values

- S is a binary random variable indicating whether or not a pixel is that of skin

Page 8: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

Identification based on rgb data and depth data

- RGB indicates a specific set of rgb values

- S is a binary random variable indicating whether or not a pixel is that of skin

- D represents a pixel’s depth value

- H is a binary random variable indicating whether or not a pixel is that of a hand

Page 9: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

FEATURE EXTRACTION

Given a labeling of hand pixels, we now want to identify it as one of a set of predefined poses

Solution: Radial histogram

1) find the center of mass in our hand pixel

2) for each pixel, we calculate the angle offset from this

center.

The idea here is that for any hand image, the corresponding radial histogram will have distinct spikes corresponding to extended fingers.

Page 10: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

GESTURE RECOGNITION

an open hand followed by aclosed hand can be labeled a ‘grasping’ gesture, while a closed hand followed by an open hand can be labeled a ‘dropping’ gesture.

exploiting the temporal dependence between poses

Et be the estimated pose attime t, and let St be the actual pose (or state) at time t

use p(St |E t ) as our final estimate ofthe hand pose at time t.

Page 11: Microsoft Kinect Gesture Recognition usingchang/231/y14/seminarQi.ppt · GESTURE RECOGNITION an open hand followed by a closed hand can be labeled a ‘grasping’ gesture, while

GESTURE RECOGNITION

an open hand followed by aclosed hand can be labeled a ‘grasping’ gesture, while a closed hand followed by an open hand can be labeled a ‘dropping’ gesture.

exploiting the temporal dependence between poses

Et be the estimated pose attime t, and let St be the actual pose (or state) at time t

use p(St |E t ) as our final estimate ofthe hand pose at time t.