3d fingertip and palm tracking in depth image sequences hui liang, junsong yuan and daniel thalmann...
TRANSCRIPT
3D Fingertip and Palm Tracking in Depth Image SequencesHui Liang, Junsong Yuan and Daniel Thalmann
Proceedings of the 20th ACM international conference on Multimedia 2012
4
Introduction• Human hand is an essential body part for human-computer
interaction.
• The positions of tracked fingertips: gesture estimation
• Difficulty in fingertip tracking:
Side-by-side Bending Nearby
5
Introduction• Many previous methods:
• Only focus on extracting 2D fingertips
• Cannot track fingertips robustly for a freely moving hand
• In this paper:
• Present a robust fingertip and palm tracking scheme
• With the input of depth images (KINECT)• Track the 3D fingertip positions quite accurately
Related work• Focus only on 2D fingertips:[4][5][6][9]
• Based on contour analysis of the extracted hand region:[2][4][5][6]
• Usually can track the fingertips for only stretched fingers.
Related work• In [6],
• Fingertips are tracked for infrared image sequences.• It utilizes a template matching strategy • Fingertip tracking : Kalman filter
• In [2],
• Stereoscopic vision is adopted• Maximize the distance center of gravity of the hand & the boundary
Related work• In [9] (Kinect),
Depth < Threshold Circular filter
Minimum depth
[9] J. L. Raheja, A. Chaudhary, and K. Singal. Tracking of fingertips and centres of palm using kinect, 2011. Proc. Of the 3rd Int’l Conf. on Computational Intelligence, Modelling and Simulation.
Related work• [2] S. Consei1, S. Bourennane, and L. Martin. Three dimensional fingertip tracking in
stereovision, 2005. Proc. of the 7th Int’l Conf. on Advanced Concepts for Intelligent Vision Systems.
• [4] K. Hsiao, T. Chen, and S. Chien. Fast fingertip positioning by combining particle filtering with particle random diffusion, 2008. Proc. IEEE Int’l Conf. on Multimedia and Expo.
• [5] I. Katz, K. Gabayan, and H. Aghajan. A multi-touch surface using multiple cameras, 2007. Proc. of the 9th Int’l Conf. on Advanced concepts for intelligent vision systems.
• [6] K. Oka, Y. Sato, and H. Koike. Real-time tracking of multiple fingertips and gesture recognition for augmented desk interface systems, 2002. Proc. IEEE Int’l Conf. on Automatic Face and Gesture Recognition.
• [9] J. L. Raheja, A. Chaudhary, and K. Singal. Tracking of fingertips and centres of palm using kinect, 2011. Proc. Of the 3rd Int’l Conf. on Computational Intelligence, Modelling and Simulation.
12
Overview
Foreground Segmentation
PalmLocalization
HandSegmentation
Fingertip Detection
Fingertip Tracking
Stereo images(Depth map)
Fingertips
13
Hand and Palm Detection• 1) Assume the hand is the nearest object
• 2) Constrain global hand rotation by:
• : global rotation angle of the hand
Foreground
Segmentation
Palm
Localization
Hand
Segmentation
14
• Threshold the depth frame to obtain the foreground F :
• p : a pixel coordinate • z(p) : depth value (of point p )
• z0 : the minimum depth value
• zD : depth threshold
Hand and Palm DetectionForeground
Segmentation
PalmLocalization
HandSegmentation
foreground F
0.2m
15
• The palm region is approximated with a circle:
• pp : the palm center (of point p )
• rp : the radius
• Assume that hand palm forms a globally largest blob• Cp equals to the largest inscribed circle of the contour of F .
• 2D Kalman filter : reduce computational complexity
Hand and Palm DetectionForeground
Segmentation
PalmLocalization
HandSegmentation
16
• Separate hand and forearm by a line:
• 1) Tangent to Cp
• 2) Perpendicular to the orientation of the forearm
• Orientation of the forearm :• The Eigenvector that corresponds to the largest
Eigenvalue of the covariance matrix of the contour pixel coordinates of F
Hand and Palm DetectionForeground
Segmentation
PalmLocalization
HandSegmentation
Hand region : FV(2D) → FD(3D)
17
Fingertip Detection and Tracking• Constraints on possible fingertip locations:
• 1) Only in depth discontinuous region ( in contour Fv)
• 2) | Depth(one point) – Depth(neighborhoods) | are important.
• 3) Utilize the 3D geodesic shortest path (GSP)
Fingertip vs. Non-fingertip
Nearby Fingertips
Fingertip
Detection
Fingertip
Tracking
18
Fingertip detection
Fingertip tracking
• Goal: detect all five fingertips in the depth image• Based on three depth-based features
• Build a graph G :
• Vh : contains of all points within FV (hand contour)
• Eh : for each pair of vertices(p,q), 1) they are in the 8-neighborhood of each other 2) their 3D distance is within threshold τ
Fingertip Detection
Edge weight : 3D Euclidean distance
19
• Calculate Geodesic distance dg(p):• From palm center pp for each vertex Vh
• Dijkstra graph search on Gh
• GSP point set Ug(p):
• The set of vertices on the shortest path from pp to p
• Rectangle local feature RL(p):• Describe the neighborhood of a point p in FV
•
• : ratio of 1s
Fingertip Detection
0 0 1
1 1 1 1 1
1 1 X 1 0
1 1 1 1
0 1 0
Fingertip detection
Fingertip tracking
1cm
20
• Calculate Geodesic distance dg(p):
Fingertip Detection
0.4
Fingertip detection
Fingertip tracking
dg(p)
𝜂(𝑝)
fingertips
21
• Fingertip labeling:
Fingertip Detection
:estimate the probability that has the label lj
number of GSP points kth GSP point
1 2 3 4 5
1
2
3
4
5i : fingertip
j : label
Fingertip detection
Fingertip tracking
Nmax=6
23
Fingertip Tracking• Build a particle filter for each fingertip
• (x, ω) denote a particle• x : 2D position in FV
• ω : the particle weight
• Constrain the positions of each particle to the border point set UB to reduce the search space
Fingertip detection
Fingertip tracking
24
• Likelihood function :
•
Metric parameters
difference
Hausdorff difference
feature difference
Fingertip TrackingFingertip detection
Fingertip tracking
/
/
Geodesicdistance
GSP points
Neighbordepth
pre now
dg(p)
𝑹𝑳(𝒑)
27
Experimental Results• Quantitative results on synthetic sequences:
Seq. No.
MotionSeq. No.
Motion
Seq. 1 grasping Seq. 4 flexion
Seq. 2 adduction/abduction Seq. 5 global rotation
Seq. 3 successive single finger Seq. 6combination of grasping
and global rotation
‧Error : Euclidean distance‧Ground truth : phalanx end point