A novel Human Detection approach Based on Depth Map via Kinect

Last Update:2014-08-28 Source: Internet

Author: User

Tags svm

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

In this paper, a new method of human detection based on depth maps from 3D sensor Kinect is proposed. First, the pixel filtering and context filtering is employed to roughly repair defects on the depth map due to Informatio n inaccuracy captured by Kinect. Second, a dataset consisting of depth maps with various indoor human poses are constructed as benchmark. Finally, by introducing Kirsch mask and three-value codes to local Binary pattern, a novel local ternary Direction pattern (LTDP) feature descriptor is extracted and was used for human detection with SVM as classifier. The performance for the proposed approach are evaluated and compared with other five existing feature descriptors using the Same SVM classifier. Experiment results manifest the effectiveness of the proposed approach.

Man's detection

These methods can is roughly divided into three different categories; Human model based methods [1], template matching based methods [2] and statistical classification methods [3-5].

LBP (local binary pattern) feature, which is a string of bits obtained by binarizing Local neighborhood of pixels with Res Pect to the brightness of central pixel, were recently proposed to capture microscopic local image texture and was applied For human detection successfully

Hog-lbp

LTP (Local ternary patterns)

Centrist (census transform of histograms)

Unfortunately, since the Kinect mainly depends on speckle method [+], the depth map captured by Kinect often contains MUC H noise. So, Kinect uses this depth image to detect inaccurate, unstable

Overview of lbp-related Features

By defining the number of spatial transitions (0/1 changes) in LBP pattern with a U value defined as below in (3), the Uni Formity of LBP patterns, which refers to the patterns have limited transition or discontinuities (U2) in the circular bi Nary presentation, can be determined, where the U value is given as

The uniform LBP only have a bins, one for possible uniform patterns and one for all of the non-uniform ones.

Ternary LTP Code

In summary, the UNIFORM-LBP reduces the dimensions of LBP, while LTP extends LBP to three-valued codes and therefore Enhan Ces its anti-noise performance. The centrist introduces a pyramid structure to LBP and makes a multi-scale observation.

Feature Extraction

Noise Reduction Filters to Depth Map

Compared with TOF data, the depth map captured by Kinect have mounts of null-value areas, which present as ' white holes ' in Depth map

Traditional filters, E.I, mean filtering, Gaussian filtering, usually is utilized to remove salt and pepper noises

The pixel filter is designed to compensate the ' holes ' and the context filter is employed to further reduce noise in GE Neral.

It should is noted that the frames waiting to be retrieved is limited, because the method ignores the scene change Betwee N Frames.

The LTDP Feature

LTDP (local ternary direction pattern) is derived from the lbp-related feature descriptors by plugging a Specific Kirsch mask [] and three-valued codes to it

At first, the LTDP was calculated by comparing the relativ E Edge response value of a pixel in different directions. The Edge response value (S0~S7) of a particular pixel is calculated with the Kirsch mask at eight different directio Ns. The Masks (M0~M7) is shown in fig.3.

Second, the 3-valued codes for the eight directions with threshold T is defined as follows

I n the experiment of this paper, a uniform pattern argument are designed and a coding scheme is used to split each ternary p Attern to its positive and negative halves and subsequently treating them as-separate channels of LTDP features for Which separate histograms is computed by combining the results is at the end of the computation.

Histogram of LTDP Feature for Depth Map

A depth map should is divided into several non-overlapping rectangular blocks.

A spatial histogram, concatenating the histograms of all blocks can is employed to represent the whole image.

Classification algorithm

SVM is relatively-robust and easy-to-be implemented. In this study, both linear kernel SVM and nonlinear kernel SVM is used as classification algorithm.

Detection class Read the first paper, also pointed out the idea of face detection bar ~ ~ ~

A novel Human Detection approach Based on Depth Map via Kinect

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More