People counting is a challenging task with many applications. We propose a method with a fixed stereo camera that is based on projecting a template onto the depth image. The method was tested on a challenging outdoor dataset with good results and runs in real time.
DOCUMENT
In this paper we propose a head detection method using range data from a stereo camera. The method is based on a technique that has been introduced in the domain of voxel data. For application in stereo cameras, the technique is extended (1) to be applicable to stereo data, and (2) to be robust with regard to noise and variation in environmental settings. The method consists of foreground selection, head detection, and blob separation, and, to improve results in case of misdetections, incorporates a means for people tracking. It is tested in experiments with actual stereo data, gathered from three distinct real-life scenarios. Experimental results show that the proposed method performs well in terms of both precision and recall. In addition, the method was shown to perform well in highly crowded situations. From our results, we may conclude that the proposed method provides a strong basis for head detection in applications that utilise stereo cameras.
MULTIFILE
In this paper, we address the problem of people detection and tracking in crowded scenes using range cameras. We propose a new method for people detection and localisation based on the combination of background modelling and template matching. The method uses an adaptive background model in the range domain to characterise the scene without people. Then a 3D template is placed in possible people locations by projecting it in the background to reconstruct a range image that is most similar to the observed range image. We tested the method on a challenging outdoor dataset and compared it to two methods that each shares one characteristic with the proposed method: a similar template-based method that works in 2D and a well-known baseline method that works in the range domain. Our method performs significantly better, does not deteriorate in crowded environments and runs in real time.
DOCUMENT