Hand detection in cluttered scene images using Fourier-Mellin invariant features
نویسنده
چکیده
This paper proposes an automatic hand detection system that combines the Fourier-Mellin Transform along with other computer vision techniques to achieve hand detection in cluttered scene color images. The proposed system uses the Fourier-Mellin Transform as an invariant feature extractor to perform RST invariant hand detection. In a first stage of the system a simple non-adaptive skin color-based image segmentation and an interest point detector based on corners are used in order to identify regions of interest that contains possible matches. A sliding window algorithm is then used to scan the image at different scales performing the FMT calculations only in the previously detected regions of interest and comparing the extracted FM descriptor of the windows with a hand descriptors database obtained from a train image set. The results of the performed experiments suggest the use of Fourier-Mellin invariant features as a promising approach for automatic hand detection. Index Terms — Automatic hand detection, Fourier-Mellin Transform, RST-invariant object representation. —————————— ——————————
منابع مشابه
Invariant Neural-Network Based Face Detection with Orthogonal Fourier-Mellin Moments
In this paper, we apply a recently developed type of moments, Orthogonal Fourier-Mellin Moments (OFMMs) [7], to the specijic problem of fully translation-, scaleand inplane rotation-invariant detection of human faces in twodimensional static color images, and we compare theirperformance with that of the generalized Hu's moments or nonorthogonal Fourier-Mellin moments (FMMs). As compared to the ...
متن کاملColor Fourier-Mellin descriptors for image recognition
We propose new sets of Fourier-Mellin descriptors for color images. They are constructed using the Clifford Fourier Transform of Batard et al. (2010) and are an extension of the classical Fourier-Mellin descriptors for grayscale images. These are invariant under direct similarity transformations (translations, rotations, scale) and marginal treatment of colors images is avoided. An implementati...
متن کاملZhile Ren | Research Statement
Figure 1: COG descriptor encodes orientation-invariant gradient feature for objects with different views. I develop new representations and algorithms for three-dimensional (3D) scene understanding from cluttered indoor RGB-D images and outdoor video sequences. I introduce novel representations for 3D object detection systems that localize objects with cuboids and describe room layouts by Manha...
متن کاملComparative Performance of Different Chrominance Spaces for Color Segmentation and Detection of Human Faces in Complex Scene Images
Color is a powerful fundamental cue that can be used at an early stage to detect objects in complex scene images. This paper presents an analysis of the performance of nine different chrominance spaces in the specific problem of automatically detecting and locating human faces in twodimensional still scene images. For each space, we use a skin color model based on the Mahalanobis metric to segm...
متن کاملTexture Segmentation using Circular-Mellin Operators
Texture is an important preattentive cue in region-based segmentation of images. In this paper, we discuss the use of circular-Mellin features for segmenting an image into homogenous regions. The circular-Mellin operators represent the spectral decomposition of the image scene in the polar-log coordinate system and are invariant to both scale and orientation of the target. Coupled with the uniq...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011