Tensor decomposition and non-linear manifold modeling for 3D head pose estimation

  • Authors
  • Derkach D, Ruiz A, Sukno FM
  • UPF authors
  • SUKNO, FEDERICO ; RUIZ OVEJERO, ADRIÀ; DERKACH ., DMYTRO;
  • Type
  • Scholarly articles
  • Journal títle
  • International Journal of Computer Vision
  • Publication year
  • 2019
  • Volume
  • 127
  • Number
  • 10
  • Pages
  • 1565-1585
  • ISSN
  • 0920-5691
  • Publication State
  • Published
  • Abstract
  • Head pose estimation is a challenging computer vision problem with important applications in different scenarios such as human¿computer interaction or face recognition. In this paper, we present a 3D head pose estimation algorithm based on non-linear manifold learning. A key feature of the proposed approach is that it allows modeling the underlying 3D manifold that results from the combination of rotation angles. To do so, we use tensor decomposition to generate separate subspaces for each variation factor and show that each of them has a clear structure that can be modeled with cosine functions from a unique shared parameter per angle. Such representation provides a deep understanding of data behavior. We show that the proposed framework can be applied to a wide variety of input features and can be used for different purposes. Firstly, we test our system on a publicly available database, which consists of 2D images and we show that the cosine functions can be used to synthesize rotated versions from an object from which we see only a 2D image at a specific angle. Further, we perform 3D head pose estimation experiments using other two types of features: automatic landmarks and histogram-based 3D descriptors. We evaluate our approach on two publicly available databases, and demonstrate that angle estimations can be performed by optimizing the combination of these cosine functions to achieve state-of-the-art performance.
  • Complete citation
  • Derkach D, Ruiz A, Sukno FM. Tensor decomposition and non-linear manifold modeling for 3D head pose estimation. International Journal of Computer Vision 2019; 127(10): 1565-1585.
Bibliometric indicators
  • 2 times cited Scopus
  • 2 times cited WOS
  • Índex Scimago de 4.121 (2019)