Robust facial alignment with internal denoising auto-encoder

  • Authors
  • Aspandi D, Martinez O, Sukno FM, Binefa X
  • UPF authors
  • SUKNO, FEDERICO ; LATIF ., DECKY ASPANDI; MARTINEZ PUJOL, ORIOL; BINEFA VALLS, XAVIER;
  • Authors of the book
  • -
  • Book title
  • 16th Conference on Computer and Robot Vision (CRV), Kingston
  • Publisher
  • CRV
  • Publication year
  • 2019
  • Pages
  • 143-150
  • ISBN
  • 978-1-7281-1838-3
  • Abstract
  • The development of facial alignment models is growing rapidly thanks to the availability of large facial landmarked datasets and powerful deep learning models. However, important challenges still remain for facial alignment models to work on images under extreme conditions, such as severe occlusions or large variations in pose and illumination. Current attempts to overcome this limitation have mainly focused on building robust feature extractors with the assumption that the model will be able to discard the noise and select only the meaningful features. However, such an assumption ignores the importance of understanding the noise that characterizes unconstrained images, which has been shown to benefit computer vision models if used appropriately on the learning strategy. Thus, in this paper we investigate the introduction of specialized modules for noise detection and removal, in combination with our state-of-the-art facial alignment module and show that this leads to improved robustness both to synthesized noise and in-the-wild conditions. The proposed model is built by combining two major subnetworks: internal image denoiser (based on the Auto-Encoder architecture) and facial landmark localiser (based on the inception-resnet architecture). Our results on the 300-W and Menpo datasets show that our model can effectively handle different types of synthetic noise, which also leads to enhanced robustness in real-world unconstrained settings, reaching top state-of-the-art accuracy.
  • Complete citation
  • Aspandi D, Martinez O, Sukno FM, Binefa X. Robust facial alignment with internal denoising auto-encoder. In: -. 16th Conference on Computer and Robot Vision (CRV), Kingston. 1 ed. CRV; 2019. p. 143-150.
Bibliometric indicators
  • 3 times cited Scopus
  • 4 times cited WOS
  • Índex Scimago de 0