Volume 40, Issue 6 pp. 47-61
Article

Action Unit Driven Facial Expression Synthesis from a Single Image with Patch Attentive GAN

Yong Zhao

Yong Zhao

Shaanxi Key Laboratory on Speech and Image Information Processing, National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science, Northwestern Polytechnical University (NPU), Xi'an, China

Audio Visual Signal Processing (AVSP) Research Laboratory, Department of Electronics and Informatics (ETRO), Vrije Universiteit Brussel (VUB), Brussels, Belgium

Search for more papers by this author
Le Yang

Le Yang

Shaanxi Key Laboratory on Speech and Image Information Processing, National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science, Northwestern Polytechnical University (NPU), Xi'an, China

Search for more papers by this author
Ercheng Pei

Ercheng Pei

Shaanxi Key Laboratory on Speech and Image Information Processing, National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science, Northwestern Polytechnical University (NPU), Xi'an, China

Search for more papers by this author
Meshia Cédric Oveneke

Meshia Cédric Oveneke

Audio Visual Signal Processing (AVSP) Research Laboratory, Department of Electronics and Informatics (ETRO), Vrije Universiteit Brussel (VUB), Brussels, Belgium

Artificial Intelligence Research Lab, Fit-For Purpose Technologies, Brussels, Belgium

Search for more papers by this author
Mitchel Alioscha-Perez

Mitchel Alioscha-Perez

Audio Visual Signal Processing (AVSP) Research Laboratory, Department of Electronics and Informatics (ETRO), Vrije Universiteit Brussel (VUB), Brussels, Belgium

Search for more papers by this author
Longfei Li

Longfei Li

Ant Financial Service, Alibaba, Hangzhou, China

Search for more papers by this author
Dongmei Jiang

Corresponding Author

Dongmei Jiang

Shaanxi Key Laboratory on Speech and Image Information Processing, National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science, Northwestern Polytechnical University (NPU), Xi'an, China

Peng Cheng Laboratory, Shenzhen, Guangdong China

[email protected]

Search for more papers by this author
Hichem Sahli

Hichem Sahli

Audio Visual Signal Processing (AVSP) Research Laboratory, Department of Electronics and Informatics (ETRO), Vrije Universiteit Brussel (VUB), Brussels, Belgium

Interuniversity Microelectronics Centre (IMEC), Heverlee, Belgium

Search for more papers by this author
First published: 17 March 2021
Citations: 1

[Correction added on 8 April 2021, after first online publication: Reference [ACK*17] and the in text citations to it had been mistakenly omitted and were restored.]

Abstract

Recent advances in generative adversarial networks (GANs) have shown tremendous success for facial expression generation tasks. However, generating vivid and expressive facial expressions at Action Units (AUs) level is still challenging, due to the fact that automatic facial expression analysis for AU intensity itself is an unsolved difficult task. In this paper, we propose a novel synthesis-by-analysis approach by leveraging the power of GAN framework and state-of-the-art AU detection model to achieve better results for AU-driven facial expression generation. Specifically, we design a novel discriminator architecture by modifying the patch-attentive AU detection network for AU intensity estimation and combine it with a global image encoder for adversarial learning to force the generator to produce more expressive and realistic facial images. We also introduce a balanced sampling approach to alleviate the imbalanced learning problem for AU synthesis. Extensive experimental results on DISFA and DISFA+ show that our approach outperforms the state-of-the-art in terms of photo-realism and expressiveness of the facial expression quantitatively and qualitatively.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.