FDFST: facial structure constrained landmark detection using visual transformer
1 August 2022
Qianyu Zhou, Yanxin Wang, Xuliang Li, Jiquan Ma
Proceedings Volume 12257, 4th International Conference on Information Science, Electrical, and Automation Engineering (ISEAE 2022); 122571O (2022) https://doi.org/10.1117/12.2640359
Event: 4th International Conference on Information Science, Electrical, and Automation Engineering (ISEAE 2022), 2022, Guangzhou, China
Abstract
Facial landmark detection is challenging under occlusion, pose variation, or inadequate training samples. We propose a two-branch facial landmark detection network (Facial Detection with Face Structure and Transformers: FDFST) that takes face structure constraints into account. Existing regression-based facial landmark detection models have not fully considered the general facial structure, which usually leads to unstable predictions. In contrast to facial landmarks, facial structure is more likely to be estimated accurately in real scenarios. We therefore provide facial structure guidance for facial landmark detection through a facial structure estimation sub-network. Two targets are predefined to supervise the model: the facial structure, described by five landmarks, and the facial landmarks, denoted by 96 points. To address the lack of occlusion samples, we propose a novel data augmentation method to boost training on the public WFLW dataset. Experiments show that the FDFST network achieves a significant improvement on the WFLW dataset.
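The two supervision targets described in the abstract can be illustrated with a small sketch. The abstract does not state the loss function, so everything here is an assumption made for illustration: the mean Euclidean point error, the weighted sum, and the names `mean_point_error`, `two_target_loss`, and `structure_weight` are hypothetical and are not taken from the paper.

```python
from math import hypot


def mean_point_error(pred, target):
    """Mean Euclidean distance between matching (x, y) points."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of points")
    total = sum(hypot(px - tx, py - ty)
                for (px, py), (tx, ty) in zip(pred, target))
    return total / len(pred)


def two_target_loss(pred_structure, gt_structure,
                    pred_landmarks, gt_landmarks,
                    structure_weight=1.0):
    """Combine the two supervision signals into one scalar.

    Assumption: a plain weighted sum. The structure branch is supervised
    by five points, the landmark branch by the full landmark set.
    """
    structure_term = mean_point_error(pred_structure, gt_structure)
    landmark_term = mean_point_error(pred_landmarks, gt_landmarks)
    return landmark_term + structure_weight * structure_term


# Toy data: 5 structure points and 96 landmark points.
gt_structure = [(float(i), float(i)) for i in range(5)]
gt_landmarks = [(float(i), 0.0) for i in range(96)]

# Structure prediction is off by (3, 4) everywhere; landmarks are exact.
pred_structure = [(x + 3.0, y + 4.0) for x, y in gt_structure]
pred_landmarks = list(gt_landmarks)

loss = two_target_loss(pred_structure, gt_structure,
                       pred_landmarks, gt_landmarks)
```

In this toy case only the structure term contributes to the loss, which shows how an error in the coarse five-point structure is penalized even when the dense landmarks happen to be correct.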
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Qianyu Zhou, Yanxin Wang, Xuliang Li, and Jiquan Ma "FDFST: facial structure constrained landmark detection using visual transformer", Proc. SPIE 12257, 4th International Conference on Information Science, Electrical, and Automation Engineering (ISEAE 2022), 122571O (1 August 2022); https://doi.org/10.1117/12.2640359
KEYWORDS
Transformers

Data modeling

Facial recognition systems

Drug discovery

Visualization

Head

Simulation of CCA and DLA aggregates
