Facial identity recognition using StyleGAN3 inversion and improved tiny YOLOv7 model

Kumar, A; Bhattacharjee, S; Kumar, A; Jayakody, D. N. K

Please use this identifier to cite or link to this item: https://rda.sliit.lk/handle/123456789/4147

Title:	Facial identity recognition using StyleGAN3 inversion and improved tiny YOLOv7 model
Authors:	Kumar, A; Bhattacharjee, S; Kumar, A; Jayakody, D. N. K
Keywords:	Face detection; Facial attribute manipulation; Facial identity recognition; StyleGAN3; Tiny YOLOv7
Issue Date:	17-Mar-2025
Publisher:	www.nature.com
Citation:	Kumar, A., Bhattacharjee, S., Kumar, A. et al. Facial identity recognition using StyleGAN3 inversion and improved tiny YOLOv7 model. Sci Rep 15, 9102 (2025). https://doi.org/10.1038/s41598-025-93096-0
Series/Report no.:	Scientific Reports; volume 15, Article number: 9102 (2025)
Abstract:	Facial identity recognition is a challenging problem in computer vision. Facial identity comprises the attributes of a person’s face, such as age progression, gender, and hairstyle. Manipulating facial attributes, for example by changing gender, hairstyle, expression, or makeup, alters a person’s entire facial identity, and offenders often exploit this to commit crimes. Leveraging deep learning-based approaches, this work proposes a one-step solution for facial attribute manipulation and detection, leading to facial identity recognition in few-shot and traditional scenarios. As a first step towards facial identity recognition, we created the Facial Attribute Manipulation Detection (FAM) Dataset, which consists of twenty unique identities with thirty-eight facial attributes generated by StyleGAN3 inversion. The FAM Dataset has 11,560 images richly annotated in YOLO format. To perform facial attribute and identity detection, we developed a Tiny YOLOv7 model based on the Spatial Transformer Block (STB) and Squeeze-Excite Spatial Pyramid Pooling (SE-SPP), and propose it as the FIR-Tiny YOLOv7 (Facial Identity Recognition-Tiny YOLOv7) model. The proposed model is an improved variant of the Tiny YOLOv7 model. For facial identity recognition, the proposed model achieved 10.0% higher mAP in the one-shot scenario, 30.4% higher mAP in the three-shot scenario, 15.3% higher mAP in the five-shot scenario, and 0.1% higher mAP in the traditional 70%-30% split scenario compared with the Tiny YOLOv7 model. The results obtained with the proposed model are promising for general facial identity recognition under varying facial attribute manipulation.
URI:	https://rda.sliit.lk/handle/123456789/4147
ISSN:	2045-2322
Appears in Collections:	Department of Electrical and Electronic Engineering

Files in This Item:

File	Description	Size	Format
s41598-025-93096-0.pdf		3.92 MB	Adobe PDF
