Please use this identifier to cite or link to this item: https://rda.sliit.lk/handle/123456789/4147
Title: Facial identity recognition using StyleGAN3 inversion and improved tiny YOLOv7 model
Authors: Kumar, A
Bhattacharjee, S
Kumar, A
Jayakody, D. N. K
Keywords: Face detection
Facial attribute manipulation
Facial identity recognition
StyleGAN3
Tiny YOLOv7
Issue Date: 17-Mar-2025
Publisher: www.nature.com
Citation: Kumar, A., Bhattacharjee, S., Kumar, A. et al. Facial identity recognition using StyleGAN3 inversion and improved tiny YOLOv7 model. Sci Rep 15, 9102 (2025). https://doi.org/10.1038/s41598-025-93096-0
Series/Report no.: Scientific Reports; volume 15, Article number: 9102 (2025)
Abstract: Facial identity recognition is a challenging problem in computer vision. Facial identity comprises the attributes of a person’s face, such as age progression, gender, and hairstyle. Manipulating facial attributes, for example by changing gender, hairstyle, expression, or makeup, alters a person’s entire facial identity, and offenders often exploit this to commit crimes. Leveraging deep learning-based approaches, this work proposes a one-step solution for facial attribute manipulation and detection, leading to facial identity recognition in few-shot and traditional scenarios. As a first step towards facial identity recognition, we created the Facial Attribute Manipulation Detection (FAM) Dataset, which consists of twenty unique identities with thirty-eight facial attributes generated by StyleGAN3 inversion. The FAM Dataset has 11,560 images richly annotated in YOLO format. To perform facial attribute and identity detection, we developed a Tiny YOLOv7 model based on the Spatial Transformer Block (STB) and Squeeze-Excite Spatial Pyramid Pooling (SE-SPP), which we propose as the FIR-Tiny YOLOv7 (Facial Identity Recognition-Tiny YOLOv7) model. The proposed model is an improved variant of the Tiny YOLOv7 model. For facial identity recognition, the proposed model achieved 10.0% higher mAP in the one-shot scenario, 30.4% higher mAP in the three-shot scenario, 15.3% higher mAP in the five-shot scenario, and 0.1% higher mAP in the traditional 70%-30% split scenario compared to the Tiny YOLOv7 model. The results obtained with the proposed model are promising for general facial identity recognition under varying facial attribute manipulation.
URI: https://rda.sliit.lk/handle/123456789/4147
ISSN: 2045-2322
Appears in Collections: Department of Electrical and Electronic Engineering

Files in This Item:
File: s41598-025-93096-0.pdf
Size: 3.92 MB
Format: Adobe PDF

