Skip to main content
Kent Academic Repository

Ear-based Person Recognition using Pix2Pix GAN Augmentation

Alomari, Eman, Yang, Su, Hoque, Sanaul, Deravi, Farzin (2024) Ear-based Person Recognition using Pix2Pix GAN Augmentation. In: 2024 International Conference of the Biometrics Special Interest Group (BIOSIG). . IEEE (doi:10.1109/BIOSIG61931.2024.10786744) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:108210)

The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided. (Contact us about this Publication)
Official URL:
https://doi.org/10.1109/BIOSIG61931.2024.10786744

Abstract

This study presents a robust framework that leverages advanced deep-learning techniques for ear-based human recognition. Faced with the challenge of dataset sizes, our approach is developed based on a generative adversarial network (GAN) method namely Pix2Pix to augment the dataset. It is demonstrated that this approach offers the ability to produce complementary images for ear recognition. To be more specific, Pix2Pix GAN is employed to generate missing sides in ear image pairs (i.e., creating corresponding left ear images for right ear images and vice versa). As such, this augmentation could substantially increase the dataset size, making it more diverse and of significantly greater use for training purposes. The employed dataset consisted of several images of the right ear and only one left ear for each individual. A series of corresponding synthetic left-ear images is generated using Pix2Pix GAN as a tool for augmenting the available data and mitigate the dataset’s lack of left ear images. The experiment framework used the EarNet model and conducted comparative evaluations before and after Pix2Pix GAN augmentation using the AMI Ear dataset. By employing the Pix2Pix GAN, the proposed approach can effectively double the size of a dataset and, in the process, provide significantly greater utility regarding how that data can be utilised in real-world applications scenarios. The resulting accuracy reaches 98% on the AMI dataset, demonstrating that this technique can improve model performance for ear-based human recognition.

Item Type: Conference or workshop item (Paper)
DOI/Identification number: 10.1109/BIOSIG61931.2024.10786744
Uncontrolled keywords: deep learning, generative adversarial networks (GAN), ear biometrics, data augmentation
Subjects: Q Science > Q Science (General) > Q335 Artificial intelligence
T Technology > TK Electrical engineering. Electronics. Nuclear engineering > TK7800 Electronics > TK7880 Applications of electronics > TK7882.B56 Biometric identification
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Engineering and Digital Arts
Depositing User: Sanaul Hoque
Date Deposited: 18 Dec 2024 13:08 UTC
Last Modified: 18 Dec 2024 13:55 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/108210 (The current URI for this page, for reference purposes)

University of Kent Author Information

  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.