Ear-based Person Recognition using Pix2Pix GAN Augmentation

Alomari, Eman, Yang, Su, Hoque, Sanaul, Deravi, Farzin (2024) Ear-based Person Recognition using Pix2Pix GAN Augmentation. In: 2024 International Conference of the Biometrics Special Interest Group (BIOSIG). pp. 1-6. IEEE. E-ISBN 979-8-3503-7371-4. (doi:10.1109/BIOSIG61931.2024.10786744) (KAR id:108210)

PDF: Author's Accepted Manuscript. Language: English
Official URL: https://doi.org/10.1109/BIOSIG61931.2024.10786744

Abstract

This study presents a robust framework that leverages advanced deep-learning techniques for ear-based human recognition. To address the challenge of limited dataset sizes, our approach uses a generative adversarial network (GAN) method, namely Pix2Pix, to augment the dataset. It is demonstrated that this approach can produce complementary images for ear recognition. Specifically, Pix2Pix GAN is employed to generate the missing side in ear image pairs (i.e., creating corresponding left ear images for right ear images and vice versa). This augmentation can substantially increase the dataset size, making it more diverse and more useful for training. The employed dataset consisted of several images of the right ear and only one image of the left ear for each individual. A series of corresponding synthetic left-ear images is generated using Pix2Pix GAN to augment the available data and mitigate the dataset’s lack of left ear images. The experimental framework used the EarNet model and compared performance before and after Pix2Pix GAN augmentation on the AMI Ear dataset. By employing the Pix2Pix GAN, the proposed approach can effectively double the size of a dataset and, in the process, make that data considerably more useful in real-world application scenarios. The resulting accuracy reaches 98% on the AMI dataset, demonstrating that this technique can improve model performance for ear-based human recognition.
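
The augmentation step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `EarSample`, `translate_right_to_left`, and `augment_with_synthetic_left` are hypothetical, images are assumed to be small grayscale grids, and the generator is replaced by a plain horizontal flip so that the example runs without a trained Pix2Pix model.

```python
from dataclasses import dataclass
from typing import Callable, List

# Assumption: a grayscale image is a list of pixel rows.
Image = List[List[int]]


@dataclass(frozen=True)
class EarSample:
    subject_id: str
    side: str          # "right" or "left"
    pixels: Image
    synthetic: bool = False


def translate_right_to_left(pixels: Image) -> Image:
    """Hypothetical stand-in for a trained Pix2Pix generator.

    A horizontal flip is used only so the sketch is runnable; it is NOT
    the learned image-to-image translation described in the abstract.
    """
    return [list(reversed(row)) for row in pixels]


def augment_with_synthetic_left(
    samples: List[EarSample],
    generator: Callable[[Image], Image] = translate_right_to_left,
) -> List[EarSample]:
    """Return the original samples plus one synthetic left ear per right ear."""
    augmented = list(samples)
    for sample in samples:
        if sample.side == "right":
            augmented.append(
                EarSample(
                    subject_id=sample.subject_id,
                    side="left",
                    pixels=generator(sample.pixels),
                    synthetic=True,
                )
            )
    return augmented


if __name__ == "__main__":
    right_ears = [
        EarSample("subject_001", "right", [[0, 1, 2], [3, 4, 5]]),
        EarSample("subject_002", "right", [[9, 8, 7], [6, 5, 4]]),
    ]
    augmented = augment_with_synthetic_left(right_ears)
    # A set made up only of right-ear images doubles in size.
    print(len(right_ears), len(augmented))
```

In practice the `generator` argument would be replaced by a call into a trained image-to-image model; keeping it as a plain callable lets the bookkeeping (subject identity, side, synthetic flag) be checked independently of the model.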

Item Type:	Conference or workshop item (Paper)
DOI/Identification number:	10.1109/BIOSIG61931.2024.10786744
Additional information:	© 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Uncontrolled keywords:	deep learning, generative adversarial networks (GAN), ear biometrics, data augmentation
Subjects:	Q Science > Q Science (General) > Q335 Artificial intelligence; T Technology > TK Electrical engineering. Electronics. Nuclear engineering > TK7800 Electronics > TK7880 Applications of electronics > TK7882.B56 Biometric identification
Divisions:	Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Engineering and Digital Arts
Funders:	University of Kent (https://ror.org/00xkeyj56); Swansea University (https://ror.org/053fq8t95)
Depositing User:	Sanaul Hoque
Date Deposited:	18 Dec 2024 13:08 UTC
Last Modified:	17 Feb 2025 10:02 UTC
Resource URI:	https://kar.kent.ac.uk/id/eprint/108210 (The current URI for this page, for reference purposes)

University of Kent Author Information

Hoque, Sanaul.

Creator's ORCID:	https://orcid.org/0000-0001-8627-3429

Deravi, Farzin.

Creator's ORCID:	https://orcid.org/0000-0003-0885-437X