Alomari, Eman, Yang, Su, Hoque, Sanaul, Deravi, Farzin (2024) Ear-based Person Recognition using Pix2Pix GAN Augmentation. In: 2024 International Conference of the Biometrics Special Interest Group (BIOSIG). . IEEE (doi:10.1109/BIOSIG61931.2024.10786744) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:108210)
The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided. (Contact us about this Publication) | |
Official URL: https://doi.org/10.1109/BIOSIG61931.2024.10786744 |
Abstract
This study presents a robust framework that leverages advanced deep-learning techniques for ear-based human recognition. Faced with the challenge of dataset sizes, our approach is developed based on a generative adversarial network (GAN) method namely Pix2Pix to augment the dataset. It is demonstrated that this approach offers the ability to produce complementary images for ear recognition. To be more specific, Pix2Pix GAN is employed to generate missing sides in ear image pairs (i.e., creating corresponding left ear images for right ear images and vice versa). As such, this augmentation could substantially increase the dataset size, making it more diverse and of significantly greater use for training purposes. The employed dataset consisted of several images of the right ear and only one left ear for each individual. A series of corresponding synthetic left-ear images is generated using Pix2Pix GAN as a tool for augmenting the available data and mitigate the dataset’s lack of left ear images. The experiment framework used the EarNet model and conducted comparative evaluations before and after Pix2Pix GAN augmentation using the AMI Ear dataset. By employing the Pix2Pix GAN, the proposed approach can effectively double the size of a dataset and, in the process, provide significantly greater utility regarding how that data can be utilised in real-world applications scenarios. The resulting accuracy reaches 98% on the AMI dataset, demonstrating that this technique can improve model performance for ear-based human recognition.
Item Type: | Conference or workshop item (Paper) |
---|---|
DOI/Identification number: | 10.1109/BIOSIG61931.2024.10786744 |
Uncontrolled keywords: | deep learning, generative adversarial networks (GAN), ear biometrics, data augmentation |
Subjects: |
Q Science > Q Science (General) > Q335 Artificial intelligence T Technology > TK Electrical engineering. Electronics. Nuclear engineering > TK7800 Electronics > TK7880 Applications of electronics > TK7882.B56 Biometric identification |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Engineering and Digital Arts |
Depositing User: | Sanaul Hoque |
Date Deposited: | 18 Dec 2024 13:08 UTC |
Last Modified: | 18 Dec 2024 13:55 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/108210 (The current URI for this page, for reference purposes) |
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):