Alomari, Eman, Yang, Su, Hoque, Sanaul, Deravi, Farzin (2024) Ear-based Person Recognition using Pix2Pix GAN Augmentation. In: 2024 International Conference of the Biometrics Special Interest Group (BIOSIG). . pp. 1-6. IEEE E-ISBN 979-8-3503-7371-4. (doi:10.1109/BIOSIG61931.2024.10786744) (KAR id:108210)
PDF
Author's Accepted Manuscript
Language: English |
|
Download this file (PDF/349kB) |
Preview |
Request a format suitable for use with assistive technology e.g. a screenreader | |
Official URL: https://doi.org/10.1109/BIOSIG61931.2024.10786744 |
Abstract
This study presents a robust framework that leverages advanced deep-learning techniques for ear-based human recognition. Faced with the challenge of dataset sizes, our approach is developed based on a generative adversarial network (GAN) method namely Pix2Pix to augment the dataset. It is demonstrated that this approach offers the ability to produce complementary images for ear recognition. To be more specific, Pix2Pix GAN is employed to generate missing sides in ear image pairs (i.e., creating corresponding left ear images for right ear images and vice versa). As such, this augmentation could substantially increase the dataset size, making it more diverse and of significantly greater use for training purposes. The employed dataset consisted of several images of the right ear and only one left ear for each individual. A series of corresponding synthetic left-ear images is generated using Pix2Pix GAN as a tool for augmenting the available data and mitigate the dataset’s lack of left ear images. The experiment framework used the EarNet model and conducted comparative evaluations before and after Pix2Pix GAN augmentation using the AMI Ear dataset. By employing the Pix2Pix GAN, the proposed approach can effectively double the size of a dataset and, in the process, provide significantly greater utility regarding how that data can be utilised in real-world applications scenarios. The resulting accuracy reaches 98% on the AMI dataset, demonstrating that this technique can improve model performance for ear-based human recognition.
Item Type: | Conference or workshop item (Paper) |
---|---|
DOI/Identification number: | 10.1109/BIOSIG61931.2024.10786744 |
Additional information: | © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Uncontrolled keywords: | deep learning, generative adversarial networks (GAN), ear biometrics, data augmentation |
Subjects: |
Q Science > Q Science (General) > Q335 Artificial intelligence T Technology > TK Electrical engineering. Electronics. Nuclear engineering > TK7800 Electronics > TK7880 Applications of electronics > TK7882.B56 Biometric identification |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Engineering and Digital Arts |
Funders: |
University of Kent (https://ror.org/00xkeyj56)
Swansea University (https://ror.org/053fq8t95) |
Depositing User: | Sanaul Hoque |
Date Deposited: | 18 Dec 2024 13:08 UTC |
Last Modified: | 17 Feb 2025 10:02 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/108210 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):