Optimisation of Deep Learning Small-Object Detectors with Novel Explainable Verification

Mohamed, Elhassan, Sirlantzis, Konstantinos, Howells, Gareth, Hoque, Sanaul (2022) Optimisation of Deep Learning Small-Object Detectors with Novel Explainable Verification. Sensors, 22 . pp. 1-23. E-ISSN 1424-8220. (doi:10.3390/s22155596) (KAR id:95957)

PDF Publisher pdf Language: English This work is licensed under a Creative Commons Attribution 4.0 International License.
Download this file (PDF/13MB)
Request a format suitable for use with assistive technology e.g. a screenreader
Official URL: https://doi.org/10.3390/s22155596
Additional URLs: Publisher

Abstract

In this paper, we present a novel methodology based on machine learning for identifying the most appropriate from a set of available state-of-the-art object detectors for a given application. Our particular interest is to develop a road map for identifying verifiably optimal selections, especially for challenging applications such as detecting small objects in a mixed-size object dataset. State-of�the-art object detection systems often find the localisation of small-size objects challenging since most are usually trained on large-size objects. These contain abundant information as they occupy a large number of pixels relative to the total image size. This fact is normally exploited by the model during training and inference processes. To dissect and understand this process, our approach systematically examines detectors’ performances using two very distinct deep convolutional networks. The first is the single-stage YOLO V3 and the second is the double-stage Faster R-CNN. Specifically, our proposed method explores and visually illustrates the impact of feature extraction layers, number of anchor boxes, data augmentation, etc., utilising ideas from the field of explainable Artificial Intelligence (XAI). Our results, for example, show that multi-head YOLO V3 detectors trained using augmented data produce better performance even with a fewer number of anchor boxes. Moreover, robustness regarding the detector’s ability to explain how a specific decision was reached is investigated using different explanation techniques. Finally, two new visualisation techniques are proposed, WS-Grad and Concat-Grad, for identifying explanation cues of different detectors. These are applied to specific object detection tasks to illustrate their reliability and transparency with respect to the decision process. It is shown that the proposed techniques can result in high resolution and comprehensive heatmaps of the image areas, significantly affecting detector decisions as compared to the state-of-the-art techniques tested.

Item Type:	Article
DOI/Identification number:	10.3390/s22155596
Additional information:	For the purpose of open access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Uncontrolled keywords:	convolutional neural network; explainable artificial intelligence; small object detection
Subjects:	Q Science > Q Science (General) > Q335 Artificial intelligence Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming, > QA76.87 Neural computers, neural networks T Technology > TA Engineering (General). Civil engineering (General) > TA1637 Image processing T Technology > TK Electrical engineering. Electronics. Nuclear engineering > TK7800 Electronics > TK7880 Applications of electronics > TK7882.P3 Pattern recognition systems
Institutional Unit:	Schools > School of Computing Schools > School of Engineering, Mathematics and Physics > Engineering
Former Institutional Unit:	Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Engineering and Digital Arts
Funders:	Engineering and Physical Sciences Research Council (https://ror.org/0439y7842)
Depositing User:	Sanaul Hoque
Date Deposited:	03 Aug 2022 12:20 UTC
Last Modified:	22 Jul 2025 09:10 UTC
Resource URI:	https://kar.kent.ac.uk/id/eprint/95957 (The current URI for this page, for reference purposes)

University of Kent Author Information

Mohamed, Elhassan.

Creator's ORCID:	https://orcid.org/0000-0001-9746-1564
CReDIT Contributor Roles:

Sirlantzis, Konstantinos.

Creator's ORCID:	https://orcid.org/0000-0002-0847-8880
CReDIT Contributor Roles:

Howells, Gareth.

Creator's ORCID:	https://orcid.org/0000-0001-5590-0880
CReDIT Contributor Roles:

Hoque, Sanaul.

Creator's ORCID:	https://orcid.org/0000-0001-8627-3429
CReDIT Contributor Roles:

Depositors only (login required):

Altmetric

Total Views

Total unique views of this page since July 2020. For more details click on the image.