Long, Ma, Cong, Shu, Shanshan, Huang, Zoujian, Wei, Xuhao, Wang, Yanxi, Wei (2023) SDDNet: Infrared small and dim target detection network. CAAI Transactions on Intelligence Technology, 8 (4). pp. 1226-1236. ISSN 2468-2322. (doi:10.1049/cit2.12165) (KAR id:104688)
PDF
Publisher pdf
Language: English
This work is licensed under a Creative Commons Attribution 4.0 International License.
|
|
Download this file (PDF/1MB) |
Preview |
Request a format suitable for use with assistive technology e.g. a screenreader | |
Official URL: https://doi.org/10.1049/cit2.12165 |
Abstract
This study focuses on developing deep learning methods for small and dim target detection. We model infrared images as the union of the target region and background region. Based on this model, the target detection problem is considered a two‐class segmentation problem that divides an image into the target and background. Therefore, a neural network called SDDNet for single‐frame images is constructed. The network yields target extraction results according to the original images. For multiframe images, a network called IC‐SDDNet, a combination of SDDNet and an interframe correlation network module is constructed. SDDNet and IC‐SDDNet achieve target detection rates close to 1 on typical datasets with very low false positives, thereby performing significantly better than current methods. Both models can be executed end to end, so both are very convenient to use, and their implementation efficiency is very high. Average speeds of 540+/230+ FPS and 170+/60+ FPS are achieved with SDDNet and IC‐SDDNet on a single Tesla V100 graphics processing unit and a single Jetson TX2 embedded module respectively. Additionally, neither network needs to use future information, so both networks can be directly used in real‐time systems. The well‐trained models and codes used in this study are available at https://github.com/LittlePieces/ObjectDetection.
Item Type: | Article |
---|---|
DOI/Identification number: | 10.1049/cit2.12165 |
Uncontrolled keywords: | artificial intelligence; computer networks and communications; computer vision and pattern recognition; human-computer interaction; information systems; deep learning; detection of moving objects |
Subjects: | Q Science |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
SWORD Depositor: | JISC Publications Router |
Depositing User: | JISC Publications Router |
Date Deposited: | 22 Jan 2024 15:50 UTC |
Last Modified: | 23 Jan 2024 08:55 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/104688 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):