Han, Zihao, De Wilde, Philippe, Santopietro, Marco (2025) Domain-Aligned OCT Pre-training: Enhancing Retinal Disease Diagnosis Through Cross-Anatomy Vision Transformers. In: Lecture Notes in Artificial Intelligence. Artificial Intelligence in Healthcare Second International Conference, AIiH 2025, Cambridge, UK, September 8–10, 2025, Proceedings, Part II. Lecture Notes in Computer Science . pp. 299-312. Springer Nature (doi:10.1007/978-3-032-00656-1_22) (KAR id:112898)
|
PDF
Author's Accepted Manuscript
Language: English |
|
|
Download this file (PDF/2MB) |
Preview |
| Request a format suitable for use with assistive technology e.g. a screenreader | |
| Official URL: https://doi-org.chain.kent.ac.uk/10.1007/978-3-032... |
|
Abstract
Medical imaging often suffers from limited labelled data and substantial domain gaps when transferring models pre-trained on general-purpose benchmarks such as ImageNet. This study systematically compares three training strategies for Vision Transformers (ViTs) on a four-class retinal Optical Coherence Tomography (OCT) dataset(CNV, DME, Drusen, Normal): (1) training from scratch, (2) conventional ImageNet-based pre-training, and (3) a novel domain-specific pre-training method using OCT breast cancer images (adipose tissue vs. cancer). Experimental results clearly show that the domain-specific OCT breast pre-training significantly improves classification accuracy compared to both ImageNet pre-training and training from scratch, particularly under limited-data scenarios. These findings challenge the prevailing view that general-domain pre-training has limited utility in medical imaging, instead emphasizing the essential role of domain alignment in pre-training datasets. Our results highlight the critical advantage of domain-specific pre-training in medical imaging AI, demonstrating improved accuracy and potential for earlier retinal disease detection even with scarce labelled data. Future research should focus on constructing larger OCT-specific pre-training datasets and exploring advanced self-supervised methods tailored explicitly for medical imaging tasks.
| Item Type: | Conference or workshop item (Paper) |
|---|---|
| DOI/Identification number: | 10.1007/978-3-032-00656-1_22 |
| Uncontrolled keywords: | Object vision; Ophthalmology; Pattern vision; Retina; Predictive medicine; Retinal diseases |
| Institutional Unit: | Schools > School of Natural Sciences > Biosciences |
| Former Institutional Unit: |
There are no former institutional units.
|
| Funders: | University of Kent (https://ror.org/00xkeyj56) |
| Depositing User: | Philippe De Wilde |
| Date Deposited: | 28 Jan 2026 14:22 UTC |
| Last Modified: | 04 Feb 2026 03:48 UTC |
| Resource URI: | https://kar.kent.ac.uk/id/eprint/112898 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):

https://orcid.org/0000-0002-4332-1715
Altmetric
Altmetric