Skip to main content
Kent Academic Repository

Domain-Aligned OCT Pre-training: Enhancing Retinal Disease Diagnosis Through Cross-Anatomy Vision Transformers

Han, Zihao, De Wilde, Philippe, Santopietro, Marco (2025) Domain-Aligned OCT Pre-training: Enhancing Retinal Disease Diagnosis Through Cross-Anatomy Vision Transformers. In: Lecture Notes in Artificial Intelligence. Artificial Intelligence in Healthcare Second International Conference, AIiH 2025, Cambridge, UK, September 8–10, 2025, Proceedings, Part II. Lecture Notes in Computer Science . pp. 299-312. Springer Nature (doi:10.1007/978-3-032-00656-1_22) (KAR id:112898)

Abstract

Medical imaging often suffers from limited labelled data and substantial domain gaps when transferring models pre-trained on general-purpose benchmarks such as ImageNet. This study systematically compares three training strategies for Vision Transformers (ViTs) on a four-class retinal Optical Coherence Tomography (OCT) dataset(CNV, DME, Drusen, Normal): (1) training from scratch, (2) conventional ImageNet-based pre-training, and (3) a novel domain-specific pre-training method using OCT breast cancer images (adipose tissue vs. cancer). Experimental results clearly show that the domain-specific OCT breast pre-training significantly improves classification accuracy compared to both ImageNet pre-training and training from scratch, particularly under limited-data scenarios. These findings challenge the prevailing view that general-domain pre-training has limited utility in medical imaging, instead emphasizing the essential role of domain alignment in pre-training datasets. Our results highlight the critical advantage of domain-specific pre-training in medical imaging AI, demonstrating improved accuracy and potential for earlier retinal disease detection even with scarce labelled data. Future research should focus on constructing larger OCT-specific pre-training datasets and exploring advanced self-supervised methods tailored explicitly for medical imaging tasks.

Item Type: Conference or workshop item (Paper)
DOI/Identification number: 10.1007/978-3-032-00656-1_22
Uncontrolled keywords: Object vision; Ophthalmology; Pattern vision; Retina; Predictive medicine; Retinal diseases
Institutional Unit: Schools > School of Natural Sciences > Biosciences
Former Institutional Unit:
There are no former institutional units.
Funders: University of Kent (https://ror.org/00xkeyj56)
Depositing User: Philippe De Wilde
Date Deposited: 28 Jan 2026 14:22 UTC
Last Modified: 04 Feb 2026 03:48 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/112898 (The current URI for this page, for reference purposes)

University of Kent Author Information

Han, Zihao.

Creator's ORCID:
CReDIT Contributor Roles:

De Wilde, Philippe.

Creator's ORCID: https://orcid.org/0000-0002-4332-1715
CReDIT Contributor Roles:

Santopietro, Marco.

Creator's ORCID:
CReDIT Contributor Roles:
  • Depositors only (login required):

Total unique views of this page since July 2020. For more details click on the image.