Domain-Aligned OCT Pre-training: Enhancing Retinal Disease Diagnosis Through Cross-Anatomy Vision Transformers

Han, Zihao, De Wilde, Philippe, Santopietro, Marco (2025) Domain-Aligned OCT Pre-training: Enhancing Retinal Disease Diagnosis Through Cross-Anatomy Vision Transformers. In: Artificial Intelligence in Healthcare Second International Conference, AIiH 2025, Cambridge, UK, September 8–10, 2025, Proceedings, Part II. Lecture Notes in Computer Science Springer Nature (doi:10.1007/978-3-032-00656-1_22) (KAR id:112898)

PDF Author's Accepted Manuscript Language: English
Download this file (PDF/2MB)	Preview
Request a format suitable for use with assistive technology e.g. a screenreader
Official URL: https://doi-org.chain.kent.ac.uk/10.1007/978-3-032...

Abstract

Medical imaging often suffers from limited labelled data and substantial domain gaps when transferring models pre-trained on general-purpose benchmarks such as ImageNet. This study systematically compares three training strategies for Vision Transformers (ViTs) on a four-class retinal Optical Coherence Tomography (OCT) dataset(CNV, DME, Drusen, Normal): (1) training from scratch, (2) conventional ImageNet-based pre-training, and (3) a novel domain-specific pre-training method using OCT breast cancer images (adipose tissue vs. cancer). Experimental results clearly show that the domain-specific OCT breast pre-training significantly improves classification accuracy compared to both ImageNet pre-training and training from scratch, particularly under limited-data scenarios. These findings challenge the prevailing view that general-domain pre-training has limited utility in medical imaging, instead emphasizing the essential role of domain alignment in pre-training datasets. Our results highlight the critical advantage of domain-specific pre-training in medical imaging AI, demonstrating improved accuracy and potential for earlier retinal disease detection even with scarce labelled data. Future research should focus on constructing larger OCT-specific pre-training datasets and exploring advanced self-supervised methods tailored explicitly for medical imaging tasks.

Item Type:	Conference proceeding
DOI/Identification number:	10.1007/978-3-032-00656-1_22
Uncontrolled keywords:	Object vision; Ophthalmology; Pattern vision; Retina; Predictive medicine; Retinal diseases
Institutional Unit:	Schools > School of Natural Sciences > Biosciences
Former Institutional Unit:	There are no former institutional units.
Funders:	University of Kent (https://ror.org/00xkeyj56)
Depositing User:	Philippe De Wilde
Date Deposited:	28 Jan 2026 14:22 UTC
Last Modified:	04 Feb 2026 03:48 UTC
Resource URI:	https://kar.kent.ac.uk/id/eprint/112898 (The current URI for this page, for reference purposes)

University of Kent Author Information

Han, Zihao.

Creator's ORCID:
CReDIT Contributor Roles:

De Wilde, Philippe.

Creator's ORCID:	https://orcid.org/0000-0002-4332-1715
CReDIT Contributor Roles:

Santopietro, Marco.

Creator's ORCID:
CReDIT Contributor Roles:

Depositors only (login required):

Altmetric

Total Views

Total unique views of this page since July 2020. For more details click on the image.