Skip to main content
Kent Academic Repository

Cone-beam CT to synthetic CT translation using conditional 3D latent diffusion-based model

Al-Shalabi, Mohammed, Mahdi, Mohammed A., Elbarougy, Reda, Alnfrawy, Ehab Tawfeek, Hadi, Muhammad Usman, Ali, Rao Faizan (2026) Cone-beam CT to synthetic CT translation using conditional 3D latent diffusion-based model. IEEE Access, 14 . pp. 12680-12693. E-ISSN 2169-3536. (doi:10.1109/access.2026.3653234) (KAR id:112813)

Abstract

Accurate cone-beam CT (CBCT)-to-synthetic CT (sCT) translation is essential for image-guided adaptive radiotherapy (IGART), where Hounsfield unit (HU) fidelity and structural accuracy directly affect dose calculation. We propose a conditional 3D Latent Diffusion Model (3DLDFM) for volumetric CBCT-to-sCT synthesis. The framework comprises two stages: 1) a 3D variational autoencoder with KL regularization that compresses CBCT volumes into a three-channel latent representation, trained with a composite loss combining L1 reconstruction, perceptual, KL, and adversarial terms; and 2) a conditional 3D U-Net diffusion model that performs iterative denoising in latent space using a DDPM-style noise schedule, conditioned on the input CBCT. We evaluated 3DLDFM on the multi-center SynthRAD2023 dataset comprising 955 paired CBCT/CT volumes spanning head-and-neck, thorax, and abdominal sites. Performance is benchmarked against SwinUNETR, nnUNet, CycleGAN, and Pix2Pix using Mean Absolute Error (MAE), Peak Signal-to-Noise Ratio (PSNR), and Structural Similarity Index Measure (SSIM) within body masks. Across all regions, 3DLDFM achieves the lowest overall MAE (51.40 HU) and the highest overall SSIM (0.9124), while maintaining competitive PSNR (30.60 dB), surpassing all baselines in HU accuracy and structural fidelity. These results demonstrate that the proposed latent diffusion framework provides a robust and generalizable solution for CBCT-to-CT synthesis and strengthens the feasibility of simulation-free adaptive radiotherapy workflows.

Item Type: Article
DOI/Identification number: 10.1109/access.2026.3653234
Uncontrolled keywords: Cone-beam CT; synthetic CT; latent diffusion model; image-guided adaptive radiotherapy; image-to-image translation; deep learning
Subjects: Q Science > QA Mathematics (inc Computing science)
Institutional Unit: Schools > School of Computing
Former Institutional Unit:
There are no former institutional units.
SWORD Depositor: JISC Publications Router
Depositing User: JISC Publications Router
Date Deposited: 29 Jan 2026 10:51 UTC
Last Modified: 30 Jan 2026 10:38 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/112813 (The current URI for this page, for reference purposes)

University of Kent Author Information

  • Depositors only (login required):

Total unique views of this page since July 2020. For more details click on the image.