Skip to main content
Kent Academic Repository

A multimodal feature fusion and large language model approach for the combustion stability diagnosis of 660 MWth coal-fired boilers

Pu, Sixu, Zhou, Yi, Hossain, Md. Moinul, Zhu, Xiaoyu, Chen, Guoqing, Xu, Chuanlong (2026) A multimodal feature fusion and large language model approach for the combustion stability diagnosis of 660 MWth coal-fired boilers. Energy and AI, 24 . Article Number 100744. E-ISSN 2666-5468. (doi:10.1016/j.egyai.2026.100744) (KAR id:113698)

Abstract

Under low-load flexible operation conditions, where coal-fired boilers often need to respond quickly to deep load changes, the combustion state of boiler burners is vital for furnace efficiency and safety. Although image-based flame detection offers higher monitoring accuracy than traditional methods, it cannot provide quantitative evaluations and clear diagnoses of flame stability. To address these issues, this study introduces a large-language-based approach utilizing a pre-trained Llama 3.2-3B model with lightweight fine-tuning for flame stability diagnosis. Temporal features from operational data and spatial features from flame videos are extracted using a Long Short-Term Memory network and a Vision Transformers advanced by exploring intrinsic inductive bias model. A custom multimodal fusion network then blends these complementary features to create a unified representation of combustion characteristics. Expert annotations are integrated during fine-tuning to improve understanding and diagnostic reasoning specific to combustion. Additionally, a multimodal feature database of stable conditions is built to enable quantitative evaluation of flame stability. Tests conducted on a 660MW opposed-fired boiler demonstrate the effectiveness of the proposed model. The results show the model's ability to distinguish between stable and unstable flame states across different operational loads. It accurately detects transitions between safe and unsafe combustion states and offers interpretable recommendations for adjustments. This provides a practical pathway for the safe, efficient, and intelligent operation of coal-fired boilers under variable load conditions.

Item Type: Article
DOI/Identification number: 10.1016/j.egyai.2026.100744
Uncontrolled keywords: coal-fired power plant; burner; combustion stability; flame imaging; multimodal feature; large language model
Subjects: Q Science > Q Science (General)
Institutional Unit: Schools > School of Engineering, Mathematics and Physics > Engineering
Former Institutional Unit:
There are no former institutional units.
Depositing User: Moinul Hossain
Date Deposited: 07 Apr 2026 07:06 UTC
Last Modified: 10 Apr 2026 10:04 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/113698 (The current URI for this page, for reference purposes)

University of Kent Author Information

  • Depositors only (login required):

Total unique views of this page since July 2020. For more details click on the image.