Skip to main content
Kent Academic Repository

Knowledge Retention for Generic Reinforcement Learning Policies in Autonomous Cyber Defence

Sylvester, Joshua, de Lemos, Rogério (2025) Knowledge Retention for Generic Reinforcement Learning Policies in Autonomous Cyber Defence. In: 2nd International Workshop on Autonomous Cybersecurity (AutonomousCyber 2025), 22-26 September 2026, Toulouse, France. (In press) (KAR id:111722)

Abstract

Within autonomous cyber defence, building scalable agents that can generalise across attack behaviours is crucial to developing a truly autonomous system. These generic agents are pivotal, as over time, attackers will inevitably change their behaviour, requiring the defence mechanisms to adapt accordingly. Current approaches for generic agents use deep reinforcement learning policies to learn multiple attack behaviours and mitigate them. When a new attack behaviour is introduced, the generic policy is retrained to incorporate this behaviour and not forget previous attack behaviours. In this paper, we propose a novel solution based on a modified version of the Proximal Policy Optimisation (PPO) reinforcement learning algorithm that retains previously acquired knowledge, enabling a scalable and generic framework in which new attack behaviours can be incorporated modularly. The modified PPO algorithm demonstrates a 22.11% performance improvement compared to standard PPO when trained to sequentially learn two distinct attack behaviours. These results show a step towards building more scalable autonomous cyber defence systems capable of incorporating evolving cyber threats.

Item Type: Conference or workshop item (Paper)
Uncontrolled keywords: Reinfocement Learning
Subjects: Q Science > Q Science (General) > Q335 Artificial intelligence
Q Science > QA Mathematics (inc Computing science)
Institutional Unit: Schools > School of Computing
Institutes > Institute of Cyber Security for Society
Former Institutional Unit:
There are no former institutional units.
Funders: University of Kent (https://ror.org/00xkeyj56)
Depositing User: Joshua Sylvester
Date Deposited: 21 Oct 2025 11:50 UTC
Last Modified: 22 Oct 2025 02:41 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/111722 (The current URI for this page, for reference purposes)

University of Kent Author Information

Sylvester, Joshua.

Creator's ORCID:
CReDIT Contributor Roles:

de Lemos, Rogério.

Creator's ORCID: https://orcid.org/0000-0002-0281-6308
CReDIT Contributor Roles:
  • Depositors only (login required):

Total unique views of this page since July 2020. For more details click on the image.