Grzes, Marek (2017) Reward Shaping in Episodic Reinforcement Learning. In: Proc. of the 16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2017). . pp. 565-573. ACM (KAR id:60614)
PDF (Publisher PDF)
Publisher pdf
Language: English ![]() This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. |
||
Download (281kB)
Preview
|
|
|
This file may not be suitable for user of assistive technology. Request an accessible format. |
||
Official URL https://dl.acm.org/citation.cfm?id=3091208&CFID=83... |
Abstract
Recent advancements in reinforcement learning confirm that reinforcement learning techniques can solve large scale problems leading to high quality autonomous decision making. It is a matter of time until we will see large scale applications of reinforcement learning in various sectors, such as healthcare and cyber-security, among others. However, reinforcement learning can be time-consuming because the learning algorithms have to determine the long term consequences of their actions using delayed feedback or rewards. Reward shaping is a method of incorporating domain knowledge into reinforcement learning so that the algorithms are guided faster towards more promising solutions. Under an overarching theme of episodic reinforcement learning, this paper shows a unifying analysis of potential-based reward shaping which leads to new theoretical insights into reward shaping in both model-free and model-based algorithms, as well as in multi-agent reinforcement learning.
Item Type: | Conference or workshop item (Paper) |
---|---|
Uncontrolled keywords: | Reward structures for learning; Multiagent learning; Reward shaping; Reinforcement learning |
Subjects: |
Q Science > Q Science (General) > Q335 Artificial intelligence Q Science > QA Mathematics (inc Computing science) > QA273 Probabilities Q Science > QA Mathematics (inc Computing science) > QA 75 Electronic computers. Computer science |
Divisions: |
Faculties > Sciences > School of Computing Faculties > Sciences > School of Computing > Computational Intelligence Group |
Depositing User: | Marek Grzes |
Date Deposited: | 01 Mar 2017 09:22 UTC |
Last Modified: | 06 May 2020 03:15 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/60614 (The current URI for this page, for reference purposes) |
Grzes, Marek: | ![]() |
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):