Safe Reinforcement Learning for Continuous Spaces through Lyapunov-Constrained Behavior

Fjerdingen, Sigrud Aksnes; Kyrkjebø, Erik

dc.contributor.author	Fjerdingen, Sigrud Aksnes
dc.contributor.author	Kyrkjebø, Erik
dc.date.accessioned	2017-02-13T09:43:08Z
dc.date.available	2017-02-13T09:43:08Z
dc.date.created	2012-02-15T14:51:56Z
dc.date.issued	2011
dc.identifier.citation	Frontiers in Artificial Intelligence and Applications. 2011, 70-79.	nb_NO
dc.identifier.issn	0922-6389
dc.identifier.uri	http://hdl.handle.net/11250/2430386
dc.description.abstract	This paper presents a safe learning strategy for continuous state and action spaces by utilizing Lyapunov stability properties of the studied systems. The reinforcement learning algorithm Continous Actor-Critic Learning Automation (CACLA) is combined with the notion of control Lyapunov functions (CLF) to limit the learning and exploration behavior to operate inside the stability region of the system to ensure safe operation at all times. The paper extends previous results for discrete action sets to take advantage of the more general continuous actions sets, and show that the continuous method is able to find a comparable solution to the best discrete action choices while avoiding the need for good heuristic choices in the design process.
dc.language.iso	eng	nb_NO
dc.title	Safe Reinforcement Learning for Continuous Spaces through Lyapunov-Constrained Behavior	nb_NO
dc.type	Journal article	nb_NO
dc.type	Peer reviewed	nb_NO
dc.source.pagenumber	70-79	nb_NO
dc.source.journal	Frontiers in Artificial Intelligence and Applications	nb_NO
dc.identifier.cristin	909648
cristin.unitcode	7401,90,23,0
cristin.unitname	Anvendt kybernetikk
cristin.ispublished	true
cristin.fulltext	postprint
cristin.qualitycode	1

Tilhørende fil(er)

Filnavn:: SINTEF+S19504.pdf
Størrelse:: 194.8Kb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Publikasjoner fra CRIStin - SINTEF AS [5802]
SINTEF Digital [2501]

Vis enkel innførsel