Reinforcement learning under circumstances beyond its control

Gaskett, Chris (2003) Reinforcement learning under circumstances beyond its control. In: Proceedings of the international conference on computational intelligence, robotics and autonomous systems. From: Proceedings of the international conference on computational intelligence for modelling control and automation (CIMCA2003), Vienna, Austria.

[img]
Preview
PDF
Download (967kB)
 
1163


Abstract

Decision theory addresses the task of choosing an action; it provides robust decision-making criteria that support decision-making under conditions of uncertainty or risk. Decision theory has been applied to produce reinforcement learning algorithms that manage uncertainty in state-transitions. However, performance when there is uncertainty regarding the selection of future actions must also be considered, since reinforcement learning tasks are multiple-step decision problems. This work proposes beta-pessimistic Q-learning—a reinforcement learning algorithm that does not assume complete control.

Item ID: 632
Item Type: Conference Item (Research - E1)
Keywords: Reinforcement learning, Beta-pessimistic, Risk, Uncertainty
Date Deposited: 04 Oct 2006
FoR Codes: 01 MATHEMATICAL SCIENCES > 0102 Applied Mathematics > 010205 Financial Mathematics @ 0%
17 PSYCHOLOGY AND COGNITIVE SCIENCES > 1702 Cognitive Science > 170203 Knowledge Representation and Machine Learning @ 0%
Downloads: Total: 1163
Last 12 Months: 28
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page