selectEpsilonGreedyAction {ReinforcementLearning} | R Documentation |
Performs \varepsilon
-greedy action selection
Description
Implements \varepsilon
-greedy action selection. In this strategy, the agent explores the environment
by selecting an action at random with probability \varepsilon
. Alternatively, the agent exploits its
current knowledge by choosing the optimal action with probability 1-\varepsilon
.
Usage
selectEpsilonGreedyAction(Q, state, epsilon)
Arguments
Q |
State-action table of type |
state |
The current state. |
epsilon |
Exploration rate between 0 and 1. |
Value
Character value defining the next action.
References
Sutton and Barto (1998). "Reinforcement Learning: An Introduction", MIT Press, Cambridge, MA.
[Package ReinforcementLearning version 1.0.5 Index]