selectEpsilonGreedyAction {ReinforcementLearning} | R Documentation |
Performs
-greedy action selection
Description
Implements -greedy action selection. In this strategy, the agent explores the environment
by selecting an action at random with probability
. Alternatively, the agent exploits its
current knowledge by choosing the optimal action with probability
.
Usage
selectEpsilonGreedyAction(Q, state, epsilon)
Arguments
Q |
State-action table of type |
state |
The current state. |
epsilon |
Exploration rate between 0 and 1. |
Value
Character value defining the next action.
References
Sutton and Barto (1998). "Reinforcement Learning: An Introduction", MIT Press, Cambridge, MA.
[Package ReinforcementLearning version 1.0.5 Index]