sampleGridSequence {ReinforcementLearning} | R Documentation
Sample grid sequence
Description
This function uses an environment function to generate sample experience in the form of state transition tuples.
Usage
sampleGridSequence(N, actionSelection = "random",
  control = list(alpha = 0.1, gamma = 0.1, epsilon = 0.1),
  model = NULL, ...)
Arguments
N
Number of samples.
actionSelection
(optional) Defines the action selection mode of the reinforcement learning agent. Default: "random".
control
(optional) Control parameters defining the behavior of the agent. Default: list(alpha = 0.1, gamma = 0.1, epsilon = 0.1).
model
(optional) Existing model of class
...
Additional parameters passed to function.
Value
A data frame containing the experienced state transition tuples (s, a, r, s_new).
The individual columns are as follows:
State
The current state.
Action
The selected action for the current state.
Reward
The reward in the current state.
NextState
The next state.
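As a minimal sketch of how the returned data frame might be inspected, the snippet below calls the function with its default arguments, as given under Usage. It assumes the ReinforcementLearning package is installed. The sample size of 100, the seed, and the variable name `transitions` are illustrative choices, not part of the package documentation.

```r
# Sketch only: runs the call only if the ReinforcementLearning package is available.
if (requireNamespace("ReinforcementLearning", quietly = TRUE)) {
  # The default action selection is "random", so fix a seed (arbitrary value)
  # to make repeated runs comparable.
  set.seed(42)

  # Request 100 sample transitions with the default control parameters.
  transitions <- ReinforcementLearning::sampleGridSequence(N = 100)

  # Column names should match those documented under Value:
  # State, Action, Reward, NextState.
  print(names(transitions))

  # Look at the first few state transition tuples.
  print(head(transitions))
}
```

Each row of `transitions` corresponds to one tuple (s, a, r, s_new), so the columns can be accessed by name in the usual way, for example `transitions$Reward`.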
See Also
[Package ReinforcementLearning version 1.0.5 Index]