presample {Rborist} | R Documentation |
Forest-wide Observation Sampling
Description
Observations sampled for each tree to be trained. In the case of the Random Forest algorithm, this is the bag.
Usage
## Default S3 method:
presample(y,
nHoldout = 0,
samplingWeight = numeric(0),
nSamp = 0,
nRep = 500,
withRepl = TRUE,
verbose = FALSE,
nTree = 0,
...)
Arguments
y |
A vector to be sampled, typically the response. |
nHoldout |
Number of observations to omit from sampling. Augmented by unobserved response values. |
samplingWeight |
Per-observation sampling weights. Default is uniform. |
nSamp |
Size of sample draw. Default draws |
nRep |
Number of samples to draw. Replaces deprecated |
withRepl |
true iff sampling is with replacement. |
verbose |
true iff tracing execution. |
nTree |
Number of samples to draw. Deprecated. |
... |
not currently used. |
Value
an object of class Sampler
consisting of:
-
yTrain
the sampled vector. -
nSamp
the sample sizes drawn. -
nRep
the number of independent samples. -
nTree
synonymous withnRep
. Deprecated. -
samples
a packed data structure encoding the observation index and corresponding sample count. -
hash
a hashed digest of the data items.
References
Tille, Yves. Sampling algorithms. Springer New York, 2006.
Examples
## Not run:
y <- runif(1000)
# Samples with replacement, 500 vectors of length 1000:
ps <- presample(y)
# Samples, as above, with 63 observations held out:
ps <- presample(y, nHoldout = 63)
# Samples without replacement, 250 vectors of length 500:
ps2 <- presample(y, nTree=250, nSamp=500, withRepl = FALSE)
## End(Not run)