| fit.simpr_tibble {simpr} | R Documentation | 
Fit models to the simulated data
Description
Takes simulated data from
generate
and applies functions to it, usually
model-fitting functions.
Usage
## S3 method for class 'simpr_tibble'
fit(
  object,
  ...,
  .quiet = TRUE,
  .warn_on_error = TRUE,
  .stop_on_error = FALSE,
  .debug = FALSE,
  .progress = FALSE,
  .options = furrr_options()
)
## S3 method for class 'simpr_spec'
fit(
  object,
  ...,
  .quiet = TRUE,
  .warn_on_error = TRUE,
  .stop_on_error = FALSE,
  .debug = FALSE,
  .progress = FALSE,
  .options = furrr_options()
)
Arguments
| object | a  | 
| ... | 
 | 
| .quiet | Should simulation errors be broadcast to the user as they occur? | 
| .warn_on_error | Should there be a warning
when simulation errors occur? See
 | 
| .stop_on_error | Should the simulation stop immediately when simulation errors occur? | 
| .debug | Run simulation in debug mode, allowing objects, etc. to be explored for each attempt to fit objects. | 
| .progress | A logical, for whether or not to print a progress bar for multiprocess, multisession, and multicore plans . | 
| .options | The  | 
Details
This is the fourth step in the simulation
process: after generating the simulation data,
apply functions such as fitting a statistical
model to the data. The output is often then
passed to tidy_fits or
glance_fits to extract relevant
model estimates from the object.
Similar to
specify, the
model-fitting ... arguments can be
arbitrary R expressions (purrr-style
lambda functions, see
as_mapper) to specify
fitting models to the data. The functions are
computed within each simulation cell, so
dataset names are generally unnecessary: e.g.,
to compute regressions on each cell,
fit(linear_model = ~ lm(c ~ a + b).  If
your modeling function requires a reference to
the full dataset, use ., e.g.
fit(linear_model = ~lm(c ~ a + b, data =
.).
Value
a simpr_tibble object with
additional list-columns for the output of the
provided functions (e.g. model outputs). Just
like the output of
generate,
there is one row per repetition per
combination of metaparameters, and the
columns are the repetition number rep,
the metaparameter names, the simulated data
sim, with additional columns for the
function outputs specified in ....
If per_sim was called
previously, fit returns the object to
default simpr_tibble mode.
Examples
## Generate data to fit models
simple_linear_data = specify(a = ~ 2 + rnorm(n),
                               b = ~ 5 + 3*a + rnorm(n, 0, sd = 0.5)) %>%
  define(n = 100:101) %>%
  generate(2)
## Fit with a single linear term
linear_fit = simple_linear_data %>%
  fit(linear = ~lm(b ~ a, data = .))
linear_fit # first fit element also prints
## Each element of $linear is a model object
linear_fit$linear
## We can fit multiple models to the same data
multi_fit = simple_linear_data %>%
  fit(linear = ~lm(b ~ a, data = .),
      quadratic = ~lm(b ~ a + I(a^2), data = .))
## Two columns, one for each model
multi_fit
## Again, each element is a model object
multi_fit$quadratic
## Can view terms more nicely with tidy_fits
multi_fit %>%
  tidy_fits
## Can view model summaries with glance_fits
multi_fit %>%
  glance_fits
## Fit functions do not actually need to be any particular kind of model, they
## can be any arbitrary function. However, not all functions will lead to useful
## output with tidy_fits and glance_fits.
add_five_data = simple_linear_data %>%
  fit(add_five = ~ . + 5)  ## adds 5 to every value in dataset
add_five_data