dataset-to-R {crunch}R Documentation method for CrunchDataset


This method is defined principally so that you can use a CrunchDataset as a data argument to other R functions (such as stats::lm()) without needing to download the whole dataset. You can, however, choose to download a true data.frame.


## S3 method for class 'CrunchDataset'
  row.names = NULL,
  optional = FALSE,
  force = FALSE,
  categorical.mode = "factor",
  row.order = NULL,
  include.hidden = TRUE,

## S3 method for class 'CrunchDataFrame'
  row.names = NULL,
  optional = FALSE,
  include.hidden = attr(x, "include.hidden"),



a CrunchDataset or CrunchDataFrame


part of signature. Ignored.


part of signature. Ignored.


logical: actually coerce the dataset to data.frame, or leave the columns as unevaluated promises. Default is FALSE.


what mode should categoricals be pulled as? One of factor, numeric, id (default: factor)


vector of indices. Which, and their order, of the rows of the dataset should be presented as (default: NULL). If NULL, then the Crunch Dataset order will be used.


logical: should hidden variables be included? (default: TRUE)


additional arguments passed to (default method).


By default, the method for CrunchDataset does not return a data.frame but instead CrunchDataFrame, which behaves like a data.frame without bringing the whole dataset into memory. When you access the variables of a CrunchDataFrame, you get an R vector, rather than a CrunchVariable. This allows modeling functions that require select columns of a dataset to retrieve only those variables from the remote server, rather than pulling the entire dataset into local memory.

If you call on a CrunchDataset with force = TRUE, you will instead get a true data.frame. You can also get this data.frame by calling on a CrunchDataFrame (effectively calling on the dataset twice)

When a data.frame is returned, the function coerces Crunch Variable values into their R equivalents using the following rules:

Column names in the data.frame are the variable/subvariable aliases.


When called on a CrunchDataset, the method returns an object of class CrunchDataFrame unless force = TRUE, in which case the return is a data.frame. For CrunchDataFrame, the method returns a data.frame.

See Also


[Package crunch version 1.30.4 Index]