collect.disk.frame {disk.frame} | R Documentation |
Bring the disk.frame into R
Description
Bring the disk.frame into RAM by loading the data and running all lazy operations as data.table/data.frame or as a list
Bring the disk.frame into RAM by loading the data and running all lazy operations as data.table/data.frame or as a list
Usage
## S3 method for class 'disk.frame'
collect(x, ..., parallel = !is.null(attr(x, "recordings")))
collect_list(
x,
simplify = FALSE,
parallel = !is.null(attr(x, "recordings")),
...
)
## S3 method for class 'summarized_disk.frame'
collect(x, ..., parallel = !is.null(attr(x, "recordings")))
Arguments
x |
a disk.frame |
... |
not used |
parallel |
if TRUE the collection is performed in parallel. By default if there are delayed/lazy steps then it will be parallel, otherwise it will not be in parallel. This is because parallel requires transferring data from background R session to the current R session and if there is no computation then it's better to avoid transferring data between session, hence parallel = FALSE is a better choice |
simplify |
Should the result be simplified to array |
Value
collect return a data.frame/data.table
collect_list returns a list
collect return a data.frame/data.table
Examples
cars.df = as.disk.frame(cars)
# use collect to bring the data into RAM as a data.table/data.frame
collect(cars.df)
# clean up
delete(cars.df)
cars.df = as.disk.frame(cars)
# returns the result as a list
collect_list(cmap(cars.df, ~1))
# clean up
delete(cars.df)
cars.df = as.disk.frame(cars)
# use collect to bring the data into RAM as a data.table/data.frame
collect(cars.df)
# clean up
delete(cars.df)