rbind_parquet {parquetize} | R Documentation |
Function to bind multiple parquet files by row
Description
This function read all parquet files in 'folder' argument that starts with 'output_name',
combine them using rbind and write the result to a new parquet file.
It can also delete the initial files if 'delete_initial_files' argument is TRUE.
Be careful, this function will not work if files with different structures are present in the folder given with the argument 'folder'.
Usage
rbind_parquet(
folder,
output_name,
delete_initial_files = TRUE,
compression = "snappy",
compression_level = NULL
)
Arguments
folder |
the folder where the initial files are stored |
output_name |
name of the output parquet file |
delete_initial_files |
Boolean. Should the function delete the initial files ? By default TRUE. |
compression |
compression algorithm. Default "snappy". |
compression_level |
compression level. Meaning depends on compression algorithm. |
Value
Parquet files, invisibly
Examples
## Not run:
library(arrow)
if (file.exists('output')==FALSE) {
dir.create("output")
}
file.create(fileext = "output/test_data1-4.parquet")
write_parquet(data.frame(
x = c("a","b","c"),
y = c(1L,2L,3L)
),
"output/test_data1-4.parquet")
file.create(fileext = "output/test_data4-6.parquet")
write_parquet(data.frame(
x = c("d","e","f"),
y = c(4L,5L,6L)
), "output/test_data4-6.parquet")
test_data <- rbind_parquet(folder = "output",
output_name = "test_data",
delete_initial_files = FALSE)
## End(Not run)
[Package parquetize version 0.5.7 Index]