spark_read_warc_sample {sparkwarc}    R Documentation

Load the sample WARC file into Spark

Description

Loads the sample WARC file into Spark, optionally filtering entries and lines with regular expressions.

Usage

spark_read_warc_sample(sc, filter = "", include = "")

Arguments

sc

An active spark_connection.

filter

A regular expression used to filter WARC entries. Matching runs efficiently in native code via Rcpp.

include

A regular expression used to keep only the lines that match. Matching runs efficiently in native code via Rcpp.
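
Examples

The following is a minimal sketch of how the arguments above might be used, not an official example from the package. It assumes that sparklyr and sparkwarc are installed and that a local Spark installation is available. The patterns "<html" and "<title" are hypothetical values chosen for illustration, and the return value is not described on this page, so the sketch does not rely on its structure.

```r
# Sketch only: assumes sparklyr, sparkwarc, and a local Spark install.
library(sparklyr)
library(sparkwarc)

# Open a local Spark connection to pass as the sc argument.
sc <- spark_connect(master = "local")

# Load the sample WARC file with the default (empty) patterns.
warc_all <- spark_read_warc_sample(sc)

# Load it again with illustrative patterns (hypothetical values):
# filter selects WARC entries, include keeps only matching lines.
warc_html <- spark_read_warc_sample(sc, filter = "<html", include = "<title")

# Close the connection when finished.
spark_disconnect(sc)
```

Both patterns default to the empty string, as the Usage section shows.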


[Package sparkwarc version 0.1.6 Index]