read.pdf {pdfminer} | R Documentation |
Read a PDF
document.
Description
Extract PDF
document
Usage
read.pdf(
file,
pages = integer(),
method = c("csv", "sqlite", "PythonInR"),
laycntrl = layout_control(),
encoding = "utf8",
password = "",
caching = TRUE,
maxpages = Inf,
rotation = 0L,
image_dir = "",
pyexe = "python3"
)
Arguments
file |
a character string giving the name of the |
pages |
an integer giving the pages which should be extracted
(default is |
method |
a character string giving the data transfer method. Allowed values
are |
laycntrl |
a list of layout options, created by the function |
encoding |
a character string giving the encoding of the output
(default is |
password |
a character string giving the password necessary to access
the |
caching |
a logical if |
maxpages |
an integer giving the maximum number of pages to be extracted
(default is |
rotation |
an integer giving the rotation of the page, allowed values
are |
image_dir |
a character string giving the path to the folder, where the images
should be stored (default is |
pyexe |
a character string giving the path to the python executable
(default is |
Value
Returns a object of class "pdf_document"
.
Examples
if (is_pdfminer_installed()) {
pdf_file <- system.file("pdfs/cars.pdf", package = "pdfminer")
read.pdf(pdf_file)
}