PDFs_collect {metagear} | R Documentation |
Attempts to download PDFs from multiple DOI links.
Description
Tries to download a collection of PDF files using multiple digital object
identifier (DOI) links. Updates a data frame with the success of these
downloads. The function is a wrapper for PDF_download
. NOTE:
A single DOI may generate multiple PDF files. If running downloader
in Windows, having "WindowsProxy = TRUE"
will significantly improve
download success.
Usage
PDFs_collect(
aDataFrame,
DOIcolumn,
FileNamecolumn,
directory = getwd(),
randomize = FALSE,
seed = NULL,
buffer = FALSE,
validatePDF = TRUE,
quiet = FALSE,
showSummary = TRUE,
WindowsProxy = FALSE
)
Arguments
aDataFrame |
A data frame containing a column of DOIs and a column of individual file names for each downloaded PDF. |
DOIcolumn |
The label of the column containing all the DOI links. |
FileNamecolumn |
The label of the column containing all the strings that will be used to rename the downloaded files. |
directory |
A string of the location (directory) were downloaded PDF
files are to be saved. NOTE: helps to have this directory created before
initializing the |
randomize |
When |
seed |
An integer used to enforce repeatability when randomly downloading PDFs. |
buffer |
When |
validatePDF |
When |
quiet |
When |
showSummary |
When |
WindowsProxy |
When |
Value
The data frame with new column containing download-outcome successes.
See Also
Examples
## Not run:
data(example_references_metagear)
someRefs <- effort_initialize(example_references_metagear)
dir.create("metagear_downloads")
PDFs_collect(aDataFrame = someRefs, DOIcolumn = "DOI",
FileNamecolumn = "STUDY_ID", directory = "metagear_downloads",
WindowsProxy = TRUE)
## End(Not run)