screen_duplicates {revtools} | R Documentation
Shiny app for locating and excluding duplicated entries in a dataset
Description
This is a simple app for calculating, displaying and screening potential duplicates in bibliographic data.
Usage
screen_duplicates(x, max_file_size)
Arguments
x |
An (optional) object of class |
max_file_size |
Optional argument to set the maximum file size (in MB) that the app will accept. |
Details
This app is effectively a wrapper for find_duplicates, with the added option to manually screen pairs of duplicates to check the results. Consequently, it is a more reliable way than extract_unique_references to deal with the duplicates identified by find_duplicates, and to test whether that function has returned sensible results for a given dataset.
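For comparison, the non-interactive route mentioned above can be sketched as follows. This is a minimal illustration rather than a recommended workflow: it assumes that find_duplicates can be called with its default settings on this dataset, and that extract_unique_references takes the data followed by the vector of matches. The five appended rows are artificial duplicates added purely for demonstration.

```r
library(revtools)

# import the example bibliography shipped with the package
file_location <- system.file(
  "extdata",
  "avian_ecology_bibliography.ris",
  package = "revtools")
x <- read_bibliography(file_location)

# append five artificial duplicates (for illustration only)
x_duplicated <- rbind(x, x[1:5, ])

# locate potential duplicates using default settings (assumed adequate here);
# entries judged to be duplicates of one another share the same value
x_check <- find_duplicates(x_duplicated)

# collapse the duplicates automatically, without manual screening
x_unique <- extract_unique_references(x_duplicated, matches = x_check)
```

Because this route accepts every match automatically, it is worth inspecting x_check, or running screen_duplicates(x_duplicated), before relying on x_unique.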
Value
This function launches a Shiny app in the user's default browser, allowing the user to customize the parameters for duplicate detection and visualise the results.
See Also
screen_titles or screen_abstracts for manual screening of individual articles.
Examples
# to run the app and upload data interactively
## Not run: screen_duplicates()
# or to specify data from the workspace
file_location <- system.file(
  "extdata",
  "avian_ecology_bibliography.ris",
  package = "revtools")
x <- read_bibliography(file_location)
# to run the app using these data:
## Not run: screen_duplicates(x)
# or to run the app & save results to the workspace:
## Not run: result <- screen_duplicates(x)