meltt_duplicates {meltt} | R Documentation |
Return identified duplicate entries removed after integration.
Description
meltt_duplicates
returns all entries identified as matches during the integration process.
Usage
meltt_duplicates(object, columns = NULL)
Arguments
object |
object of class |
columns |
string vector referencing column names located in the input data. Default is to return all columns contained in the input data. |
Details
meltt_duplicates
returns all duplicated entries along with the specified columns, so that matched entries can be compared. The function allows easy extraction of all entries identified as duplicates.
Value
Returns a data.frame
where the first columns contain an index for the data.source and event for each data frame. The subsequent columns are all columns specified in the columns
argument, or all columns contained in the original input data if columns = NULL
.
An "event_type" column is added to the output data.frame
specifying whether the match was between events or episodes. See meltt_inspect
for handling flagged event-to-episode matches.
Author(s)
Karsten Donnay and Eric Dunford.
References
Karsten Donnay, Eric T. Dunford, Erin C. McGrath, David Backer, David E. Cunningham. (2018). "Integrating Conflict Event Data." Journal of Conflict Resolution.
See Also
meltt
, meltt_data
, meltt_inspect
Examples
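# Load the example data
data(crashMD)
# Integrate the three data sets
output = meltt(crash_data1, crash_data2, crash_data3,
               taxonomies = crash_taxonomies, twindow = 1, spatwindow = 3)
# Extract the entries identified as duplicates, keeping the selected columns
duplicates = meltt_duplicates(output, columns = c("date", "longitude", "latitude"))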
head(duplicates)
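# A possible follow-up, assuming the example above has been run:
# leaving out `columns` uses the default (NULL), which returns all columns
# contained in the input data.
all_duplicates = meltt_duplicates(output)
# Tabulate the kinds of matches using the "event_type" column described under
# Value. The guard is a precaution: the column name is taken from this page
# and its possible values are not documented here.
if ("event_type" %in% names(all_duplicates)) print(table(all_duplicates$event_type))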