R: Remove duplicate rows from data.frame

remove_duplicates {basecamb}

R Documentation

Remove duplicate rows from data.frame

Description

Removes rows that are duplicates of another row in all columns except exclude_columns

Usage

remove_duplicates(
  data,
  exclude_columns = NULL,
  ID_column = NULL,
  quiet = FALSE
)

Arguments

`data`	data.frame to check
`exclude_columns`	character vector, these columns are not considered in determining whether two rows are equal
`ID_column`	character; column with identifiers to scan if possible duplicates remain
`quiet`	logical: Should messages be printed?

Details

Wraps unique()

Value

vector of row indices with non-unique data

Author(s)

J. Peter Marquardt

Examples

data <- data.frame(Study_ID = c("A", "B", "C"), ID = c(123, 456, 123), num_cars = c(10, 2, 10))
remove_duplicates(data, exclude_columns = "Study_ID")
remove_duplicates(data, exclude_columns = "Study_ID", ID_column = "ID")

[Package basecamb version 1.1.5 Index]