prolific_anonymize {umx}R Documentation

Clean up a prolific file for sharing by removing anonymity-compromising columns.

Description

prolific.ac IDs and other columns like IP and lat/long might compromise subject anonymity when shared. prolific_anonymize replaces PIDs with a simple numeric sequence, preserving repeated measures in long data, and removing other columns. You can delete additional columns by adding them to extraColumns. It is ideal for use when sharing data to https://researchbox.org which enforces anonymization.

Usage

prolific_anonymize(
  df = NULL,
  PID = "PID",
  extraColumns = NA,
  baseOffset = 10000
)

Arguments

df

Existing datafile to anonymize.

PID

The prolific ID col name to anonymize

extraColumns

Any extra columns to delete (default NA)

baseOffset

The numeric to start renumbering PIDs from (default = 1e4)

Value

References

See Also

Other Data Functions: noNAs(), prolific_check_ID(), prolific_read_demog(), umxFactor(), umxHetCor(), umx_as_numeric(), umx_cont_2_quantiles(), umx_lower2full(), umx_make_MR_data(), umx_make_TwinData(), umx_make_fake_data(), umx_make_raw_from_cov(), umx_merge_randomized_columns(), umx_polychoric(), umx_polypairwise(), umx_polytriowise(), umx_read_lower(), umx_rename(), umx_reorder(), umx_score_scale(), umx_select_valid(), umx_stack(), umx_strings2numeric(), umx

Examples

## Not run: 
tmp = prolific_anonymize(df, PID = "PID")

## End(Not run)

[Package umx version 4.20.0 Index]