R: Patient Cohort Re-Identification Processing

cohort_processing {SangerTools}

R Documentation

Patient Cohort Re-Identification Processing

Description

Population Health Management commonly leads practitioners to identify a cohort that will have an intervention applied. As a rule of thumb most analysts will work with pseudonymised data sets. For targeted interventions patients require re-identification; this process is generally carried out by a third party organisation. As third party organisations work with many health care providers they have a strict set of requirements. This has been based around SW CSU's required formatting.

Usage

cohort_processing(
  df,
  Split_by,
  path,
  prefix = "DSCRO",
  com_code = "11M",
  date_format = "%Y%m%d",
  suffix = "_REID_V01"
)

Arguments

`df`	a tidy dataframe in standard Master Patient Index format ie SangerTools::PopHealthData.
`Split_by`	A column within df that will be used to split the patients and will also appear in the file name. Ideally should be a health organisation code such as GP Practice Code or Hospital Trust Code. Should only have alpha-numeric values
`path`	A file path to which the CSV files will be written
`prefix`	File name prefix, default is "DSCRO" See more here: NHS DSCRO
`com_code`	Commissioner Code, default is "11M"; Gloucestershire.
`date_format`	A date format passed internally to 'format(Sys.Date())'; will form part of file name to denote date of generation. You can read more about date formatting in R from R lang
`suffix`	A file name suffix, default is "_REID_V01", To be left as blank use "", without spaces.

Value

n number of CSV files written to the location specified by path argument.

[Package SangerTools version 1.0.2 Index]