| stroke_reduced {graphPAF} | R Documentation | 
Simulated case control dataset for 6856 stroke cases and 6856 stroke controls
Description
Dataset containing simulated data on risk factors for 6856 stroke cases and 6856 stroke control, based on risk factors and associations in the INTERSTROKE study
Usage
stroke_reduced
Format
A data frame with 13712 rows and 19 variables:
- region
 Geographic region, 1: Western Europe, 2: Eastern/central Europe/Middle East 3: Africa, 4: South Asia, 5: China, 6: South East Asia, 7: South America
- case
 case control status, (1 for stroke cases)
- sex
 Gender of individual, 0: male, 1:female
- age
 Age of individual
- smoking
 Smoking status, 0: Never, 1: Current
- stress
 1: sometimes stressed, 0: never stressed
- waist_hip_ratio
 Waist hip ratio
- exercise
 Physical Activity. 1: mainly inactive, 0: mainly active
- alcohol
 Alcohol history and frequency, 1:never, 2:low/moderate, 3:high intake
- diabetes
 Diabetes, 0: No, 1: Yes
- diet
 Healthy eating score (higher is better)
- early_stage_heart_disease
 presence of risk factors for heart disease. 0: No, 1: yes
- lipids
 Ratio of Apolipoprotein B to Apolipoprotein A
- education
 Years of education. 1: No education, 2: 1-8 years, 3:9-12 years, 3:Technical college, 4: University
- high_blood_pressure
 Diagnosed hypertension: 0 No, 1: yes
- weights
 weights that are proportional to inverse sampling probabilities. We have scaled the weights to be 0.0035 for a case and 0.9965 for a control to reflect any approximate incidence of 1 serious stroke in every 0.9965/0.0035 person years in the population
- time
 simulated time variable (for illustrating survival models)
- event
 simulated event indicator (0 if censored, 1 if event happened): for illustrating survival models
- strata
 Strata number based on sex and region. For illustrating conditional regression
Source
Data simulated based on relationships described in https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(16)30506-2/fulltext