smqP {VGAMdata} | R Documentation |
Smoking Data in the NHANES 2017–2020 Cycle
Description
Selected variables mainly focussed on the smoking questionnaire of the National Health and Nutrition Examination Survey collected during 1.5 cycles just prior to Covid-19.
Usage
data(smqP)
Format
A data frame with the following variables.
- SEQN
-
Identifier for individuals that can be used to merge in other data sets from the same cycle.
- TC30
-
Numeric. Tobacco consumption: average number of cigarettes/day during past 30 days. Aka
SMD650
with some preprocessing. - SCA
-
Smoking cessation age (SCA). Is missing for those who have not quit. Computing this involved variables
SMQ050Q
,SMQ050U
andage
. The variableSMQ050Q
is forHow long since quit smoking cigarettes?
andSMQ050U
are the units (e.g., years, months, days). The variableSMQ050Q
is right-censored at 50 years (66666 means 50 or more years) and for such people SCA was set toNA
. - TCquit
-
Tobacco consumption: number of cigarettes smoked per day when quit. Aka
SMD057
with some preprocessing. - SIA
-
Smoking initiation age (SIA): age when individuals started smoking cigarettes regularly. Aka
SMD030
with some preprocessing. - age, gender
-
Age (
RIDAGEYR
) when surveyed; the value 80 is right-censored. Gender isRIAGENDR
. - race, race2
-
Race (
RIDRETH1
) and a binary simplification of race ("Non-Hispanic White"
versus"Others"
). - educ, marital
-
Education (
DMDEDUC2
) and marital status (DMDMARTZ
). Both variables were not collected for those aged 12–19, hence those areNA
s. - poverty_ratio
-
Ratio of family income to poverty (
INDFMPIR
). - meals_fastfood_pizza, diet, readytoeat_foods
-
For example, for the first one, this is the number of meals from a fast food or pizza place (
DBD900
) during the last 7 days, where the value 5555 means more than 21 meals per week, the value 7777 means the person refused to answer, and the value 9999 means the person didn't know. The other variables may be described later in more detail. - frozen_meals
-
To be described later.
- people_fam_smoking, people_home_smoking
-
Numeric. For
people_fam_smoking
, this is Number of people who live here smoke tobacco? (SMD460
). For this, 0 is: No one in household is a smoker; 1 is: 1 household member is a smoker; 2 is: 2 or more household members are smokers.For
people_home_smoking
, this is the number of people who smoke inside this home? (SMD470
). For this, 0 is: No one smokes inside the house; 1 is: 1 household member smokes inside the house; 2 is: 2 or more household members smoke inside the house. - use_cigarettes, use_pipes, use_cigars
-
Use during the last 5 days of smoking variants. The codes are
SMQ690A
,SMQ690B
,SMQ690C
. - use_hookah, use_ecigarettes
-
The codes are
SMQ690G
,SMQ690H
. - use_snuff, use_otherSmokeless
-
The codes are
SMQ690E
,SMQ690K
. - passiveSmoke_job, passiveSmoke_rest
-
Binary 0 (no) or 1 (yes) measuring exposure to passive smoke at certain places in the past 7 days, e.g., While you were working at a job or business outside of the home, did someone else smoke cigarettes or other tobacco products indoors? The codes are
SMQ858
,SMQ862
. - passiveSmoke_bar, passiveSmoke_car
-
See above. The codes are
SMQ868
,SMQ872
. - passiveSmoke_home, passiveSmoke_other
-
See above. The codes are
SMQ876
,SMQ880
. - passiveSmoke_ecigarettes
-
See above. The code is
SMQ940
.
Details
The National Health and Nutrition Examination Survey
(NHANES)
is a well-known longitudinal study located in USA
(a country just north of Mexico).
This data frame shares a selection of variables mainly
to do with the smoking questionnaire
(codeword: SMQ
);
some demographic and anthropometric variables
might also be included and/or added later.
The significance of P
is that the
2019–2020 cycle was not completed due to Covid-19,
hence this data concerns 2017–2020 and is a merging of
the 2017–2018 cycle (data codenamed J
) with
further data collected just prior to the pandemic.
The original data has been preprocessed and/or simplified.
For example, "Don't know"
and "Refused"
usually have been converted to a NA
.
This data frame is subject to change, especially with the addition of new variables.
Source
The URL https://wwwn.cdc.gov/Nchs/Nhanes/ provides an entry point from which many data sets may be downloaded. Comprehensive documentation is available there too.
The data was downloaded in late 2022 by Luca Frigau, University of Cagliari, and some subsequent edits were made by Thomas Yee.
Examples
summary(smqP)