New York City Health & Nutrition Examination Survey Datasets & Resources
The New York City Health and Nutrition Examination Survey (NYC HANES), modeled on the National Health and Nutrition Examination Survey, is a population-based, cross-sectional study with data collected from a physical examination and laboratory tests, as well as a face-to-face interview and an audio computer-assisted self-interview.
Survey Design and Methodology Overview
The first survey, 2004 NYC HANES, was conducted between June and December 2004. Data collection for the second NYC HANES occurred between August 2013 and June 2014. The survey used a probability sample of non-institutionalized adult New York City residents (ages 20 years or older) to provide representative citywide estimates.
The survey was designed using a three-stage cluster household sampling plan. Of the 2,472 eligible households randomly selected for the 2013–2014 survey, 1,827 households completed the eligibility screening questionnaires, and 1,527 selected individuals participated in the survey, yielding an overall response rate of 36 percent. A detailed methodology paper has been published in the peer-reviewed literature.
Datasets and Resources
Below are resources for public health researchers to access and analyze the data, including datasets, codebooks, sample SAS code, and training materials.
The New York City Department of Health and Mental Hygiene has also placed NYC HANES data on EpiQuery, a web-based, user-friendly system designed to provide access to health data at the click of the mouse, without needing to generate statistical analysis codes.
To properly access and analyze the data, you will need the following materials:
- Analytic Guidelines
- Control Acasi SAS File
- Control Blood SAS File
- Control Capi SAS File
- Control Exam SAS File
- Control Saliva SAS File
- Control Urine SAS File
- Data Documentation
- NYC HANES Analytic Datasets SAS File
- Public Dataset Codebook
- Sample SAS Code
- SAS Format Code
- Training Part I: Survey Basics
- Training Part II: Weighting and Age-Adjustment
- Training Part III: Data Analysis Using SAS and SUDAAN
- Variable List
Please note that you must use SAS, SPSS, or a similar statistical software package to access and view SAS files.
R Programming Language Users
Researchers have released a GitHub R package that contains a function to call in NYC HANES public data and merge it with individual microbiome information for a selected sample of 297. This GitHub package also includes programs to reproduce results of a submitted manuscript looking at sociodemographic variation in clinically significant oral microbiome taxa.
Individual mouthwash specimen 16s microbiome reads are also available on the National Center for Biotechnology Information’s “Sequence Read Archive” (BioProject ID PRJNA484874) NYCHANES smoking and oral microbiome study page.