Functions for data processing

Functions to parse and load data

get_cdc_url()

Get CDC data URL for a given section and year

get_html_page()

Get an HTML page from a URL

decode_all()

Decode all coded columns in a mortality dataset

decode_column()

Decode a coded column using a pipe-delimited key=label string
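
As an illustration of the pipe-delimited key=label idea, here is a minimal sketch in base R. The helper name decode_with_key() and the example key string are hypothetical, and this is not necessarily how decode_column() is implemented or what arguments it takes.

```r
# Hypothetical helper for illustration only; not the package's decode_column().
decode_with_key <- function(x, key) {
  # "1=Male|2=Female" -> c("1=Male", "2=Female")
  pairs <- strsplit(key, "|", fixed = TRUE)[[1]]
  # Split each pair on the first "=" only, so a label may itself contain "=".
  eq     <- regexpr("=", pairs, fixed = TRUE)
  codes  <- substr(pairs, 1, eq - 1)
  labels <- substr(pairs, eq + 1, nchar(pairs))
  lookup <- stats::setNames(labels, codes)
  # Codes that are missing from the key come back as NA.
  unname(lookup[as.character(x)])
}

decoded <- decode_with_key(c("1", "2", "9"), "1=Male|2=Female")
print(decoded)
```

In this sketch a code that does not appear in the key (such as "9") is returned as NA rather than raising an error, which makes unmapped codes easy to spot afterwards.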

decode_preview()

Preview decoded columns from a mortality dataset

download_cdc()

Download a CDC data object

load_data()

Load CDC Fixed-Width Data

load_data_birth_cohort()

Load CDC Linked Birth/Infant Death Cohort Data

read_section()

Read a CDC vital statistics fixed-width file using a metadata table
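
The general technique of driving a fixed-width read from a metadata table can be sketched with base R's read.fwf(). The layout table, its column names (name, start, end), and the two sample records below are invented for illustration; the metadata format that read_section() actually expects may differ.

```r
# Hypothetical metadata table: one row per field, with 1-based start/end positions.
layout <- data.frame(
  name  = c("year", "sex", "age"),
  start = c(1, 5, 6),
  end   = c(4, 5, 8),
  stringsAsFactors = FALSE
)

# Two made-up fixed-width records written to a temporary file.
path <- tempfile(fileext = ".txt")
writeLines(c("1982M034", "1982F101"), path)

# Field widths follow from the positions (the fields are contiguous in this sketch).
widths <- layout$end - layout$start + 1

# Read everything as character so codes such as "034" keep their leading zeros.
records <- utils::read.fwf(
  path,
  widths     = widths,
  col.names  = layout$name,
  colClasses = "character"
)
print(records)
```

Reading every field as character is a deliberate choice here: coded columns are usually decoded later, and numeric conversion would drop leading zeros and could turn a code like "F" into a logical value.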

scrape_all_sections()

Scrape All CDC Vital Statistics Sections

scrape_cdc_section()

Scrape file links from a CDC Vital Statistics section

scrape_mortality_user_guide()

Scrape CDC Mortality Multiple Cause User Guides

scrape_mortality_user_guide_specific()

Scrape CDC Mortality User Guide for a Specific Year

build_cdc_lookup()

Build CDC Vital Statistics Link Lookup Table

build_link_lookup()

Build CDC Vital Statistics Link Lookup Table

build_test_data()

Sample function to get a potential test dataset

cdc_import()

Import and decode a CDC vital statistics dataset

cdc_link_lookup

CDC Vital Statistics Link Lookup Table

parse_file_size_mb()

Convert file size strings to megabytes
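
A conversion of this kind might look like the following sketch. The helper name size_to_mb(), the assumed input format (a number followed by KB, MB, or GB), and the use of 1024-based units are all assumptions; the strings that parse_file_size_mb() handles and the factors it applies are not documented on this page.

```r
# Hypothetical helper for illustration only; not the package's parse_file_size_mb().
size_to_mb <- function(x) {
  # Leading number, e.g. "1.5" from "1.5 MB".
  value <- as.numeric(sub("^\\s*([0-9.]+).*$", "\\1", x))
  # Trailing unit, upper-cased, e.g. "MB".
  unit <- toupper(sub("^\\s*[0-9.]+\\s*([A-Za-z]+)\\s*$", "\\1", x))
  # Assumed 1024-based multipliers to megabytes.
  per_mb <- c(KB = 1 / 1024, MB = 1, GB = 1024)
  unname(value * per_mb[unit])
}

sizes_mb <- size_to_mb(c("512 KB", "1.5 MB", "2 GB"))
print(sizes_mb)
```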

clean_all_sections()

Clean and Reshape All CDC Vital Statistics Sections

usdeaths usdeaths-package

usdeaths: Scrape CDC Vital Statistics File Links

Mortality Multiple Deaths data_mortality_multiple_[YEAR]

Multiple cause-of-death mortality data files are record-level datasets from U.S. death certificates that include both the underlying cause of death and all other reported conditions contributing to death (“multiple causes”). For each decedent, they provide demographic details, place and time of death, and a full list of coded medical conditions, allowing researchers to study mortality patterns, comorbidities, and trends in causes of death over time.

data_mortality_multiple_1968 data_mortality_multiple_1969 data_mortality_multiple_1970 data_mortality_multiple_1971 data_mortality_multiple_1972 data_mortality_multiple_1973 data_mortality_multiple_1974 data_mortality_multiple_1975 data_mortality_multiple_1976 data_mortality_multiple_1977 data_mortality_multiple_1978 data_mortality_multiple_1979 data_mortality_multiple_1980 data_mortality_multiple_1981 data_mortality_multiple_1982 data_mortality_multiple_1983 data_mortality_multiple_1984 data_mortality_multiple_1985 data_mortality_multiple_1986 data_mortality_multiple_1987 data_mortality_multiple_2003 data_mortality_multiple_2004 data_mortality_multiple_2005 data_mortality_multiple_2006 data_mortality_multiple_2007 data_mortality_multiple_2008 data_mortality_multiple_2009 data_mortality_multiple_2010 data_mortality_multiple_2011 data_mortality_multiple_2012 data_mortality_multiple_2013 data_mortality_multiple_2014 data_mortality_multiple_2015 data_mortality_multiple_2016 data_mortality_multiple_2017 data_mortality_multiple_2018 data_mortality_multiple_2019 data_mortality_multiple_2020 data_mortality_multiple_2021 data_mortality_multiple_2022 data_mortality_multiple_2023 data_mortality_multiple_2024

Mortality multiple cause data layouts

Birth Cohorts Linked data_birth_cohort_[YEAR]

Birth Cohort Linked Birth-Infant Death data link each infant death to the infant’s own birth certificate within a given birth year (cohort), providing detailed information on both maternal/birth characteristics and infant mortality outcomes for that birth cohort.

data_birth_cohort_1983 data_birth_cohort_2008 data_birth_cohort_2009 data_birth_cohort_2010 data_birth_cohort_2011 data_birth_cohort_2012

Birth Cohort Layouts

load_data_birth_cohort()

Load CDC Linked Birth/Infant Death Cohort Data

Births data_births_[YEAR]

The Birth Data files are record-level datasets from U.S. birth certificates that provide detailed information on births each year, including infant characteristics, maternal demographics and health behaviors, and pregnancy and delivery details, for use in independent statistical and epidemiologic analyses.

data_births_1968 data_births_1969 data_births_1970 data_births_1971 data_births_1972 data_births_1973 data_births_1974 data_births_1975 data_births_1976 data_births_1977 data_births_1978 data_births_2018 data_births_2019 data_births_2020 data_births_2021 data_births_2022 data_births_2023 data_births_2024

Births Layouts

Fetal Deaths data_fetal_death_[YEAR]

Fetal Death data files are record-level datasets from U.S. fetal death reports that provide detailed information on late-pregnancy losses, including fetal characteristics (such as gestational age and birthweight), maternal demographics and health behaviors, pregnancy risk factors and complications, and, for recent years, cause of fetal death, to support statistical and epidemiologic analysis of fetal mortality patterns.

data_fetal_death_1982

Fetal Death Layouts