Click here to read the documentation:
This project describes cancer epidemiology in Arizona using data from three sources:
- United States Cancer Statistics.
- Arizona Cancer Registry Database Query Module.
- Surveillance, Epidemiology, and End Results (SEER) Program SEER*Stat Database.
The goal is to produce tables and visualizations that compare cancer incidence and mortality across the following geographies and populations:
- United States
- Arizona
- University of Arizona Cancer Center (UAZCC) Catchment
- Arizona Counties
- UAZCC Catchment Counties
- Hispanic, non-Hispanic, and American Indian populations
The United States Cancer Statistics (USCS) data tidy script downloads data from the USCS website and prepares it for further analysis. It generates four RDS files describing incidence and mortality by:
- cancer
- age
- state
- states and counties
These data files can be processed in the USCS data processing script.
The Arizona Department of Health Services (ADHS) Indicator Based Information System for Public Health (IBIS-PH) allows the public to query cancer rates, mortality rates, and population estimates for Arizona. These public health data sets are intended to support evidence-based decision making in Arizona to plan and improve service delivery, evaluate health care systems, and inform policy decisions. Other uses are not permissible. Available at
National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database: NPCR and SEER Incidence – U.S. Cancer Statistics Public Use Research Database, 2019 submission (2001–2017), United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Released June 2020. Available at
Surveillance, Epidemiology, and End Results (SEER) Program SEER*Stat Database: Mortality - All COD, Aggregated With County, Total U.S. (1990-2018) <Katrina/Rita Population Adjustment> - Linked To County Attributes - Total U.S., 1969-2018 Counties, National Cancer Institute, DCCPS, Surveillance Research Program, released May 2020. Underlying mortality data provided by NCHS.
On RPubs:
Tidy data files should be named using the following pattern, with fields separated by underscores:
"event type"_"geography"_"data source"_"year-range"_"other details"
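
As a rough illustration of the naming pattern, the sketch below assembles a file name from its five fields. It is written in Python only for demonstration and is not part of the project's scripts. The helper name `tidy_file_name`, the `.rds` extension default, the lowercasing, and the replacement of spaces with hyphens are all assumptions made for this example. The field values shown are illustrative as well.

```python
def tidy_file_name(event_type, geography, data_source, year_range,
                   other_details="", extension=".rds"):
    """Join the naming-pattern fields with underscores (hypothetical helper)."""
    fields = [event_type, geography, data_source, year_range, other_details]
    # Assumption: fields are lowercased and internal whitespace becomes a hyphen,
    # so that underscores only ever separate fields. Empty fields are skipped.
    cleaned = ["-".join(str(f).lower().split()) for f in fields if str(f).strip()]
    for field in cleaned:
        if "_" in field:
            raise ValueError(f"field {field!r} must not contain an underscore")
    return "_".join(cleaned) + extension


print(tidy_file_name("incidence", "az", "uscs", "2001-2017", "by cancer"))
# prints: incidence_az_uscs_2001-2017_by-cancer.rds
```

Rejecting underscores inside a field keeps the name unambiguous, so a file name can later be split on `_` to recover its fields.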