journal article Apr 27, 2023

Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping Review

Abstract
Background
Big data from large, government-sponsored surveys and data sets offers researchers opportunities to conduct population-based studies of important health issues in the United States, as well as develop preliminary data to support proposed future work. Yet, navigating these national data sources is challenging. Despite the widespread availability of national data, there is little guidance for researchers on how to access and evaluate the use of these resources.


Objective
Our aim was to identify and summarize a comprehensive list of federally sponsored, health- and health care–related data sources that are accessible in the public domain in order to facilitate their use by researchers.


Methods
We conducted a systematic mapping review of government sources of health-related data on US populations that had active or recent (within the previous 10 years) data collection. The key measures were government sponsor, overview and purpose of data, population of interest, sampling design, sample size, data collection methodology, type and description of data, and cost to obtain data. Convergent synthesis was used to aggregate findings.


Results
Among 106 unique data sources, 57 met the inclusion criteria. Data sources were classified as survey or assessment data (n=30, 53%), trends data (n=27, 47%), summative processed data (n=27, 47%), primary registry data (n=17, 30%), and evaluative data (n=11, 19%). Most (n=39, 68%) served more than 1 purpose. The population of interest included individuals/patients (n=40, 70%), providers (n=15, 26%), and health care sites and systems (n=14, 25%). The sources collected data on demographic (n=44, 77%) and clinical information (n=35, 61%), health behaviors (n=24, 42%), provider or practice characteristics (n=22, 39%), health care costs (n=17, 30%), and laboratory tests (n=8, 14%). Most (n=43, 75%) offered free data sets.
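
As a quick arithmetic check, the percentages above appear to use the 57 included sources as the denominator. The following is a minimal Python sketch under that assumption, with counts copied from this paragraph; the names `INCLUDED`, `data_type_counts`, and `pct` are illustrative and do not come from the article.

```python
# Counts copied from the Results paragraph.
# Assumption: every percentage uses the 57 included sources as its denominator.
INCLUDED = 57

data_type_counts = {
    "survey or assessment": 30,
    "trends": 27,
    "summative processed": 27,
    "primary registry": 17,
    "evaluative": 11,
}


def pct(count, total=INCLUDED):
    """Return the share of included sources as a whole-number percentage."""
    return round(100 * count / total)


percentages = {name: pct(n) for name, n in data_type_counts.items()}

# Total number of category assignments across all data types.
total_assignments = sum(data_type_counts.values())

for name, n in data_type_counts.items():
    print(f"{name}: n={n}, {percentages[name]}%")
print(f"category assignments: {total_assignments} across {INCLUDED} sources")
```

The category assignments total 112, which is more than the 57 included sources. This is consistent with the statement that most sources served more than 1 purpose, so the data type percentages are not expected to sum to 100%.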


Conclusions
A broad scope of national health data is accessible to researchers. These data provide insights into important health issues and the nation’s health care system while eliminating the burden of primary data collection. Data standardization and uniformity were uncommon across government entities, highlighting a need to improve data consistency. Secondary analyses of national data are a feasible, cost-efficient means to address national health concerns.

Details
Published: Apr 27, 2023
Vol/Issue: 25
Pages: e43802
Cite This Article
Ann Annis, Crista Reaves, Jessica Sender, et al. (2023). Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping Review. Journal of Medical Internet Research, 25, e43802. https://doi.org/10.2196/43802