Motivation

Health policy in the United States is complicated, and several forms of healthcare coverage exist, including both coverage under federal government-led healthcare policy and coverage by private insurance companies. Before making any inference about the relationship between health conditions and health policy, it is important to have a general idea of healthcare economics in the United States. Thus, we are interested in getting a sense of healthcare coverage and healthcare spending across states. More specifically, the questions are:

  1. Is there a relationship between healthcare coverage and healthcare spending in the United States?
  2. How does the distribution of healthcare spending vary across geographic regions in the United States?
  3. Does the relationship between healthcare coverage and healthcare spending in the United States change between 2013 and 2014?

In this case study, we’ll walk you through collecting, importing, cleaning, wrangling, and visualizing the data, using well-established and commonly used R packages, including datasets, tidyr, dplyr, ggplot2, and ggrepel.

What is the data?