Introduction
Everyone deserves access to quality health care. However, this is not a reality in the United States. There are systemic health inequities, and the marginalized populations suffer the consequences. To examine this issue, we focus our BLOG on regional health insurance coverage status, the role of intersectionality in health disparities, and people’s experiences with public insurance.
We begin this BLOG with an overview of health care disparities in the United States by looking at how health insurance coverage status differs across states over time. We then analyze how socially disadvantaged populations experience greater obstacles to health based on their racial/ethnic group; socioeconomic status; gender and sexual orientation. Finally, in response to our shared passion for improving minority groups’ access to quality health care, we present an analysis of quality of public health insurance by examining people’s opinions about their health insurance plans captured in Medicare ratings data set.
Affordable Care Act
The Affordable Care Act, which was passed under Obama administration in 2010 and began to take effect in 2014, attempted to expand healthcare coverage to low-income Americans of up to 138% of the poverty line. Yet, obtaining health insurance does not necessarily ensure better access to health care. It is reported that Medicare patients still face barriers to care, such as delayed treatment of chronic conditions, resulting in poorer health outcomes.
Questions of Interest
- What are the geographical differences of health care coverage in the United States and how has this changed over the years?
- Have the rates of health care coverage changed at a consistent pattern across races? Do some races have a higher rates of coverage?
- What are the factors that drive health care inequity? Does intersectionality play a role in exacerbating inequality even in health care settings?
- How can we address health inequality? Would improving Medicare help enhance the overall quality of health care?
Data
Dataset 1: United States Census Bureau
Description: These data sets are from the United States Census Bureau. There is one dataset for each of the years from 2012-2019. Each data set contains information on the amount of people insured and uninsured in each state, with an addition of break downs by different demographic groups. They contain information on age, race, gender, income group, state, and county. The data set was in .csv format.
These datasets were used for both the map and the time series. Each of the demographic variables had an option that encapsulated all people in the state (i.e “all races” or “both genders”). For this project, all people in each state were evaluated. A race breakdown was used for one visualization. Additionally, the numbers chosen to be reported from this data set are the percent of people who are uninsured in each state. The percent was chosen because it allows for a better state-by-state comparison rather than just total numbers of people, since not all states have the same population.
Dataset 2: Centers for Disease Control and Prevention
- Description: The National Center for Health Statistics, which is part of the CDC, conducts the National Health Interview Survey annually to monitor the health of the United States population. We use the 2019 survey data that sampled adults aged 18 years old or older. It contains a wide variety of variables, including respondents’ demographic information, socioeconomic status, and their barriers to accessing and using appropriate health care, which we choose to focus on. Each row in the data set represents a sample adult. Given the special nature of survey data that the responses are categorized into number codes, we wrangled them to be categorical variables with more descriptive values. Our final data set thus contains a mix of quantitative and categorical variables, and we deploy appropriate clustering algorithm for such data.
Dataset 3: Centers for Medicare & Medicaid Services
Description:The Centers for Medicare & Medicaid Services (CMS) uses a five-star quality rating system to provide information about health plan quality. It measures Medicare beneficiaries’ experience with their health plans and the health care system. CMS assigns Star Ratings based on grouping measure scores, and we used the data set that contains the individual measures of quality under five categories such as member experience or how well the plan provides screenings and immunization. The data set was in .csv format, and we compared data over time-from 2016 to 2022.
The dataset contains various metrics of health care quality for each Medicare plan, so it allows users to compare one plan to another based on those metrics. Each measure reflects beneficiaries’ perspectives of the care they received, and we focus on measures related to preventive services, treatments, and consumer satisfaction.
Visualizations
Interactive Map
Map
Discussion
This app depicts how the percent of uninsured people has changed in the continental United States over the years 2012-2019. The redder states indicate a lower percentage of people that are uninsured and a bluer color indicates a higher percentage of people that are uninsured. After watching this animation, it is clear that between the years of 2014 and 2015, there is a pretty significant decrease in the percentage of people who are uninsured. This directly aligns with the introduction of the Affordable Care Act and Obamacare in 2014. This tells that the Affordable Care Act did do it’s job of allowing more people to obtain healthcare, but as stated before this does not mean that everyone has the same access to healthcare.
Overall, there is a drastic difference in the percent of people that are uninsured between 2012 and 2019. The northeast, eastern midwest, and the west coast tend to have lower rates of uninsured people, whereas the south and the western middle of the country have higher rates. Could this potentially be due to political aspects? Most liberal states are the north east and west coast (where more people have insurance), where more conservative states are typically the south and middle of the country states(where less people have insurance). As of Nov 2021, only 38 states (and DC) have adopted the Medicaid expansion and there are still 12 states who have yet to adopt the expansion. The 12 states are: Texas, Wyoming, South Dakota, Kansas, Wisconsin, Tennessee, Mississippi, Alabama, Georgia, North Carolina, South Carolina, and Florida. This aligns with what the map shows, where most of those states are the ones that are bluer. Additionally, some of the states that very recently adopted the Medicaid expansion like Missouri, Idaho, and Oklahoma all still have higher rates of uninsured people, but hopefully those numbers will start to go down within the next few years. (Visit Status of State Medicaid Expansion Decisions for more information about Medicaid expansion.)
Timeseries
Timeseries
Discussion
The time series allows us to see how the trends per state have fluctuated over the years by racial group. This is a similar data breakdown as the map, however allows you to see a state-by-state comparison and the differences between the 3 races included in the data (white, black, and hispanic). This visualization allows you to see, again, the decrease that happens when the Affordable Care act begins to take force; almost all states have a decrease from about 2013-2016 no matter what percentage they originally started at. Some states have a pretty consistant low rate, like Massachusetts and DC. Compared to the other states they look seemingly unchanged, but when zoomed in (only that state selected), you can see that they too have a drop from 2013-2016.
When different races are selected, the overall layout of the graphs don’t really change too much: most of the slopes stay the same and the placement of each state as well. The one major difference is the scale of the graph; when switch from race to race, the scale on the y-axis changes. With white, the highest number shown on the scale is about 15%. When switched to black, it goes up to 20%. Then with hispanic it goes up to 30%. This shows that although the change of rates of uninsured people has stayed pretty consistent over all 3 races, the proportion is not the same. Hispanic people are uninsured at a higher rate than both black and white individuals. White people have the lowest proportion of individuals who are uninsured and black people fall in the middle.
This tells us that the Affordable Care Act effected these 3 racial groups the same way; none of them had a significantly different proportion of people who were able to obtain health insurance in comparison to the other groups. This is important information because it shows that the Affordable Care Act was equal, but not equitable. Hispanic and Black groups needed access to health insurance at a higher rate than White people did – hispanics needed it at the highest proportion.
Clustering Analysis
Background
What is Intersectionality?
Intersectionality is a research approach that explores the interaction of different social categories such as race, ethnicity, gender, class, religion and sexuality. It assumes that social categories intersect in dynamic ways to privilege or disadvantage different people depending on their characteristics and contexts.
We believe the intersectionality is an important concept for understanding and responding to health inequalities. Because interactions between social identities can be multiplicative in effect, either positively or negatively, people who possess multiple minority identities are more likely to experience barriers to health care. In other words, health care inequity may disproportionately affect patients who are most in need of medical cae based on their social locations. Thus, understanding intersectionality would lead to precise insights about what drives health inequalities and who is affected by them.
Clustering
We employ clustering analysis approach to examine the effect of intersectionality on health care inequities. We use the national health interview survey data set, which contains health insurance coverage status and various socioeconomic information of its respondents; we focus on race, sexual orientation, citizenship status, education level, and income. We hope to identify clusters of survey participants with various socioeconomic identities. By doing so, we aim to explore how coverage status differs according to different combinations of social locations.
| Coverage Status | Number of Respondents |
|---|---|
| Covered | 27479 |
| Not covered | 2672 |
set.seed(112)
demographic_coverage <- demographic2 %>%
group_by(coverage) %>%
sample_n(2500)
Due to the fact that the data set contains a disproportionate number of survey respondents for each health insurance coverage status (i.e. whether insured or uninsured), our initial clustering analysis results were dominated by those insured. Instead, we drew 2500 stratified random samples from each coverage status group.
Given that the basic premise of clustering is the similarity/dissimilarity between the data points, we compute the Gower distance by using the daisy function from the cluster package. We choose the Gower distance because it allows us to calculate the distance matrix for data that contains both categorical and quantitative variables, as opposed to the popular Euclidean or Manhattan distance metrics. The Gower distance is scaled in a numerical range of 0 (identical) and 1 (maximally dissimilar). Note that we transformed the income variable into square roots because Gower distance requires variables to be symmetric.(Visit daisy: Dissimilarity Matrix Calculation for more details.) While daisy function logarithmically transforms asymmetric variables by default, our exploratory data analysis revealed that square root transformation achieves symmetry better than the log transformation for the income variable.
silhouette <- c()
silhouette = c(silhouette, NA)
for(i in 2:10){
pam_clusters = pam(as.matrix(gower_df),
diss = TRUE,
k = i)
silhouette = c(silhouette, pam_clusters$silinfo$avg.width)
}
plot(1:10, silhouette,
xlab = "Clusters",
ylab = "Silhouette Width")
lines(1:10, silhouette)
Silhouette Width vs Number of Clusters
Just like K-means and hierarchical clustering uses Euclidean distance matrix, Partitioning Around Medoids (PAM) algorithm goes hand-in-hand with the Gower distance. PAM iterates over and over until the medoids don’t change their positions, instead of centroids in K-means clustering. The medoid of a cluster is a member of the cluster which is representative of the median of all the variables under consideration.
When running the PAM algorithm, we look at the silhouette width to select the optimal number of clusters. The silhouette width measures the similarity of each point to its cluster, and compares that to the similarity of the point with the closest neighboring cluster; a higher value is desirable. We calculate this metric for a range of cluster numbers and find where it is maximized. When we plot the silhouette width against the number of clusters, we can see the value is maximized at 5 clusters.
pam_nhis = pam(gower_df, diss = TRUE, k = 5)
demographic_coverage[pam_nhis$medoids, ]
| Cluster | Sex | Sexual Orientation | Race | Education | Coverage Status | Citizenship Status | Income |
|---|---|---|---|---|---|---|---|
| 1 | Male | Straight | White | Bachelor’s degree | Covered | US Citizen | $79,524 |
| 2 | Female | Straight | White | Bachelor’s degree | Covered | US Citizen | $64,516 |
| 3 | Male | Straight | White | High school graduate | Not covered | US Citizen | $40,000 |
| 4 | Female | Straight | White | High school graduate | Not covered | US Citizen | $32,761 |
| 5 | Male | Straight | Hispanic | Less than a high school diploma | Not covered | Non-US Citizen | $27,556 |
We construct a PAM model with 5 clusters and examine medioids to interpret the clusters. In the medioids table above, each row represents a cluster; two clusters of respondents are insured, and the other three are not. We can also infer from the table that respondents belonging to Cluster 5 are uninsured and tend to be male; straight; Hispanic; non-US citizens; have less than a high school degree; earn the lowest income among all clusters. In other words, the uninsured tend to possess multiple social identities often identified as minority. However, it seems difficult to gain meaningful insights from the other 4 clusters because all of them tend to show very similar characteristics. This is due to the fact that there are a disproportionate number of survey participants who are White or straight. In order to address this issue, we apply clustering algorithm to the stratified random samples from each group of either race or sexual orientation.
| Cluster | Sex | Sexual Orientation | Race | Education | Coverage Status | Citizenship Status | Income |
|---|---|---|---|---|---|---|---|
| 1 | Female | Straight | White | Bachelor’s degree | Covered | US Citizen | $99,856 |
| 2 | Male | Straight | Asian | Bachelor’s degree | Covered | US Citizen | $79,524 |
| 3 | Male | Straight | Black/African American | Some college, no degree | Covered | US Citizen | $47,524 |
| 4 | Female | Straight | Hispanic | High school graduate | Covered | US Citizen | $40,000 |
| 5 | Female | Straight | American Indian and Alaska Native | Some college, no degree | Covered | US Citizen | $19,881 |
| 6 | Female | Straight | Asian | Bachelor’s degree | Not covered | Non-US Citizen | $69,696 |
| 7 | Male | Straight | White | High school graduate | Not covered | US Citizen | $37,636 |
| 8 | Male | Straight | Hispanic | Less than a high school diploma | Not covered | Non-US Citizen | $27,889 |
| 9 | Female | Straight | Black/African American | Associate degree | Not covered | US Citizen | $26,896 |
| 10 | Male | Straight | American Indian and Alaska Native | Less than a high school diploma | Not covered | US Citizen | $24,964 |
We created 75 stratified random samples from each race for each coverage status. The medioids table for 10 clusters grouped by race shows that uninsured respondents tend to possess less than a college degree; their income tends to be less than that of their racial counterparts who have health insurance. Moreover, those with multiple social minority statuses (e.g. Female Asian who is a non-US citizen) are more likely to be uninsured than those who hold single minority identity (e.g. Male Asian who is a US-citizen).
tsne_object <- Rtsne(gower_df, is_distance = TRUE)
tsne_df <- tsne_object$Y %>%
data.frame() %>%
setNames(c("X", "Y")) %>%
mutate(cluster = factor(pam_nhis$clustering))
To visualize our clustering analysis results, we use the t-Distributed Stochastic Neighbor Embedding (t-SNE) technique, a dimension reduction technique that enables us to visualize a multi-dimensional data set in a two-dimensional plot.
We visualized our multi-dimensional data into a two-dimensional plot by using t-SNE technique. We focus only on the uninsured individuals in this plot. Users can identify demographic and socioeconomic information of each respondent by hovering over each point. The clusters seem to overlap a bit, but they are fairly distinctive overall. (Visit T-SNE Clearly Explained for more information about how to interpret the visualization generated by t-SNE.) Looking at the plot as well as the medioids table, we find that our clustering results are dominated by those who identify as straight. We then move on to run clustering on stratified random samples drawn from each sexual orientation group.
| Cluster | Sex | Sexual Orientation | Race | Education | Coverage Status | Citizenship Status | Income |
|---|---|---|---|---|---|---|---|
| 1 | Male | Straight | White | Associate degree | Covered | US Citizen | $90,000 |
| 2 | Male | Gay/Lesbian | White | Bachelor’s degree | Covered | US Citizen | $59,536 |
| 3 | Female | Straight | White | Bachelor’s degree | Covered | US Citizen | $44,944 |
| 4 | Female | Bisexual | White | High school graduate | Covered | US Citizen | $29,929 |
| 5 | Female | Bisexual | White | High school graduate | Not covered | US Citizen | $40,000 |
| 6 | Male | Straight | White | Some college, no degree | Not covered | US Citizen | $40,000 |
| 7 | Female | Gay/Lesbian | Black/African American | Bachelor’s degree | Not covered | US Citizen | $31,684 |
| 8 | Female | Straight | Hispanic | Less than a high school diploma | Not covered | Non-US Citizen | $19,881 |
We draw 30 stratified random samples from combinations of coverage status and sexual orientation due to the limited number of respondents for each combination. The medioids table shows that most individuals who are not covered by health insurance identify as sexual minorities; do not have college degrees; earn less than those insured. We observe the interaction of different social categories being in effect when it comes to barriers to health care.
Discussion
Intersectionality approach deepens our understanding of health inequities through the complexity of the real world. It moves us beyond interpreting barriers to health care as unique circumstances of individuals towards considering the fundamental drivers of systemic inequality and examining power dynamics behind it. Through our clustering analysis, we discovered that health inequality has disproportionately adverse effects on marginalized groups with multiple minority identities–for example, White gay without a college degree–as opposed to those with each identity in isolation–White straight men with Bachelor’s degree. We learned that the convergence of race, sexual orientation, education level, and economic status advantages some and disadvantages others, placing some people at greater risk for poor health outcomes. It is thus important to be cognizant of the role of intersectionality in patients’ experience of discrimination in the health care system; this understanding would lead to precise insights about what causes health disparities and who is affected by them. Considering social circumstances to which one belongs and power structures underpinning them will help address root causes of barriers to health care, allowing for more targeted and effective policies that can redress structural imbalance.
Limitations
One limitation of this clustering analysis is small sample size. Since there were disproportionate number of survey respondents with certain identities (e.g. White) in our data, we had to sample from each combination of race/sexual orientation and coverage status. However, there were very few survey participants for some combinations (e.g. there were only 32 Gay/Lesbians who were uninsured). Despite the fairly large samples–31,997 respondents–in the original data, our sample sizes shrunk to less than hundred. The small sample size may prevent us from drawing meaningful conclusions. We used stratified random sampling to account for the disproportion among observations, but we would like to explore other ways to address that issue for future study.
Discussion
Intersectionality approach deepens our understanding of health inequities through the complexity of the real world. It moves us beyond interpreting barriers to health care as unique circumstances of individuals towards considering the fundamental drivers of systemic inequality and examining power dynamics behind it. Through our clustering analysis, we discovered that health inequality has disproportionately adverse effects on marginalized groups with multiple minority identities–for example, White gay without a college degree–as opposed to those with each identity in isolation–White straight men with Bachelor’s degree. We learned that the convergence of race, sexual orientation, education level, and economic status advantages some and disadvantages others, placing some people at greater risk for poor health outcomes. It is thus important to be cognizant of the role of intersectionality in patients’ experience of discrimination in the health care system; this understanding would lead to precise insights about what causes health disparities and who is affected by them. Considering social circumstances to which one belongs and power structures underpinning them will help address root causes of barriers to health care, allowing for more targeted and effective policies that can redress structural imbalance.
Wordcloud
Background
What is Medicare?
Medicare helps provide healthcare coverage to U.S. citizens who are 65 years of age or older, as well as people with certain disabilities. Medicare consists of four parts, and we focus on Part C: Medical Advantage. Medical Advantage is an all-in-one alternative to Original Medicare, which includes drug coverage and extra benefits not covered by Original Medicare. These plans are provided by contracting private insurance companies instead of the federal government.
Given that Medicare patients still face barriers to care, we use Medicare quality ratings data to examine how to strengthen public insurance. We believe that examining reviews for Medicare would help ensure that its beneficiaries are able to access the medical care they need.
Wordcloud
We examine the word cloud to get a sense about people’s experiences with Medicare. The quality ratings data set measures percentage of plan members who took advantage of each service offered by the plan, such as diabetes care, or had certain experiences with the plan, such as whether they had access to care or appointments in a timely fashion. Since we aim to find room for improvement regarding the quality of care, we performed some data wrangling to reverse the percentages and focus on people who did NOT utilize the services or felt unsatisfied with their health insurance. Through the word cloud, we hope to gain knowledge about people’s opinions about Medicare and explore how the public health insurance can be strengthened.
The above word cloud shows what patients have said about their health care experience from 2016 to 2020; text size corresponds to percentage of plan members who reported to have certain experiences, and words are colored by different types of metrics. By default, it displays all three types (i.e. prevention, treatment, customer satisfaction). Words classified as Treatment seem to stand out in their sizes across the board, which implies that a lot of plan members did not avail the services related to treatment of chronic/acute conditions, such as diabetes care. Also, people’s experiences with preventive services such as screenings or immunizations are the second largest words displayed in the visualization. However, we should not discount this as sign of Medicare’s poor performance in preventive care because according to Centers for Medicare & Medicaid Services, the Affordable Care Act allowed the Medicare beneficiaries to have access to free preventive services. That being said, the plan members simply may not have taken advantage of those services because they were not in need of them, not because of certain barriers or plan’s poor performance.
In order to extend our knowledge about actual performance of Medicare plans, we pay close attention to members’ general opinions about the plan’s care and services, which had the least presence in the across-the-board visualization. We do so by selecting Customer Satisfaction as the sole input in the Shiny app. The most conspicuous words in the word cloud filtered by the customer satisfaction are “Less Timely Care/Appointments” and “Not Getting Needed Care”. This suggests that a lot of people were not able to get care or appointments quickly, nor could they get needed care. In other words, although they are enrolled in Medicare, they still experience some barriers to health care access. We associate those obstacles with long wait times that disincentivize seeking care; cost-sharing charges in health insurance plans; administrative burdens that discourage access to care.
Another notable finding is that the words “Complaints” and “Members Choosing to Leave the Plan” increase in their sizes from 2019 to 2021. This means that there has been an increasing number of people who filed complaints against their Medicare plans and chose to leave the plan over that time period. This may be attributable to the COVID-19 outbreak as the pandemic may have resulted in decline in plan’s performance; the job losses caused by the pandemic may have also disrupted job-based health coverage for millions of people.
Discussion
The examination of word cloud revealed that a lot of people specifically did not (or could not) receive treatment for their health conditions, although they were covered by Medicare. This may be plausibly attributable to the fact that people tend to go without needed treatments because of financial barriers. Co-payments, deductibles, or co-insurance may have deterred even people with insurance from getting those services. This finding suggests that there is a pressing need to drive down costs for Medicare beneficiaries to make it more affordable for them to fully take advantage of critical health care services and stay healthy.
Another improvement to be made would be the quality of service delivery–specifically, access to care. Taking a closer look at the words classified as Customer Satisfaction, we found that people lacked timely access to care. We believe timeliness of access to care and other services is an extremely important indicator of the quality of the overall healthcare experience; not only are the prolonged wait times a source of potential frustration and dissatisfaction with care, but also they can directly translate into health outcomes, even possibly increasing mortality and morbidity. Thus, Medicare would have to work on streamlining its delivery systems to reduce wait times and ensure its beneficiaries receive the needed care at one of the most important times for them–as soon as possible. Additionally, we could also notice how the word cloud reflected the impact of the COVID-19 pandemic on health care system.
Discussion
The examination of word cloud revealed that a lot of people specifically did not (or could not) receive treatment for their health conditions, although they were covered by Medicare. This may be plausibly attributable to the fact that people tend to go without needed treatments because of financial barriers. Co-payments, deductibles, or co-insurance may have deterred even people with insurance from getting those services. This finding suggests that there is a pressing need to drive down costs for Medicare beneficiaries to make it more affordable for them to fully take advantage of critical health care services and stay healthy.
Another improvement to be made would be the quality of service delivery–specifically, access to care. Taking a closer look at the words classified as Customer Satisfaction, we found that people lacked timely access to care. We believe timeliness of access to care and other services is an extremely important indicator of the quality of the overall healthcare experience; not only are the prolonged wait times a source of potential frustration and dissatisfaction with care, but also they can directly translate into health outcomes, even possibly increasing mortality and morbidity. Thus, Medicare would have to work on streamlining its delivery systems to reduce wait times and ensure its beneficiaries receive the needed care at one of the most important times for them–as soon as possible. Additionally, we could also notice how the word cloud reflected the impact of the COVID-19 pandemic on health care system.
Conclusion
The map revealed that the Affordable Care Act did indeed decrease the amount of individuals that were uninsured. It especially was seen from 2012-2015. Overall, the northeast and the west coast tended to have a lower proportion of uninsured individuals compared to the south and middle of the country. This seems correlated to which states adopted the Medicaid expansion and which states did not.
Through the time series it was discovered that most states in the United States tended to follow similar patterns of fluctuations of the percentage of individuals that were uninsured. The main takeaway from this was that the Affordable Care Act was not equitable over the 3 races that were studied since all 3 races seemed to be affected by the Affordable Care Act equally.
Through clustering analysis, we discovered that interaction of social categories, rather than each in isolation, and the power structures underpinning them result in persistent gaps in access to affordable, high-quality health care coverage. In other words, socially disadvantaged groups with overlapping identities are more likely to experience barriers to care.
The word cloud revealed that Medicare has not been effective in delivering quality health care; even if those insured by Medicare are not fully taking advantage of it presumably due to high bills. Medicare patients also lack timely access to care, which is often a critical aspect of patient experience and may have fatal consequences in their health outcomes. Thus, reducing costs of care and closely monitoring timeliness of access to care would be an important step toward ensuring Medicare beneficiaries to access the medical care they need.
Citations
“Affordable Care Act (ACA) - Healthcare.gov Glossary.” HealthCare.gov, https://www.healthcare.gov/glossary/affordable-care-act/.
“Status of State Medicaid Expansion Decisions: Interactive Map.” KFF, 19 Nov. 2021, https://www.kff.org/medicaid/issue-brief/status-of-state-medicaid-expansion-decisions-interactive-map/.
Shendre, Sushrut. “Clustering Datasets Having Both Numerical and Categorical Variables.” Medium, Towards Data Science, 11 May 2020, https://towardsdatascience.com/clustering-datasets-having-both-numerical-and-categorical-variables-ed91cdca0677.
“Daisy: Dissimilarity Matrix Calculation.” RDocumentation, https://www.rdocumentation.org/packages/cluster/versions/2.1.2/topics/daisy.
Erdem, Kemal. “T-SNE Clearly Explained.” Medium, Towards Data Science, 22 Apr. 2020, https://towardsdatascience.com/t-sne-clearly-explained-d84c537f53a.
Data
- Small Area Health Insurance Estimates (SAHIE). U.S. Census Bureau, April 2021, https://www2.census.gov/programs-surveys/sahie/datasets/time-series/estimates-acs/. Accessed 3 Nov. 2021. Dataset.
- 2019 National Health Interview Survey Sample Adult Interview. Centers for Disease Control and Prevention, September 2020, https://www.cdc.gov/nchs/nhis/2019nhis.htm. Accessed 3 Nov. 2021. Dataset.
- Medicare Part C and D Star Ratings Centers for Medicare & Medicaid Services, December 2021, https://www.cms.gov/Medicare/Prescription-Drug-Coverage/PrescriptionDrugCovGenIn/PerformanceData. Accessed 3 Nov. 2021. Dataset.
Packages
- Rousseeuw et al. “``Finding Groups in Data’’: Cluster Analysis Extended.” The Comprehensive R Archive Network, Comprehensive R Archive Network (CRAN), 17 Apr. 2021, https://cran.r-project.org/web/packages/cluster/.
- Krijthe, Jesse. “Rtsne: T-Distributed Stochastic Neighbor Embedding Using a Barnes-Hut Implementation.” GitHub, 10 Nov. 2018, https://github.com/jkrijthe/Rtsne.