Due:
Submit:
Instructions: This is an individual assignment. You may
use course materials, textbooks, and R documentation, but you may not
collaborate with others or use external code without citation.
In your own words, answer the following:
Consider the simple linear regression model:
\[ Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i \]
where \(\varepsilon_i \sim N(0, \sigma^2)\).
Explain in words what the Gauss-Markov theorem guarantees about the OLS estimator in this context.
For the remainder, you may use data and a research problem of your choice, or rely on the one provided below. If you use your own, please briefly explain it.
Use the states dataset provided in the
poliscidata package, which contains state-level political
and demographic variables.
library(poliscidata)
## Registered S3 method overwritten by 'gdata':
## method from
## reorder.factor gplots
data("states")
# Variables include:
# vep12_turnout: voter turnout rate in 2012
# uninsured_pct: % without health insurance
# college: % with college degree
# prcapinc: per capita income
# cig_tax: tax rate on cigarettes
turnout is the dependent
variable and uninsued_pct, college, and
cig_tax are independent variables.Using the model from Problem 3:
sandwich and lmtest.Political scientists often argue that the effects of economic and educational inequalities on turnout aren’t linear.
uninsured_pct and college in your model from
Problem 3.ggeffects or
margins)Prepare a publication-quality regression table and figure.
modelsummary,
stargazer, gt, or kableExtra)
that displays:
ggplot2) showing the relationship between
uninsured_pct and college.