Main Task
This week’s assignment is an extension of last week’s. You will once again use the Neumayer (2005) data; see last week’s handout for details. You will begin by fitting the same model you ended up selecting in last week’s assignment, this time on the full dataset instead of just half of it. From there:
- Before running any tests, think about your model. Do you think the errors are heteroskedastic? If so, what variables do you think are associated with the variance of the error term? 
- Conduct the following tests for heteroskedasticity:
- A Breusch-Pagan test using the variables in your model.
- A White test using the variables in your model.
- A Breush-Pagan test using all the independent variables in the dataset. (Ignore any variable not listed on last week’s handout.)
 Interpret the results of the tests. If you reject the null hypothesis of homoskedasticity, what seems to be the source of the heteroskedasticity? Does it line up with your expectations? 
- Estimate Huber-White standard errors for your model. Re-run the composite test(s) you performed last week with the new estimated variance matrix. Do your main conclusions change? 
- Based on your findings from step 2, pick a reasonable set of weights \(\omega\) and obtain the corresponding weighted least squares estimates. How do the WLS coefficient estimates compare to the OLS estimates? Re-run your composite test(s) using the WLS estimates. Once again, do your main conclusions change? 
- Think about what you just did in step 4. You ran one model (the original OLS), ran some tests on it (the Breusch-Pagan and White tests), used the results of those tests to set up a new model (the WLS), and then finally reported some test results from it (of the relevant composite hypothesis). Is this \(p\)-hacking? Can we trust the \(p\)-value you reported alongside your final composite hypothesis test? Why or why not? - For the purposes of this question, ignore the fact that the variable specification itself was “hacked” in last week’s assignment—imagine the specification was handed down from on high. I only want you to focus on the dimensions related to the testing and modeling of heteroskedasticity. 
- Which of your sets of estimates do you trust most: OLS with ordinary standard errors, OLS with Huber-White standard errors, or WLS? Why?