Description
You may work together to help each other solve problems, but you should create your own solutions and hand in your own work without copying others’ work.
Data: “Sales_sample.csv” (same one as used in HW 5).
- Fit the linear regression model with sale price as response variable and SQFT, LOT_SIZE, BEDS, and BATHS as predictor variables (Model 1 from HW 5). Calculate robust standard errors for the coefficient estimates. Display a table with estimated coefficients, the usual standard errors that assume constant variance, and robust standard errors.
- Which set of standard errors should be used? Explain by referring to HW 5.
- Perform the Wald test for testing that the coefficient of the LOT_SIZE variable is equal to 0. Use the usual standard errors that assume constant variance. Report the test statistic and p-value.
- Perform the robust Wald test statistic for testing that the coefficient of the LOT_SIZE variable is equal to 0. Report the test statistic and p-value.
- Use the jackknife to estimate the SE for the coefficient of the LOT_SIZE variable. Report the jackknife estimate of the SE.
- Use the jackknife estimate of the SE to test the null hypothesis that the coefficient of the LOT_SIZE variable is equal to 0. Report the test statistic and p-value.
- Do the tests in Q3, Q4, and Q6 agree? Which of these tests are valid?
- Remove the LOT_SIZE variable from Model 1 (call this Model 1A). Fit Model 1A and report the table of coefficients, the usual standard errors that assume constant variance, and robust standard errors.
- Add the square of the LOT_SIZE variable to Model 1 (call this Model 1B). Fit Model 1B and report the table of coefficients, the usual standard errors that assume constant variance, and robust standard errors.
- Perform the F test to compare Model 1A and Model 1B. Report the p-value.
- State the null hypothesis being tested in Q10 either in words or by using model formulas.
- Perform the robust Wald test to compare Model 1A and Model 1B. Report the p-value.
- Compare the results of the tests in Q10 and Q12. Which test is valid?
The following questions use the LOG_PRICE variable as in HW 5. Fit models corresponding to Model 1A and Model 1B with LOG_PRICE as the response variable. Call these models Model 1A_Log and Model 1B_Log.
- Perform the F test to compare Model 1A_Log and Model 1B_Log. Report the p-value.
- State the null hypothesis being tested in Q14 either in words or by using model formulas.
- Perform the robust Wald test to compare Model 1A_Log and Model 1B_Log. Report the p-value.
- Compare the results of the tests in Q14 and Q16. Do they give the same conclusion?
- Based on all of the analyses performed, answer the following question. Is there evidence for an association between the size of the lot and sales price? Explain.



