-1. \(a) Linearity: With so many observations in the dataset, we look for particularly extreme outliers in the histogram of residuals and do not see any. We also do not see a non-linear pattern emerging in the residuals vs. predicted plot. Independent observations: The sample is random and there does not seem ti be a trend in the residuals vs. order of data collection plot. Normality: The histogram of residuals appears to be unimodal and symmetic, centered at 0. Constant or equal variability: The residuals vs. predicted values plot reveals some outliers. This plot for only babies with predicted birth weights between 6 and 8.5 pounds looks a lot better, suggesting that for bulk of the data the constant variance condition is met. All concerns raised here are relatively mild. There are some outliers, but there is so much data that the influence of such observations will be minor. (b) $H_0$: The true slope coefficient of habit is zero ($\beta_5 = 0$). $H_A$: The true slope coefficient of height is different than zero ($\beta_5 \neq 0$). The p-value for the two-sided alternative hypothesis ($\beta_5 \ne 0$) is incredibly 0.0007 (smaller than 0.05), so we reject $H_0$. The data provide convincing evidence that height and weight are positively correlated, given the other variables in the model. The true slope parameter is indeed greater than 0.
0 commit comments