Research Question: How well can we use the explanatory variables to predict height?
Population: All BYU students.
Parameters of Interest: The regression line parameters (all slopes and spread (\(\sigma\)))
Sample: A convenience sample of 1727 BYU students who are in Stat 121.
Research Question: What is the average height for male students who have a 64 inch tall mom, 68 inch tall dad, did not play sports in HS and wears a size 9 shoe?
Fitted Model: \[ \begin{align} \hat{y} &= 23.26 + 0.28\times \text{MotherHeight}_i + 0.21\times \text{FatherHeight}_i + 0.35\times \text{Sports}_i + \\ & \qquad 3.19\times \text{Sex}_i + 1.06\times \text{ShoeSize}_i \end{align} \] How would you use the model to figure this out?
Research Question: What is the average height for male students who have a 64 inch tall mom, 68 inch tall dad, did not play sports in HS and wears a size 9 shoe?
Fitted Model: \[ \begin{align} \hat{y} &= 23.26 + 0.28\times \text{MotherHeight}_i + 0.21\times \text{FatherHeight}_i + 0.35\times \text{Sports}_i + \\ & \qquad 3.19\times \text{Sex}_i + 1.06\times \text{ShoeSize}_i \end{align} \] How would you use the model to figure this out? \[ \begin{align} \hat{y} &= 23.26 + 0.28\times 64 + 0.21\times 68 + 0.35\times 0 + 3.19\times 1 + 1.06\times 9 \\ &= 68.419 \end{align} \]
Thought Question: Is our prediction of \(\hat{y}\) = 68.419 of the average height for male students who have a 64 inch tall mom, 68 inch tall dad, did not play sports in HS and wears a size 9 shoe a sample estimate or population parameter?
Using similar principles as we have used in the past to build confidence intervals: \[ \hat{y} \pm t^\star\text{SE}(\hat{\beta}_0 + \hat{\beta}_1\text{MH} + \cdots + \hat{\beta}_5\text{Shoe}) \] Is a confidence interval for the average value of \(y\) given an \(x\) (the population average height for male students who has a 64 inch tall mom, 68 inch tall dad, did not play sports in HS and wears a size 9) where the value of \(t^\star\) is determined by the confidence level.
For our analysis, this comes out to be (68.177, 68.665) for a 95% interval.
Notes:
Remembering what we learned from simple linear regression, what is the difference between a prediction interval and a confidence interval in regression?
Remembering what we learned from simple linear regression, what is the difference between a prediction interval and a confidence interval in regression?
Remembering what we learned from simple linear regression, which interval is wider and why?
Remembering what we learned from simple linear regression, which interval is wider and why?
Research Question: Eddie is a male student who has a 64 inch tall mom, 68 inch tall dad, did not play sports in HS and wears a size 9. What will his height be?
Using similar principles as we have used in the past to build confidence intervals: \[ \hat{y} \pm t^\star \text{SE}(\hat{\beta}_0 + \hat{\beta}_1\text{MH} + \cdots + \hat{\beta}_5\text{Shoe} + \hat{\epsilon}) \] is a prediction interval for the value of \(y\) given an \(x\) (for example, Eddie’s height) where the value of \(t^\star\) is determined by the confidence level.
For our analysis, this comes out to be (64.928, 71.914) for a 95% interval.
Notes:
Confidence interval for prediction: An interval estimate for the average of \(y\) given the \(x's\).
Prediction interval for prediction: An interval estimate for the value of a single \(y\) given the \(x's\).
All of the steps are the same as in previous lectures…
Research Question: Jordan is a male student who has a 77 inch tall mom, 85 inch tall dad, did play sports in HS and wears a size 12 shoe. What will his height be?
Answer:
Hawaii is ocean bordering state with a latitude of 19.90 degrees and longitude of 155.67. What is a 90% interval for the melanoma mortality for this state? Enter -999 if doing this prediction is not appropriate.
Hawaii is ocean bordering state with a latitude of 19.90 degrees and longitude of 155.67. What is a 90% interval for the melanoma mortality for this state? Enter -999 if doing this prediction is not appropriate.
What method is used to determine if predictions from regression are accurate or not?
What method is used to determine if predictions from regression are accurate or not?
How do we know if our predictions are any good?
Notes:
Model 1 uses just latitude to predict mortality and has a RMSE of 15. Model 2 uses latitude and ocean to predict mortality and has a RMSE of 14. Model 3 uses latitude, ocean and longitude to predict mortality and has an RMSE of 14.9. Which model is the preferred model to use?
Model 1 uses just latitude to predict mortality and has a RMSE of 15. Model 2 uses latitude and ocean to predict mortality and has a RMSE of 14. Model 3 uses latitude, ocean and longitude to predict mortality and has an RMSE of 14.9. Which model is the preferred model to use?
Measuring possum head size can be difficult. However, various other factors can be used to predict head size? Use a multiple linear regression model (and the course app) to answer the following questions:
Hyrum found a huge (96 cm total, male, 7 years, 68cm skull, 42 length tail) possum, What is your predicted head length for this possum?
Hyrum found a huge (96 cm total, male, 7 years, 68cm skull, 42 length tail) possum. What is the average head length for possums of this size?
Hyrum found a baby (70 cm total, male, 0.5 years, 42cm skull, 28 length tail) possum. What is your predicted head length for this possum?
Is your model good or bad at possum head sizes?