Research Question: What is the average student height for students whose mother is 64 inches tall?
How would you figure this out?
Research Question: What is the average student height for students whose mother is 64 inches tall?
Answer: Use the best fit regression line to tell you the answer.
\(\hat{y} = \hat{\beta}_0 + \hat{\beta}_1 x = 35.653 + 0.503\times 64 = 67.845\)
Is our prediction of \(\hat{y}\) = 67.845 of the average height for students whose mother is 64 inches a sample statistic or population parameter?
Is our prediction \(\hat{y}\) = 67.845 of the average height for students whose mother is 64 inches a sample statistic or population parameter?
Lets build a confidence interval for the population parameter!
Using similar principles as we have used in the past to build confidence intervals: \[ \hat{y} \pm t^\star \hat{\sigma}\sqrt{\frac{1}{n}+\frac{(x-\bar{x})}{\sum_{i=1}^n(x_i - \bar{x})^2}} \] Is a confidence interval for the average value of \(y\) given an \(x\) (the population average student height for 64 inch tall mothers) where the value of \(t^\star\) is determined by the confidence level.
For our analysis, this comes out to be (67.662, 68.054) for a 95% interval.
Notes:
Research Question: Shaylee’s mom is 64 inches tall, what will her height be?
Thought Questions:
Is this the same question as above? If not, what is the difference?
Research Question: Shaylee’s mom is 64 inches tall, what will her height be?
Thought Questions:
Should our point prediction (1 number prediction) be the same or different?
Research Question: Shaylee’s mom is 64 inches tall, what will her height be?
Should our interval for the prediction be the same or different? Why or why not?
Using similar principles as we have used in the past to build confidence intervals: \[ \hat{y} \pm t^\star \hat{\sigma}\sqrt{1 + \frac{1}{n}+\frac{(x-\bar{x})}{\sum_{i=1}^n(x_i - \bar{x})^2}} \] is a prediction interval for the value of \(y\) given an \(x\) (for example, Shaylee’s height if her mom is 64 inches tall) where the value of \(t^\star\) is determined by the confidence level.
For our analysis, this comes out to be (60.449, 75.268) for a 95% interval.
Notes:
Confidence interval for prediction: An interval estimate for the average of \(y\) given an \(x\).
Prediction interval for prediction: An interval estimate for the value of a single \(y\) given an \(x\).
Prediction intervals are ALWAYS wider than confidence intervals. Why?
All previous steps in the tool are the same as covered in previous lecture notes:
Luis thinks he wants to move to Panama City Florida for the beaches (30.1588 north latitude). What is the UPPER bound of a 90% interval for the mortality rate for this state?
Luis thinks he wants to move to Panama City Florida for the beaches (30.1588 north latitude). What is the UPPER bound of a 90% interval for the mortality rate for this state?
Research Question: Lucy’s mom is 82 inches tall, what will her height be?
Answer:
How do we know if our predictions are any good? For example, how do we know if our prediction for Shaylee’s height was good or bad?
For the melanoma example, suppose we got an RMSE of 19. Which of the following is the correct interpretation of this value?
For the melanoma example, suppose we got an RMSE of 19. Which of the following is the correct interpretation of this value?
Measuring possum head size can be difficult. However, measuring total possum length is easier. What is the relationship between possum length and head size? Use a simple linear regression model (and the course app) to answer the following questions:
Sydney found a huge 96 cm possum. What is your predicted head length for this possum?
Sydney found a huge 96 cm possum. What is the average head length for possums of this size?
Sydney found a baby 70 cm possum. What is your predicted head length for this possum?
Is your model good or bad at predicting possum head sizes?