several variables on total winnings of 100 randomly selected
Solution
We are given that the independent variables are,
X1 = Age
X2 = AvgDriveYds
X3 = DriveAcc
X4 = GreensReg
X5 = AvgNumputts
X6 = SavePct
X7 = NumEvents
And the dependent variable (Y) is TotWinnings1000s
Fitted regression model:
Y = 18021.42 + 6.77*X1 - 22.61*X2 - 91.37*X3 + 290.32*X4 - 13745.68*X5 + 32.02*X6 - 24.25*X7
The fitted regression model only include variables that are statistically significant.
Statistically significance means we have to check P-value < 0.05
Variables having P-value < 0.05 that variables are statistically significant.
From the output we can see that P-value for X3 is 0.0081 < 0.05
P-value for X4 is 8.78E-08 < 0.05
P-value for X5 is 0.0053 < 0.05
So the variables X3, X4 and X5 are statistically significant.
The new model will be,
Y = 18021.42 - 91.37*X3 + 290.32*X4 - 13745.68*X5
When using an alpha or type I error 0.05 since the P-value in the ANOVA table is so small, you can conclude that at least one of the independent variables is statistically significant.
This is the correct answer or statement because there are three variables which are statistically significant.
At least one of the independent variable means greator than one independent variable is statistically significant.
R2 = 0.435= 43.5%
R2 represents expresses the proportion of the variation in Y which is explained by variables X1,..........,X7.
The model explains 43.5% of the variability of the response data around its mean.
Adjusted R2 = 0.392
The adjusted R-squared compares the explanatory power of regression models that contain different numbers of predictors.
The adjusted R-squared is a modified version of R-squared that has been adjusted for the number of predictors in the model.
The adjusted R-squared can be negative, but it’s usually not. It is always lower than the R-squared.
If we add independent variabel the R-squared increases, even if due to chance alone. It never decreases. Consequently, a model with more terms may appear to have a better fit simply because it has more terms.
The adjusted R-squared increases only if the new term improves the model more than would be expected by chance.
The proportion of total variability in TotWinnings100s is explained by the linear relationship between TotWinnings1000s with the seven independent variables is 0.435 that is 0.435*100 = 43.5%


