What is residual sum of squares?
The Residual Sum of Squares (RSS), also known as the Sum of Squared Errors (SSE), is a measure of the discrepancy between the actual data points and the values predicted by a regression model.
π Definition:
Where:
-
= actual value of the dependent variable
-
= predicted value from the regression model
-
= residual or error term
π― What Does RSS Represent?
RSS quantifies the total amount of variation in the dependent variable that is not explained by the regression model.
-
A smaller RSS means the model's predictions are closer to actual values → better fit.
-
A larger RSS indicates poor model fit, with more prediction errors.
π Where It Fits in Total Variance
In regression or ANOVA:
Type of Sum of Squares | Formula | Meaning |
---|---|---|
SST (Total) | Total variance in data | |
SSR (Regression) | Explained by model | |
RSS or SSE (Residual) | Not explained by model (errors) |
π§ Finance Example:
Suppose you're predicting monthly sales (₹ in '000s) based on advertising spend. You have the following data:
Obs | Actual Sales (Y) | Predicted Sales (ΕΆ) | Residual (Y - ΕΆ) | Residual² |
---|---|---|---|---|
1 | 220 | 215 | 5 | 25 |
2 | 270 | 265 | 5 | 25 |
3 | 310 | 305 | 5 | 25 |
So, the model left ₹75,000 (in squared units) of sales unexplained. The lower this number, the better your model.
π Summary:
Term | Meaning |
---|---|
RSS / SSE | Sum of squared deviations of actual values from predicted values |
Indicates | Error or unexplained portion in the regression model |
Ideal Value | As low as possible (but not zero unless overfitting) |
Used In | Computing R², Mean Squared Error (MSE), F-ratio |
Comments
Post a Comment