What is residual sum of squares?
The Residual Sum of Squares (RSS), also known as the Sum of Squared Errors (SSE), is a measure of the discrepancy between the actual data points and the values predicted by a regression model.
📌 Definition:
Where:
-
= actual value of the dependent variable
-
= predicted value from the regression model
-
= residual or error term
🎯 What Does RSS Represent?
RSS quantifies the total amount of variation in the dependent variable that is not explained by the regression model.
-
A smaller RSS means the model's predictions are closer to actual values → better fit.
-
A larger RSS indicates poor model fit, with more prediction errors.
📊 Where It Fits in Total Variance
In regression or ANOVA:
| Type of Sum of Squares | Formula | Meaning |
|---|---|---|
| SST (Total) | Total variance in data | |
| SSR (Regression) | Explained by model | |
| RSS or SSE (Residual) | Not explained by model (errors) |
🧠 Finance Example:
Suppose you're predicting monthly sales (₹ in '000s) based on advertising spend. You have the following data:
| Obs | Actual Sales (Y) | Predicted Sales (Ŷ) | Residual (Y - Ŷ) | Residual² |
|---|---|---|---|---|
| 1 | 220 | 215 | 5 | 25 |
| 2 | 270 | 265 | 5 | 25 |
| 3 | 310 | 305 | 5 | 25 |
So, the model left ₹75,000 (in squared units) of sales unexplained. The lower this number, the better your model.
🔁 Summary:
| Term | Meaning |
|---|---|
| RSS / SSE | Sum of squared deviations of actual values from predicted values |
| Indicates | Error or unexplained portion in the regression model |
| Ideal Value | As low as possible (but not zero unless overfitting) |
| Used In | Computing R², Mean Squared Error (MSE), F-ratio |
Comments
Post a Comment