202511171430
Status: idea
Tags: Datascience, Machine Learning, Model Evaluation Metrics

Regression Metrics

Regression metrics are essential tools used to evaluate the performance of a regression model. Regression models are statistical models that predict a continuous outcome (a real number) based on one or more input variables.

These metrics measure the difference between the predicted values ( $\overset{y}{^}_{i}$ ) and the actual observed values ( $y_{i}$ ), giving you an idea of how well the model is fitting the data and how accurate its predictions are. The image you provided lists three of the most common regression metrics: Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and the Coefficient of Determination ( $R^{2}$ ).

Common Regression Metrics

Mean Squared Error (MSE)

The Mean Squared Error (MSE) is the average of the squared differences between the predicted and actual values.

$MSE = \frac{1}{n} \sum_{i = 1}^{n} (y_{i} - \overset{y}{^}_{i})^{2}$

$y_{i}$ : The actual observed value.
$\overset{y}{^}_{i}$ : The predicted value from the model.
$n$ : The total number of data points.

Key features:

It penalizes large errors more heavily than small errors because the differences are squared.
The resulting error unit is squared, which can make it difficult to interpret in the context of the original target variable.
A lower MSE indicates a better model fit.

Root Mean Squared Error (RMSE)

The Root Mean Squared Error (RMSE) is the square root of the MSE.

$RMSE = MSE$

Key features:

It brings the error unit back to the same units as the target variable, making it more interpretable than the MSE.
Like MSE, a lower RMSE indicates a better model fit.
It is often the most preferred metric for evaluating regression models because of its interpretability.

Coefficient of Determination ( $R^{2}$ )

The Coefficient of Determination ( $R^{2}$ ) is a measure that indicates the proportion of the variance in the dependent variable that is predictable from the independent variables.

$R^{2} = 1 - \frac{\sum _{i = 1}^{n} ( y _{i} - y ^ _{i} ) ^{2}}{\sum _{i = 1}^{n} ( y _{i} - y ˉ ) ^{2}}$

$\overset{y}{ˉ}$ : The mean of the actual observed values.
The numerator is the unexplained variance (sum of squared errors, similar to MSE’s numerator).
The denominator is the total variance of the data (variance if the model was just the mean).

Key features:

The $R^{2}$ value ranges from 0 to 1.
An $R^{2}$ of 1 means the model perfectly predicts the target variable’s variance (perfect fit).
An $R^{2}$ of 0 means the model explains none of the variability of the response data around its mean.
It’s a relative measure and is often used to compare models.

References

Dit is iets wat we leren voor Datascience. dit was informatie vanuit avans 2-1 datascience 2025-11-10. en daarbij horen deze slides

🌵OldMartijntje

Explorer

Regression Metrics

Regression Metrics

Common Regression Metrics

Mean Squared Error (MSE)

Root Mean Squared Error (RMSE)

Coefficient of Determination ( $R^{2}$ )

References

Graph View

Table of Contents

Backlinks

🌵OldMartijntje

Explorer

Regression Metrics

Regression Metrics

Common Regression Metrics

Mean Squared Error (MSE)

Root Mean Squared Error (RMSE)

Coefficient of Determination (R2)

References

Graph View

Table of Contents

Backlinks

Coefficient of Determination ( $R^{2}$ )