Statistical Evaluation

Evaluating Machine Learning Models

Hans Georg Schaathun

NTNU, Noregs Teknisk-Naturvitskaplege Universitet

22 March 2023

How do you know if your machine learning model is good?

Step 1 (Training)
- Use training data $(\vec{x}_i,\vec{y}_i)$ to find a model $f_{\vec{c}}$.
- Count errors: $N$ objects, $F$ misclassified, error rate $\rho=F/N$
Step 2 (Testing)
- Independent dataset $(\vec{x'}_i,\vec{y'}_i)$ to estimate the error
- Count errors: $N'$ objects, $F'$ misclassified, error rate $\rho'=F'/N'$
Step 3 (Validation)
- Third dataset $(\vec{x''}_i,\vec{y''}_i)$
- Count errors: $N''$ objects, $F''$ misclassified, error rate $\rho''=F''/N''$
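
The error count used in all three steps can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed implementation: the helper name `error_rate` and the toy labels and predictions are hypothetical, and the same function would be applied separately to the training, testing and validation sets.

```python
def error_rate(predictions, labels):
    """Return (N, F, rho): number of objects, number misclassified, and F/N."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    n = len(labels)
    if n == 0:
        raise ValueError("cannot compute an error rate on an empty dataset")
    # An object counts as an error when the predicted class differs from the true class.
    f = sum(1 for p, y in zip(predictions, labels) if p != y)
    return n, f, f / n


# Hypothetical toy data standing in for an independent test set.
test_labels      = [0, 1, 1, 0, 1, 0, 0, 1, 1, 0]
test_predictions = [0, 1, 0, 0, 1, 0, 1, 1, 1, 0]

n_test, f_test, rho_test = error_rate(test_predictions, test_labels)
print(f"N' = {n_test}, F' = {f_test}, rho' = {rho_test:.2f}")
```

Here two of the ten predictions disagree with the labels, so $F'=2$ and $\rho'=0.2$.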

What do the error rates $\rho$, $\rho'$, $\rho''$ tell us?

$$\hat\sigma = \sqrt{\frac{\rho(1-\rho)}{N}}$$
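
To get a feeling for this formula, the sketch below evaluates $\hat\sigma$ for one observed error rate at several dataset sizes. It assumes that the objects are drawn independently, so that the error count is binomially distributed; the function name `standard_error` and the example numbers are hypothetical.

```python
import math


def standard_error(rho, n):
    """Estimated standard deviation of an error rate rho observed on n objects."""
    if n <= 0:
        raise ValueError("n must be positive")
    if not 0.0 <= rho <= 1.0:
        raise ValueError("rho must lie in [0, 1]")
    return math.sqrt(rho * (1.0 - rho) / n)


# Hypothetical example: the same observed error rate at different dataset sizes.
rho = 0.1
for n in (100, 1000, 10000):
    sigma = standard_error(rho, n)
    print(f"N = {n:5d}: rho = {rho:.2f}, sigma_hat = {sigma:.4f}")
```

Since $\hat\sigma$ shrinks proportionally to $1/\sqrt{N}$, a hundred times more data is needed to reduce the uncertainty of the estimate by a factor of ten.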