Machine Learning Metrics
Regression
- Root Mean Square Error
- Mean Absolute Error
- Mean Error
- R Square
- Adjusted R Square
Classification
- \(\mathcal{X}\),
- the domain of inputs
- \(\mathcal{Y} := \{0, 1\}\),
- $f: \mathcal{X} \rightarrow \mathcal{Y}$,
- prediction model
- $n \in \mathbb{N}$,
- the number of data
- \(\{(x_{i}, y_{i})\}_{i=1,\ldots,n} \subseteq \mathcal{X} \times \mathcal{Y}\),
- true positive
- the number of samples which has label 1 and is indentified as 1 by the model
- the model correctly predicts the data belongs to 1 class
- true negative
- the number of samples which has label 0 and is indentified as 0 by the model
- the model correctly predicts the data belongs to 0 class
- False Positive
- falsly choose positve class
- the number of samples which has label 0 and is identified as 1 by the model
- the model predicts positive class but it turns out negative
- False Negative
- falsly choose negative class
- the number of samples which has label 1 and is identified as 0 by the model
- the model predicts negative class but it turns out positive
- accuracy
- True Positive plus True Negative over all data
- (TP + TN) over (TP + TN + FP + FN)
- the number of correct predictions over total number of predictions
- high accuracy means that
- precision
- True Positive over the number of predictions predicted as Positve class
- TP over TP + FP
- The value of precision is lower may mean the model predicts positive class too much
- recall
- True Positve Rate
- True Positive over the number of positive classes in data
- TP over TP + FN
- The value of recall is lower may mean that the model predicts negative class too much
- False Positive Rate
- FP over FP + TN
- high False Positive Rate means that the model fail to predict negative class more
- False Discovery Rate
- FP over FN + TP
- Receiver Operating Characteristic curve
- draw curve whose x axis is FP rate and whose y axis is TP rate by changing the threshold of the binary-prediction model
- Area Under the Curve
- ROC-AUC
- Accuracy
- Log-Loss
- F1-score
Unsupervisised
- Rand index
- Mutual Information
Others
- CV Error
- Heuristic methods to find K
- Bilingual Evaluation Understudy