Package 'ModelMetrics' reference manual

Title:	Rapid Calculation of Model Metrics
Description:	Collection of metrics for evaluating models written in C++ using 'Rcpp'. Popular metrics include area under the curve, log loss, root mean square error, etc.
Authors:	Tyler Hunt [aut, cre]
Maintainer:	Tyler Hunt <thunt@snapfinance.com>
License:	GPL (>= 2)
Version:	1.2.3
Built:	2025-03-08 04:30:49 UTC
Source:	https://github.com/jackstat/modelmetrics

Area Under the Curve

Description

Calculates the area under the curve for a binary classifcation model

Usage

auc(...)

## Default S3 method:
auc(actual, predicted, ...)

## S3 method for class 'glm'
auc(modelObject, ...)

## S3 method for class 'randomForest'
auc(modelObject, ...)

## S3 method for class 'glmerMod'
auc(modelObject, ...)

## S3 method for class 'gbm'
auc(modelObject, ...)

## S3 method for class 'rpart'
auc(modelObject, ...)
auc(...)

## Default S3 method:
auc(actual, predicted, ...)

## S3 method for class 'glm'
auc(modelObject, ...)

## S3 method for class 'randomForest'
auc(modelObject, ...)

## S3 method for class 'glmerMod'
auc(modelObject, ...)

## S3 method for class 'gbm'
auc(modelObject, ...)

## S3 method for class 'rpart'
auc(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	A vector of the labels. Can be `numeric, character, or factor`
`predicted`	A vector of predicted values
`modelObject`	the model object. Currently supported `glm, randomForest, glmerMod, gbm`

Examples

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

auc(testDF$y, Preds)
# using s3 method for glm
auc(glmModel)


data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

auc(testDF$y, Preds)
# using s3 method for glm
auc(glmModel)

Brier Score

Description

Calculates the Brier score

Usage

brier(...)

## Default S3 method:
brier(actual, predicted, ...)

## S3 method for class 'glm'
brier(modelObject, ...)

## S3 method for class 'randomForest'
brier(modelObject, ...)

## S3 method for class 'glmerMod'
brier(modelObject, ...)

## S3 method for class 'gbm'
brier(modelObject, ...)

## S3 method for class 'rpart'
brier(modelObject, ...)
brier(...)

## Default S3 method:
brier(actual, predicted, ...)

## S3 method for class 'glm'
brier(modelObject, ...)

## S3 method for class 'randomForest'
brier(modelObject, ...)

## S3 method for class 'glmerMod'
brier(modelObject, ...)

## S3 method for class 'gbm'
brier(modelObject, ...)

## S3 method for class 'rpart'
brier(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	A vector of the labels
`predicted`	A vector of predicted values
`modelObject`	the model object. Currently supported `glm, randomForest, glmerMod, gbm`

Classification error

Description

Calculates the classification error

Usage

ce(...)

## Default S3 method:
ce(actual, predicted, ...)

## S3 method for class 'lm'
ce(modelObject, ...)

## S3 method for class 'glm'
ce(modelObject, ...)

## S3 method for class 'randomForest'
ce(modelObject, ...)

## S3 method for class 'glmerMod'
ce(modelObject, ...)

## S3 method for class 'gbm'
ce(modelObject, ...)

## S3 method for class 'rpart'
ce(modelObject, ...)
ce(...)

## Default S3 method:
ce(actual, predicted, ...)

## S3 method for class 'lm'
ce(modelObject, ...)

## S3 method for class 'glm'
ce(modelObject, ...)

## S3 method for class 'randomForest'
ce(modelObject, ...)

## S3 method for class 'glmerMod'
ce(modelObject, ...)

## S3 method for class 'gbm'
ce(modelObject, ...)

## S3 method for class 'rpart'
ce(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	A vector of the labels
`predicted`	A vector of predicted values
`modelObject`	the model object. Currently supported `lm, glm, randomForest, glmerMod, gbm, rpart`

Confusion Matrix

Description

Create a confusion matrix given a specific cutoff.

Usage

confusionMatrix(actual, predicted, cutoff = 0.5, use_names = FALSE)
confusionMatrix(actual, predicted, cutoff = 0.5, use_names = FALSE)

Arguments

`actual`	A vector of the labels
`predicted`	A vector of predicted values
`cutoff`	A cutoff for the predicted values
`use_names`	If `TRUE`, the dimnames of the output matrix will be labelled to indicate which axes show the actual and predicted classes.

Examples

actual <- c(0, 0, 1)
predicted <- c(1, 0, 1)

confusionMatrix(actual, predicted)

confusionMatrix(actual, predicted, use_names = TRUE)

actual <- c(0, 0, 1)
predicted <- c(1, 0, 1)

confusionMatrix(actual, predicted)

confusionMatrix(actual, predicted, use_names = TRUE)

F1 Score

Description

Calculates the f1 score

Usage

f1Score(actual, predicted, cutoff = 0.5)
f1Score(actual, predicted, cutoff = 0.5)

Arguments

`actual`	A vector of the labels
`predicted`	A vector of predicted values
`cutoff`	A cutoff for the predicted values

F Score

Description

Calculates the F score and allows different specifications of the beta value (F0.5)

Usage

fScore(actual, predicted, cutoff = 0.5, beta = 1)
fScore(actual, predicted, cutoff = 0.5, beta = 1)

Arguments

`actual`	A vector of the labels
`predicted`	A vector of predicted values
`cutoff`	A cutoff for the predicted values
`beta`	the desired beta value (lower increases weight of precision over recall). Defaults to 1

GINI Coefficient

Description

Calculates the GINI coefficient for a binary classifcation model

Usage

gini(...)

## Default S3 method:
gini(actual, predicted, ...)

## S3 method for class 'glm'
gini(modelObject, ...)

## S3 method for class 'randomForest'
gini(modelObject, ...)

## S3 method for class 'glmerMod'
gini(modelObject, ...)

## S3 method for class 'gbm'
gini(modelObject, ...)

## S3 method for class 'rpart'
gini(modelObject, ...)
gini(...)

## Default S3 method:
gini(actual, predicted, ...)

## S3 method for class 'glm'
gini(modelObject, ...)

## S3 method for class 'randomForest'
gini(modelObject, ...)

## S3 method for class 'glmerMod'
gini(modelObject, ...)

## S3 method for class 'gbm'
gini(modelObject, ...)

## S3 method for class 'rpart'
gini(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	A vector of the labels. Can be `numeric, character, or factor`
`predicted`	A vector of predicted values
`modelObject`	the model object. Currently supported `glm, randomForest, glmerMod, gbm`

Examples

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

gini(testDF$y, Preds)
# using s3 method for glm
gini(glmModel)

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

gini(testDF$y, Preds)
# using s3 method for glm
gini(glmModel)

kappa statistic

Description

Calculates kappa statistic. Currently build to handle binary values in actual vector.

Usage

kappa(actual, predicted, cutoff = 0.5)
kappa(actual, predicted, cutoff = 0.5)

Arguments

`actual`	A vector of the labels
`predicted`	A vector of predicted values
`cutoff`	A cutoff for the predicted values

Log Loss

Description

Calculates the log loss or entropy loss for a binary outcome

Usage

logLoss(...)

## Default S3 method:
logLoss(actual, predicted, distribution = "binomial", ...)

## S3 method for class 'glm'
logLoss(modelObject, ...)

## S3 method for class 'randomForest'
logLoss(modelObject, ...)

## S3 method for class 'glmerMod'
logLoss(modelObject, ...)

## S3 method for class 'gbm'
logLoss(modelObject, ...)

## S3 method for class 'rpart'
logLoss(modelObject, ...)
logLoss(...)

## Default S3 method:
logLoss(actual, predicted, distribution = "binomial", ...)

## S3 method for class 'glm'
logLoss(modelObject, ...)

## S3 method for class 'randomForest'
logLoss(modelObject, ...)

## S3 method for class 'glmerMod'
logLoss(modelObject, ...)

## S3 method for class 'gbm'
logLoss(modelObject, ...)

## S3 method for class 'rpart'
logLoss(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	a binary vector of the labels
`predicted`	a vector of predicted values
`distribution`	the distribution of the loss function needed `binomial, poisson`
`modelObject`	the model object. Currently supported `glm, randomForest, glmerMod, gbm`

Examples

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

logLoss(testDF$y, Preds)
# using s3 method for glm
logLoss(glmModel)

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

logLoss(testDF$y, Preds)
# using s3 method for glm
logLoss(glmModel)

Mean absolute error

Description

Calculates the mean absolute error

Usage

mae(...)

## Default S3 method:
mae(actual, predicted, ...)

## S3 method for class 'glm'
mae(modelObject, ...)

## S3 method for class 'randomForest'
mae(modelObject, ...)

## S3 method for class 'glmerMod'
mae(modelObject, ...)

## S3 method for class 'gbm'
mae(modelObject, ...)

## S3 method for class 'rpart'
mae(modelObject, ...)
mae(...)

## Default S3 method:
mae(actual, predicted, ...)

## S3 method for class 'glm'
mae(modelObject, ...)

## S3 method for class 'randomForest'
mae(modelObject, ...)

## S3 method for class 'glmerMod'
mae(modelObject, ...)

## S3 method for class 'gbm'
mae(modelObject, ...)

## S3 method for class 'rpart'
mae(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	A vector of the labels
`predicted`	A vector of predicted values
`modelObject`	the model object. Currently supported `glm, randomForest, glmerMod, gbm`

Multiclass Area Under the Curve

Description

Calculates the area under the curve for a binary classifcation model

Usage

mauc(actual, predicted)
mauc(actual, predicted)

Arguments

`actual`	A vector of the labels. Can be `numeric, character, or factor`
`predicted`	A data.frame of predicted values. Can be `matrix, data.frame`

Examples

setosa <- glm(I(Species == 'setosa') ~ Sepal.Length, data = iris, family = 'binomial')
versicolor <- glm(I(Species == 'versicolor') ~ Sepal.Length, data = iris, family = 'binomial')
virginica <- glm(I(Species == 'virginica') ~ Sepal.Length, data = iris, family = 'binomial')

Pred <-
  data.frame(
    setosa = predict(setosa, type = 'response')
    ,versicolor = predict(versicolor, type = 'response')
    ,virginica = predict(virginica, type = 'response')
  )

Predicted = Pred/rowSums(Pred)
Actual = iris$Species

mauc(Actual, Predicted)

setosa <- glm(I(Species == 'setosa') ~ Sepal.Length, data = iris, family = 'binomial')
versicolor <- glm(I(Species == 'versicolor') ~ Sepal.Length, data = iris, family = 'binomial')
virginica <- glm(I(Species == 'virginica') ~ Sepal.Length, data = iris, family = 'binomial')

Pred <-
  data.frame(
    setosa = predict(setosa, type = 'response')
    ,versicolor = predict(versicolor, type = 'response')
    ,virginica = predict(virginica, type = 'response')
  )

Predicted = Pred/rowSums(Pred)
Actual = iris$Species

mauc(Actual, Predicted)

Matthews Correlation Coefficient

Description

Calculates the Matthews Correlation Coefficient

Usage

mcc(actual, predicted, cutoff)
mcc(actual, predicted, cutoff)

Arguments

`actual`	A vector of the labels
`predicted`	A vector of predicted values
`cutoff`	A cutoff for the predicted values

Multiclass Log Loss

Description

Calculated the multi-class log loss

Usage

mlogLoss(actual, predicted)
mlogLoss(actual, predicted)

Arguments

`actual`	A vector of the labels. Can be `numeric, character, or factor`
`predicted`	matrix of predicted values. Can be `matrix, data.frame`

Mean Square Error

Description

Calculates the mean square error

Usage

mse(...)

## Default S3 method:
mse(actual, predicted, ...)

## S3 method for class 'lm'
mse(modelObject, ...)

## S3 method for class 'glm'
mse(modelObject, ...)
mse(...)

## Default S3 method:
mse(actual, predicted, ...)

## S3 method for class 'lm'
mse(modelObject, ...)

## S3 method for class 'glm'
mse(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	A vector of the labels
`predicted`	A vector of predicted values
`modelObject`	the model object. Currently supported `lm`

Examples

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

mse(testDF$y, Preds)

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

mse(testDF$y, Preds)

Mean Squared Log Error

Description

Calculates the mean square log error

Usage

msle(...)

## Default S3 method:
msle(actual, predicted, ...)

## S3 method for class 'lm'
msle(modelObject, ...)

## S3 method for class 'glm'
msle(modelObject, ...)

## S3 method for class 'randomForest'
msle(modelObject, ...)

## S3 method for class 'glmerMod'
msle(modelObject, ...)

## S3 method for class 'gbm'
msle(modelObject, ...)

## S3 method for class 'rpart'
msle(modelObject, ...)
msle(...)

## Default S3 method:
msle(actual, predicted, ...)

## S3 method for class 'lm'
msle(modelObject, ...)

## S3 method for class 'glm'
msle(modelObject, ...)

## S3 method for class 'randomForest'
msle(modelObject, ...)

## S3 method for class 'glmerMod'
msle(modelObject, ...)

## S3 method for class 'gbm'
msle(modelObject, ...)

## S3 method for class 'rpart'
msle(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	A vector of the labels
`predicted`	A vector of predicted values
`modelObject`	the model object. Currently supported `glm, randomForest, glmerMod, gbm`

Negative Predictive Value

Description

True Negatives / (True Negatives + False Negatives)

Usage

npv(actual, predicted, cutoff = 0.5)
npv(actual, predicted, cutoff = 0.5)

Arguments

`actual`	A vector of the labels
`predicted`	A vector of predicted values
`cutoff`	A cutoff for the predicted values

Examples

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

npv(testDF$y, Preds, cutoff = 0)

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

npv(testDF$y, Preds, cutoff = 0)

Positive Predictive Value

Description

True Positives / (True Positives + False Positives)

Usage

ppv(actual, predicted, cutoff = 0.5)
ppv(actual, predicted, cutoff = 0.5)

Arguments

`actual`	A vector of the labels
`predicted`	A vector of predicted values
`cutoff`	A cutoff for the predicted values

Examples

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

ppv(testDF$y, Preds, cutoff = 0)
precision(testDF$y, Preds, cutoff = 0)

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

ppv(testDF$y, Preds, cutoff = 0)
precision(testDF$y, Preds, cutoff = 0)

Recall, Sensitivity, tpr

Description

True Positives / (True Positives + False Negatives)

Usage

recall(actual, predicted, cutoff = 0.5)
recall(actual, predicted, cutoff = 0.5)

Arguments

`actual`	A vector of the labels
`predicted`	A vector of predicted values
`cutoff`	A cutoff for the predicted values

Examples

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

recall(testDF$y, Preds, cutoff = 0)
sensitivity(testDF$y, Preds, cutoff = 0)
tpr(testDF$y, Preds, cutoff = 0)

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

recall(testDF$y, Preds, cutoff = 0)
sensitivity(testDF$y, Preds, cutoff = 0)
tpr(testDF$y, Preds, cutoff = 0)

Root-Mean Square Error

Description

Calculates the root mean square error

Usage

rmse(...)

## Default S3 method:
rmse(actual, predicted, ...)

## S3 method for class 'lm'
rmse(modelObject, ...)

## S3 method for class 'glm'
rmse(modelObject, ...)
rmse(...)

## Default S3 method:
rmse(actual, predicted, ...)

## S3 method for class 'lm'
rmse(modelObject, ...)

## S3 method for class 'glm'
rmse(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	A vector of the labels
`predicted`	A vector of predicted values
`modelObject`	the model object. Currently supported `lm`

Examples

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

rmse(testDF$y, Preds)

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

rmse(testDF$y, Preds)

Root Mean Squared Log Error

Description

Calculates the mean square log error

Usage

rmsle(...)

## Default S3 method:
rmsle(actual, predicted, ...)

## S3 method for class 'lm'
rmsle(modelObject, ...)

## S3 method for class 'glm'
rmsle(modelObject, ...)

## S3 method for class 'randomForest'
rmsle(modelObject, ...)

## S3 method for class 'glmerMod'
rmsle(modelObject, ...)

## S3 method for class 'gbm'
rmsle(modelObject, ...)

## S3 method for class 'rpart'
rmsle(modelObject, ...)
rmsle(...)

## Default S3 method:
rmsle(actual, predicted, ...)

## S3 method for class 'lm'
rmsle(modelObject, ...)

## S3 method for class 'glm'
rmsle(modelObject, ...)

## S3 method for class 'randomForest'
rmsle(modelObject, ...)

## S3 method for class 'glmerMod'
rmsle(modelObject, ...)

## S3 method for class 'gbm'
rmsle(modelObject, ...)

## S3 method for class 'rpart'
rmsle(modelObject, ...)

Arguments

`...`	additional parameters to be passed the the s3 methods
`actual`	A vector of the labels
`predicted`	A vector of predicted values
`modelObject`	the model object. Currently supported `glm, randomForest, glmerMod, gbm`

Test data

Description

Test data

Specificity, True negative rate

Description

True Negatives / (True Negatives + False Positives)

Usage

tnr(actual, predicted, cutoff = 0.5)
tnr(actual, predicted, cutoff = 0.5)

Arguments

`actual`	A vector of the labels
`predicted`	A vector of predicted values
`cutoff`	A cutoff for the predicted values

Examples

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

tnr(testDF$y, Preds, cutoff = 0)
specificity(testDF$y, Preds, cutoff = 0)

data(testDF)
glmModel <- glm(y ~ ., data = testDF, family="binomial")
Preds <- predict(glmModel, type = 'response')

tnr(testDF$y, Preds, cutoff = 0)
specificity(testDF$y, Preds, cutoff = 0)

Package 'ModelMetrics'

Help Index

Area Under the Curve

Description

Usage

Arguments

Examples

Brier Score

Description

Usage

Arguments

Classification error

Description

Usage

Arguments

Confusion Matrix

Description

Usage

Arguments

Examples

F1 Score

Description

Usage

Arguments

F Score

Description

Usage

Arguments

GINI Coefficient

Description

Usage

Arguments

Examples

kappa statistic

Description

Usage

Arguments

Log Loss

Description

Usage

Arguments

Examples

Mean absolute error

Description

Usage

Arguments

Multiclass Area Under the Curve

Description

Usage

Arguments

Examples

Matthews Correlation Coefficient

Description

Usage

Arguments

Multiclass Log Loss

Description

Usage

Arguments

Mean Square Error

Description

Usage

Arguments

Examples

Mean Squared Log Error

Description

Usage

Arguments

Negative Predictive Value

Description

Usage

Arguments

Examples

Positive Predictive Value

Description

Usage

Arguments

Examples

Recall, Sensitivity, tpr

Description