Discriminant Analysis
Linear Discriminant Analysis (LDA) is a technique for multi-class classification and dimensionality reduction. It is based on the assumption that the observations in each class or group follow a multivariate Gaussian distribution, and that all groups share the same covariance matrix. (A variation that allows each group to have its own covariance matrix is called quadratic discriminant analysis (QDA).)
When used for dimensionality reduction, the features are projected onto the directions that most separate the classes. When used for classification, it is sufficient to consider the distance to the group centroids in the projected space.
Quadratic Discriminant Analysis (QDA) is a generalization of LDA where the covariance matrix of each group is allowed to be different. This can lead to a more flexible model, but it also requires more parameters to be estimated.
The choice between LDA and QDA involves a bias-variance tradeoff. LDA is simpler and more robust when training data is limited, as it estimates fewer parameters. QDA is more flexible and can capture class-specific feature relationships, but requires more training data to estimate the separate covariance matrices reliably. QDA is particularly useful when classes have notably different covariance structures or when the decision boundaries between classes are clearly non-linear.
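For reference, the textbook form of the discriminant functions underlying both models is shown below. This is the standard statistical formulation, not notation taken from the library: here x is an observation, μ_k the mean of class k, π_k its prior probability, and Σ (or Σ_k) the covariance matrix.

\[
\delta_k^{\mathrm{LDA}}(x) = x^\top \Sigma^{-1} \mu_k - \tfrac{1}{2}\mu_k^\top \Sigma^{-1} \mu_k + \log \pi_k,
\qquad
\delta_k^{\mathrm{QDA}}(x) = -\tfrac{1}{2}\log\lvert\Sigma_k\rvert - \tfrac{1}{2}(x - \mu_k)^\top \Sigma_k^{-1} (x - \mu_k) + \log \pi_k.
\]

An observation is assigned to the class with the largest discriminant score. With a shared covariance matrix the quadratic term cancels between any two classes and the decision boundary is linear; with class-specific covariance matrices the boundary is quadratic.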
Linear Discriminant Analysis
Linear discriminant analysis models are implemented by the LinearDiscriminantAnalysis class.
Constructing Linear Discriminant Analysis Models
The LinearDiscriminantAnalysis class has four constructors.
The first constructor takes two arguments. The first is an ICategoricalVector that represents the dependent variable. The second is a parameter array of Vector<T> objects that represent the independent variables.
var dependent = Vector.CreateCategorical(yData);
var independent1 = Vector.Create(x1Data);
var independent2 = Vector.Create(x2Data);
var model1 = new LinearDiscriminantAnalysis(dependent, independent1, independent2);
The second constructor takes three arguments. The first argument is an IDataFrame (a DataFrame<R, C> or Matrix<T>) that contains the variables to be used in the analysis. The second argument is a string containing the name of the dependent variable. The third argument is a parameter array of strings containing the names of the independent variables. All the names must exist in the column index of the data frame specified by the first parameter.
var dataFrame = DataFrame.FromColumns(
( "y", dependent ),
( "x1", independent1 ),
( "x2", independent2 ));
var model2 = new LinearDiscriminantAnalysis(dataFrame, "y", "x1", "x2");
The third constructor takes two arguments. The first argument is once again an IDataFrame (a DataFrame<R, C> or Matrix<T>) that contains the variables to be used in the analysis. The second argument is a string containing a formula that specifies the dependent and independent variables. See the section on Defining models using formulas for details of formula syntax.
var model3 = new LinearDiscriminantAnalysis(dataFrame, "y ~ x1 + x2");
Fitting the Model
The Fit method performs the actual analysis. Most properties and methods throw an exception when they are accessed before the Fit method is called. You can verify that the model has been calculated by inspecting the Fitted property.
model1.Fit();
The Predictions property returns a CategoricalVector<T> that contains the values of the dependent variable as predicted by the model. The PredictedProbabilities property returns a Matrix<T> that gives the probability of each outcome for each observation. A related property, PredictedLogProbabilities, returns the natural logarithm of the predicted probabilities. The ProbabilityResiduals property returns a matrix containing the difference between the actual (0 or 1) and the predicted probabilities.
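As a minimal sketch (assuming model1 from the example above has been fitted), these properties might be inspected as follows:
// Predicted class label for each observation:
var predictedClasses = model1.Predictions;
// One row per observation, one column per class:
var probabilities = model1.PredictedProbabilities;
// Difference between actual (0 or 1) and predicted probabilities:
var residuals = model1.ProbabilityResiduals;
Console.WriteLine($"Predicted class of the first observation: {predictedClasses[0]}");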
Discriminant Functions
The result of a discriminant analysis is a set of discriminant functions. These are linear functions of the variables along the directions that best separate the observations in each group from those in the other groups. The LinearDiscriminantAnalysis class' DiscriminantFunctions property returns a collection of LinearDiscriminantFunction objects that represent the discriminant functions. The discriminant functions are constructed by the model. You cannot create them directly.
Discriminant functions perform a role similar to factors in Factor Analysis or components in Principal Component Analysis (PCA). Discriminant functions are based on a generalized eigenvalue decomposition. The corresponding eigenvalue and eigenvector can be accessed through the Eigenvalue and Eigenvector properties. The EigenvalueDifference property returns the difference between the eigenvalues of the discriminant function and the next most significant discriminant function. The ProportionOfVariance and CumulativeProportionOfVariance properties give the contribution of the discriminant function to the variation in the data in relative terms.
The CanonicalCorrelation property returns the canonical correlation between the discriminant function and the groups. A higher correlation indicates that the function is more relevant for separating the groups. Similarly, WilksLambda returns Wilks' lambda, a statistic that is based on the canonical correlations.
The significance of a discriminant function can be quantified further using a hypothesis test based on Wilks' lambda. This is an F test that is exact when the number of groups is less than 3 or when the number of included functions is 1 or 2. The GetFTest() method returns this test.
In the example below, these properties are printed for the discriminant functions from the earlier example:
Console.WriteLine(" # Eigenvalue Difference Contribution Contrib. % Can.Corr F stat. df1 df2");
for (int i = 0; i < model1.DiscriminantFunctions.Count; i++)
{
var fn = model1.DiscriminantFunctions[i];
var f = fn.GetFTest();
Console.WriteLine("{0,2}{1,12:F4}{2,11:F4}{3,14:F3}%{4,10:F3}%{5,9:F4}{6,9:F3}{7,9:F3}{8,4}{9,4}",
i, fn.Eigenvalue, fn.EigenvalueDifference,
100 * fn.ProportionOfVariance,
100 * fn.CumulativeProportionOfVariance,
fn.CanonicalCorrelation,
fn.WilksLambda,
f.Statistic,
f.NumeratorDegreesOfFreedom,
f.DenominatorDegreesOfFreedom);
}
Classification
When using a linear discriminant analysis for classification, the probability that an observation belongs to each class is computed. The class with the highest probability is selected.
The Predict method takes a vector or data frame and produces the model's prediction for the supplied data. When a single observation is supplied (as a vector), the method returns an integer that is the level index of the predicted class. When multiple observations are supplied, the method returns a vector of level indexes.
Similarly, the PredictProbabilities method returns the predicted probabilities for each class. When a single observation is supplied (as a vector), the method returns a vector that contains the probabilities that the observation belongs to each of the classes. When multiple observations are supplied, the method returns a matrix, where each row contains the probabilities for the corresponding observation.
var index = model1.Predict(Vector.Create(1.2, 3.0));
var input = Matrix.CreateRandom(10, 2);
var predictions = model1.Predict(input);
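A brief sketch of PredictProbabilities, continuing the same example (the inputs are the ones defined above):
// Probabilities for a single observation (one entry per class):
var classProbabilities = model1.PredictProbabilities(Vector.Create(1.2, 3.0));
// Probabilities for multiple observations (one row per observation):
var allProbabilities = model1.PredictProbabilities(input);
Console.WriteLine($"Class probabilities: {classProbabilities}");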
Dimensionality Reduction
When used for dimensionality reduction, the observations are projected onto the directions that most separate the classes. The LinearDiscriminantAnalysis class implements the ITransformationModel interface to support this operation. The Transform(Matrix<Double>) method performs the projection. It takes one argument: a matrix whose rows contain the observations. It returns a matrix whose columns are the projections of the original features on the discriminant directions.
var transformed = model1.Transform(dataFrame.ToMatrix<double>());
Quadratic Discriminant Analysis
Quadratic discriminant analysis models are implemented by the QuadraticDiscriminantAnalysis class. The class is very similar to LinearDiscriminantAnalysis. The main differences are in the discriminant functions, which are quadratic in the case of QDA, and in the diagnostics.
Constructing Quadratic Discriminant Analysis Models
The QuadraticDiscriminantAnalysis class has four constructors.
The first constructor takes two arguments. The first is an ICategoricalVector that represents the dependent variable. The second is a parameter array of Vector<T> objects that represent the independent variables.
var dependent = Vector.CreateCategorical(yData);
var independent1 = Vector.Create(x1Data);
var independent2 = Vector.Create(x2Data);
var qda1 = new QuadraticDiscriminantAnalysis(dependent, independent1, independent2);
The second constructor takes three arguments. The first argument is an IDataFrame (a DataFrame<R, C> or Matrix<T>) that contains the variables to be used in the analysis. The second argument is a string containing the name of the dependent variable. The third argument is a parameter array of strings containing the names of the independent variables. All the names must exist in the column index of the data frame specified by the first parameter.
var dataFrame = DataFrame.FromColumns(
("y", dependent),
("x1", independent1),
("x2", independent2));
var qda2 = new QuadraticDiscriminantAnalysis(dataFrame, "y", "x1", "x2");
The third constructor takes two arguments. The first argument is once again an IDataFrame (a DataFrame<R, C> or Matrix<T>) that contains the variables to be used in the analysis. The second argument is a string containing a formula that specifies the dependent and independent variables. See the section on Defining models using formulas for details of formula syntax.
var qda3 = new QuadraticDiscriminantAnalysis(dataFrame, "y ~ x1 + x2");
Fitting the Model
The Fit method performs the actual analysis. Most properties and methods throw an exception when they are accessed before the Fit method is called. You can verify that the model has been calculated by inspecting the Fitted property.
qda1.Fit();
The Predictions property returns a CategoricalVector<T> that contains the values of the dependent variable as predicted by the model. The PredictedProbabilities property returns a Matrix<T> that gives the probability of each outcome for each observation. A related property, PredictedLogProbabilities, returns the natural logarithm of the predicted probabilities. The ProbabilityResiduals property returns a matrix containing the difference between the actual (0 or 1) and the predicted probabilities.
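As with LDA, these properties become available once the model has been fitted; a brief illustrative sketch:
// Predicted class labels and per-class probabilities for the training data:
var qdaPredicted = qda1.Predictions;
var qdaProbabilities = qda1.PredictedProbabilities;
Console.WriteLine($"Predicted class of the first observation: {qdaPredicted[0]}");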
Discriminant Functions
The result of a discriminant analysis is a set of discriminant functions. In QDA, these are quadratic functions of the variables that best separate the observations in each group from those in the other groups. The QuadraticDiscriminantAnalysis class' DiscriminantFunctions property returns a collection of QuadraticDiscriminantFunction objects that represent the discriminant functions. The discriminant functions are constructed by the model. You cannot create them directly.
Discriminant functions perform a role similar to factors in Factor Analysis or components in Principal Component Analysis (PCA).
Each discriminant function provides the following key properties:
The Mean property returns the group centroid (mean vector) in the original feature space.
The CovarianceMatrix property returns the group-specific covariance matrix that captures the feature relationships within the group.
The Prior property returns the prior probability of group membership.
The ErrorRate property returns the estimated misclassification rate for this group.
The MahalanobisDistances property returns the distances from this group's centroid to all other group centroids.
The Separability property returns measures of how well this group can be distinguished from others.
In the example below, these properties are printed for the discriminant functions from the earlier example:
foreach (var function in qda1.DiscriminantFunctions)
{
Console.WriteLine($"Mean: {function.Mean}");
Console.WriteLine($"Covariance Matrix: {function.CovarianceMatrix}");
Console.WriteLine($"Prior: {function.Prior}");
Console.WriteLine($"Error Rate: {function.ErrorRate}");
Console.WriteLine($"Mahalanobis Distances: {function.MahalanobisDistances}");
Console.WriteLine($"Separability: {function.Separability}");
}
Model Properties and Diagnostics
The QDA model provides several diagnostic properties to assess the quality of classification:
The GroupCovariances property returns the estimated covariance matrices for each group.
The GroupMeans property returns the centroids of each group in feature space.
The MahalanobisDistances property returns a matrix of distances between group centroids.
The GroupSeparability and TotalSeparability properties quantify how well groups can be distinguished.
The ErrorRates and GroupErrorRates properties provide theoretical estimates of misclassification rates.
These properties can be used to assess model fit and identify potential classification challenges:
// Examine group separability
Console.WriteLine($"Total separability: {qda1.TotalSeparability}");
Console.WriteLine($"Pairwise separability:\n{qda1.GroupSeparability}");
// Check error rates
Console.WriteLine($"Group error rates: {qda1.GroupErrorRates}");
Classification
When using a quadratic discriminant analysis for classification, the probability that an observation belongs to each class is computed. The class with the highest probability is selected.
The Predict method takes a vector or data frame and produces the model's prediction for the supplied data. When a single observation is supplied (as a vector), the method returns an integer that is the level index of the predicted class. When multiple observations are supplied, the method returns a vector of level indexes.
Similarly, the PredictProbabilities method returns the predicted probabilities for each class. When a single observation is supplied (as a vector), the method returns a vector that contains the probabilities that the observation belongs to each of the classes. When multiple observations are supplied, the method returns a matrix, where each row contains the probabilities for the corresponding observation.
var index = qda1.Predict(Vector.Create(1.2, 3.0));
var input = Matrix.CreateRandom(10, 2);
var predictions = qda1.Predict(input);
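The PredictProbabilities method works the same way for QDA; a minimal sketch using the inputs defined above:
// Per-class probabilities for a single observation and for a batch of observations:
var qdaClassProbabilities = qda1.PredictProbabilities(Vector.Create(1.2, 3.0));
var qdaAllProbabilities = qda1.PredictProbabilities(input);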