F Distribution

The F distribution, also known as the variance ratio distribution or the Fisher-Snedecor distribution, is used to model the ratio of two variances. It is primarily used in ANOVA to determine the significance of the variation due to one or more effects compared to the total variation in the sample.

Definition

The F distribution has two parameters: the degrees of freedom of the numerator n and the degrees of freedom of the denominator m. These parameters act as shape parameters.

The probability density function (PDF) of the F distribution is:

f(x)=nn/2mm/2B(n2,m2)xn/21(m+nx)(n+m)/2

where n is the degrees of freedom of the numerator, and m is the degrees of freedom of the denominator.

The cumulative distribution function (CDF) is:

F(x)=Inxnx+m(n2,m2)

where Ix(a,b) is the regularized incomplete beta function.

The F distribution is defined for any non-negative real value (x0), with the parameters n and m being strictly positive integers.

Applications

The F distribution plays a crucial role in statistical inference and hypothesis testing across multiple fields.

  • In Analysis of Variance (ANOVA), the F distribution is used to test whether there are significant differences between the means of multiple groups by comparing the variation between groups to the variation within groups.

  • In regression analysis, the F-test helps determine whether a proposed linear model provides a better fit to the data than a model with no predictors.

  • The F-test is employed in testing the equality of variances between two populations, which is particularly important in determining whether two samples come from populations with the same variability.

  • In multivariate statistics, the F distribution is used in MANOVA (Multivariate Analysis of Variance) to assess the significance of differences between groups across multiple dependent variables simultaneously.

  • Quality control applications use the F distribution to monitor process variability and test whether manufacturing processes maintain consistent variance over time.

Properties

Statistical Properties
PropertyValue
Meanmm2,m>2
Variance2m2(n+m2)n(m2)2(m4),m>4
Skewness(2m+n2)8(m4)(m6)n(m+n2),m>6
Excess Kurtosis12(m2(5n+22)+n(n2))n(m6)(m8)(n+m2),m>8

The F distribution has several notable properties:

  • If X follows an F distribution with parameters n,m, then 1X follows an F distribution with parameters m,n

  • The F distribution is related to the ratio of chi-square distributions: if XFn,m, then X=dU1/nU2/m where U1χn2 and U2χm2

Relationships to Other Distributions

  • The square of a Student's t distribution with n degrees of freedom is equivalent to an F distribution with parameters 1 and n

  • The F distribution is a ratio of independent chi-square distributions divided by their respective degrees of freedom

The FDistribution class

The F distribution is implemented by the FDistribution class. It has one constructor which has two parameters. The following constructs an F distribution with 4 degrees of freedom for the numerator, and 25 degrees of freedom for the denominator:

C#
var f = new FDistribution(4, 25);

The FDistribution class has two specific properties, DenominatorDegreesOfFreedom and NumeratorDegreesOfFreedom, which return the parameters of the distribution.

FDistribution has one static (Shared in Visual Basic) method, Sample, which generates a random sample using a user-supplied uniform random number generator.

C#
var random = new Pcg32();
double sample = FDistribution.Sample(random, 4, 25);

The above example uses the Pcg32 class to generate uniform random numbers.

References

For more information on the F distribution, refer to the following sources:

  • Johnson, N. L., Kotz, S., & Balakrishnan, N. (1994). "Continuous Univariate Distributions, Volume 2", Chapter 27. Wiley Series in Probability and Statistics.

  • Forbes, C., Evans, M., Hastings, N., & Peacock, B. (2011). "Statistical Distributions", Chapter 20. Wiley Series in Probability and Statistics.

  • Fisher, R.A. (1925). "Statistical Methods for Research Workers".

  • Snedecor, G.W., & Cochran, W.G. (1989). "Statistical Methods".

See Also