Johnson Distributions

The Johnson family of distributions, first investigated by Norman L. Johnson in 1949, is based on transformations of the normal distribution. It includes four distinct types: normal, lognormal, bounded, and unbounded.

Definition

The Johnson family of distributions is defined by transformations of the normal distribution. Each variant has its own Probability Density Function (PDF) and Cumulative Distribution Function (CDF).

Normal (SN)

The identity transformation, equivalent to the normal distribution:

f(x)=1λ2πexp(12(xξλ)2)F(x)=Φ(xξλ)
Unbounded (SU)

Based on a hyperbolic sine transformation:

f(x)=δλ2π11+((xξ)/λ)2exp(12(γ+δsinh1(xξλ))2)F(x)=Φ(γ+δsinh1(xξλ))
Bounded (SB)

Based on a logistic transformation, bounded between ξ and ξ+λ:

f(x)=δλ2π1(xξ)(1(xξ)/λ)exp(12(γ+δln(xξλ(xξ)))2)F(x)=Φ(γ+δln(xξλ(xξ)))
LogNormal (SL)

Based on an exponential transformation, bounded on the left by ξ:

f(x)=δλ2π1xξexp(12(γ+δln(xξλ))2)F(x)=Φ(γ+δln(xξλ))

All Johnson distributions have a location parameter ξ and a scale parameter λ. The bounded, unbounded, and lognormal variants have two additional shape parameters γ and δ. Here, Φ represents the standard normal CDF.

Applications

Johnson distributions are used in various fields such as finance, hydrology, and environmental science to model data that do not fit well with normal distributions. They are particularly useful for modeling skewed and bounded data.

The SB bounded distribution is commonly used in quality control to model manufacturing processes with natural upper and lower limits. The SU unbounded distribution finds applications in financial modeling where data can be highly skewed but theoretically unbounded.

The SL lognormal variant is frequently applied in environmental monitoring to model pollutant concentrations, which cannot be negative but can have large positive values. In reliability engineering, Johnson distributions are used to model failure times and material strength distributions.

Properties

The statistical properties depend on the specific Johnson distribution type. For the SN type, see the normal distribution.

For the SU (Unbounded) type:

Statistical Properties (SU)
PropertyValue
Meanξλsinh(γ/δ)exp(12δ2)
Varianceλ22[exp(1/δ2)(cosh(2γ/δ)+2)1]
Modeξ+λsinh(γ/δ)
Medianξ+λsinh(γ/δ)

For the SL (LogNormal) type:

Statistical Properties (SL)
PropertyValue
Meanξ+λexp(γ/δ+12δ2)
Varianceλ2exp(2γ/δ+1/δ2)[exp(1/δ2)1]
Modeξ+λexp(γ/δ1/δ2)
Medianξ+λexp(γ/δ)

For the SB (Bounded) type:

Statistical Properties (SB)
PropertyValue
Meanξ+λ/(1+exp((γ/δ+1/2δ2)))
Varianceλ2p(1p)[1+(12p)(γ/δ)] where p=1/(1+exp(γ/δ))
Modeξ+λ/(1+exp(γ/δ))
Medianξ+λ/(1+exp(γ/δ))

Notable properties include:

  • All Johnson distributions are transformations of the standard normal distribution.

  • The SB type is bounded on both sides by ξ and ξ+λ.

  • The SL type is bounded on the left by ξ.

Relationships to Other Distributions

The JohnsonDistribution class

The Johnson family of distributions is implemented by the JohnsonDistribution class. It has two constructors. The first constructor takes five arguments. These are the location and scale parameters, the two shape parameters, and a JohnsonDistributionType value that specifies which variant is being defined.

The following constructs an SU distribution with location parameter 6.8, scale parameter 4.1, and shape parameters 1.3 and 3.2:

C#
var johnsonu = new JohnsonDistribution(6.8, 4.1, 1.3, 3.2, 
    JohnsonDistributionType.Unbounded);

The second constructor takes a vector argument and estimates the distribution from the provided sample:

C#
var sample = Vector.Create(1.2, 3.2, 4.1, 6.1, 2.3, 1.0, 2.1);
var johnson = new JohnsonDistribution(sample);

Note that parameter estimation says nothing about how well the estimated distribution fits the variable's distribution. Use one of the goodness-of-fit tests to verify the appropriateness of the choice of distribution.

The JohnsonDistribution class has five specific properties. LocationParameter and ScaleParameter return the location and scale parameters of the distribution. Gamma and Delta return the shape parameters, and Type returns the type of Johnson distribution.

References

  • Johnson, N. L. (1949). Systems of frequency curves generated by methods of translation. Biometrika, 36(1/2), 149-176.

  • Hill, I. D., Hill, R., & Holder, R. L. (1976). Fitting Johnson curves by moments. Applied Statistics, 180-189.

See Also