Hypergeometric Distribution
The hypergeometric distribution models the number of successes in a fixed number of draws from a finite population without replacement. It is also known as the discrete hypergeometric distribution. This distribution is used when sampling without replacement from a population consisting of two types of objects.
Definition
The hypergeometric distribution has three parameters: the number of tagged objects
The cumulative distribution function (CDF) is:
The domain of the hypergeometric distribution is
Applications
The hypergeometric distribution is widely used in various fields due to its applicability to sampling without replacement. Common applications include:
In quality control, the number of defective products out of a sample of fixed size follows a hypergeometric distribution.
In biology, the size of an animal population can be estimated by first tagging a known number of animals, and a short time later recapturing a fixed number of animals. The number of tagged animals in the recaptured samples follows a hypergeometric distribution.
Properties
The hypergeometric distribution has several important statistical properties:
Property | Value |
---|---|
Mean | |
Variance | |
Skewness | |
Kurtosis | |
Median | |
Mode | |
Support | |
Entropy | Not available |
Relationships to Other Distributions
The hypergeometric distribution is closely related to several other distributions:
The Binomial Distribution models the number of successes in a fixed number of Bernoulli trials with replacement.
The Negative Binomial Distribution models the number of failures before the
th success in a series of Bernoulli trials.
The HypergeometricDistribution class
The hypergeometric distribution is implemented by the HypergeometricDistribution class. It has one constructor that takes three arguments. The first argument is an integer that specifies the number of items in the untagged population. The second argument specifies the number of items in the tagged population. The third parameter specifies the number of samples drawn from the total population. The following constructs a hypergeometric distribution for 20 tagged items, 10 untagged items, and 15 samples:
var hyper = new HypergeometricDistribution(20, 10, 15);
The HypergeometricDistribution class has three specific properties: TaggedPopulation returns the number of items in the tagged population. UntaggedPopulation returns the number of items in the untagged population. NumberOfSamples returns the number of samples drawn from the entire population.
HypergeometricDistribution has one static (Shared in Visual Basic) method, Sample, which generates a random sample using a user-supplied uniform random number generator. The first argument is the random number generator. The remaining 3 parameters correspond to the parameters of the constructor.
var random = new Pcg32();
int sample = HypergeometricDistribution.Sample(random, 20, 10, 15);
The above example uses the Pcg32 class to generate uniform random numbers.
For details of the properties and methods common to all discrete probability distribution classes, see the topic on Discrete Probability Distributions.
References
"Introduction to Probability Models" by Sheldon M. Ross.
"Probability and Statistics" by Morris H. DeGroot and Mark J. Schervish.