Table of Contents
The hypergeometric distribution is a fundamental concept in probability theory. It describes the likelihood of a specific number of successes in a sequence of draws from a finite population without replacement. This distribution is particularly useful in scenarios where the population size is limited, and sampling occurs without putting items back.
What Is the Hypergeometric Distribution?
The hypergeometric distribution calculates the probability of obtaining exactly k successes in n draws, from a population containing a total of N items, of which K are considered successes. Unlike the binomial distribution, the hypergeometric distribution accounts for the fact that each draw affects the subsequent probabilities because items are not replaced.
Key Components of the Distribution
- N: Total population size
- K: Total number of successes in the population
- n: Number of draws
- k: Number of observed successes in draws
Practical Scenario: Quality Control
Imagine a factory produces 1,000 widgets, with 50 known to be defective. If a quality inspector randomly selects 20 widgets for inspection, what is the probability that exactly 3 of these are defective?
Using the hypergeometric distribution, we set:
- N = 1000
- K = 50
- n = 20
- k = 3
The probability can be calculated with the formula:
P(k) = [(C(K, k) * C(N – K, n – k)) / C(N, n)]
Calculating the Probabilities
Here, C(a, b) represents the combination function, which calculates the number of ways to choose b items from a set of a.
Applying the numbers:
P(3) = [(C(50, 3) * C(950, 17)) / C(1000, 20)]
Why Use the Hypergeometric Distribution?
This distribution is ideal for scenarios like quality control, card games, and sampling without replacement. It provides precise probabilities when the population is small or when the sampling process impacts the results.
Summary
The hypergeometric distribution helps us understand the chances of a specific number of successes in a limited, non-replacing sample. Recognizing its applications allows teachers and students to better grasp real-world problems involving finite populations and sampling without replacement.