Poisson Distribution
A Poisson random variable gives the probability of a given number of events in a fixed interval of time (or space). It makes the Poisson assumption that events occur with a known constant mean rate and independently of the time since the last event.
Poisson Random Variable
Notation: | |
---|---|
Description: | Number of events in a fixed time frame if (a) the events occur with a constant mean rate and (b) they occur independently of time since last event. |
Parameters: | |
Support: | |
PMF equation: | |
Expectation: | |
Variance: | |
PMF graph: |
Poisson Intuition
In this section we show the intuition behind the Poisson derivation. It is both a great way to deeply understand the Poisson, as well as good practice with Binomial distributions.
Let's work on the problem of predicting the chance of a given number of events occurring in a fixed time interval โ the next minute. For example, imagine you are working on a ride sharing application and you care about the probability of how many requests you get from a particular area. From historical data, you know that the average requests per minute is
: We could approximate a solution to this problem by using a binomial distribution! Lets say we split our minute into 60 seconds, and make each second an indicator Bernoulli variable โ you either get a request or you don't. If you get a request in a second, the indicator is 1. Otherwise it is 0. Here is a visualization of our 60 binary-indicators. In this example imagine we have requests at 2.75 and 7.12 seconds. the corresponding indicator variables are blue filled in boxes:
The total number of requests received over the minute can be approximated as the sum of the sixty indicator variables, which conveniently matches the description of a binomial โ a sum of Bernoullis. Specifically define
Now
Choose any value of
The larger
Poisson, a Binomial in the limit
Or if we really cared about making sure that we don't get two events in the same bucket, we can divide our minute into infinitely small buckets:
Proof: Derivation of the Poisson
What does the PMF of
While it may look intimidating, this expression simplifies nicely. This proof uses a few special limit rules that we haven't introduced in this book:
That is a beautiful expression! Now we can calculate the real probability of number of requests in a minute, if the historical average is
This is both more accurate and much easier to compute!
Changing time frames
Say you are given a rate over one unit of time, but you want to know the rate in another unit of time. For example, you may be given the rate of hits to a website per minute, but you want to know the probability over a 20 minute period. You would just need to multiply this rate by 20 in order to go from the "per 1 minute of time" rate to obtain the "per 20 minutes of time" rate.