What is a Bernoulli distribution ?

A Bernoulli distribution models binary outcomes, producing either 0 or 1 based on a specified success probability. It’s used for simple yes/no scenarios.

How does the binomial distribution work ?

The binomial distribution counts the number of successes in a series of independent trials, useful for predicting outcomes like the number of successful investments.

What are common applications of the Poisson distribution ?

The Poisson distribution is used to model the number of events occurring in a fixed period, such as customer arrivals or equipment failures.

When is the normal distribution used ?

The normal distribution is widely used for modeling data that clusters around a mean, such as financial returns, due to its bell-shaped curve.

What is the significance of the uniform distribution ?

The uniform distribution assumes that all outcomes within a specified range are equally likely, providing a basis for generating random numbers.

How does the lognormal distribution differ from the normal distribution ?

The lognormal distribution is used when the data is skewed and cannot be negative, often applied in modeling stock prices and financial data.

What is the purpose of the Chi-square distribution ?

The Chi-square distribution is often used in statistical tests, particularly for testing the goodness of fit and analyzing variance.

What are the characteristics of the Student’s t-distribution ?

The Student’s t-distribution has thicker tails than the normal distribution, making it useful for small sample sizes and data with outliers.

How can the normal distribution approximate a binomial distribution ?

The normal distribution can approximate a binomial distribution when dealing with a large number of trials, simplifying calculations and predictions.

What is the significance of the probability mass function (PMF) ?

The PMF provides the probabilities of different outcomes in a discrete distribution, offering insights into the likelihood of various events.

QTA 3. Common Univariate Random Variables

Introduction

There are three common discrete distributions:
- The Bernoulli is a general purpose distribution that is typically used to model binary events.
- The binomial distribution describes the sum of 𝑛 independent Bernoulli random variables.
- The Poisson distribution is commonly used to model hazard rates, which count the number of events that occur in a fixed unit of time (e.g., the number of corporations defaulting in the next quarter).
There is a wider variety of continuous distributions used by risk managers.
The most basic is a uniform distribution, which serves as a foundation for all random variables.
The most widely used distribution is the normal, which is used for tasks such as modeling financial returns and implementing statistical tests.
Many other frequently used distributions are closely related to the normal. These include
- Student’s 𝑡,
- Chi-square $(\chi^{2})$ , and
- F,
Mixture distributions are built using two or more distinct component distributions. Mixtures can be used to build distributions that match important features of financial data. For example, mixing two normal random variables with different variances produces a random variable that has a larger kurtosis than either of the mixture components.

Bernoulli Distribution

The Bernoulli is a discrete distribution for random variables that produces one of two values: 0 or 1. It applies to any problem with a binary outcome (e.g., bull and bear markets, corporate defaults, or the classification of fraudulent transactions).
The Bernoulli distribution depends on a single parameter, p, which is the probability that a success is observed.
Suppose that Y is a Bernoulli random variable with parameter p. This can be expressed as $Y\sim\text{Bernoulli}(p)$
The mean of 𝑌 is $E[Y]=p\times 1+(1-p)\times 0=p,$
The Variance of 𝑌 is
$V[Y]=E[Y^2]-E[Y]^2=\left(p\times 1^2+(1-p)\times 0^2\right)-p^2=p(1-p)=pq$

where

q=1-p is the failure probability.
The probability mass function (PMF) of a Bernoulli(p) is $f_Y(y)=p^y(1-p)^{(1-y)}$ This function only produces two values:
p, when y=1 and
1-p, when y=0.
The CDF of a Bernoulli(p) is a step function with three values: $F_Y(y)=\begin{cases}0&y<0\\1-p&0\leq y<1\\1&y\geq 1\end{cases}$

Binomial Distribution

A binomial random variable measures the total number of successes from 𝑛 independent Bernoulli random variables, where each has a probability of success equal to 𝑝. Hence, binomial distributions are used to model counts of independent events
A binomial distribution has two parameters:
1. n, the number of independent experiments; and
2. p, the probability that each experiment is successful.
If n variables $X_i\sim\text{Bernoulli}(p)$ are independent, then a binomial with parameters n and p is defined as $Y=\sum_{i=1}^{n}X_i$
and expressed as

Y=B(n, p)
The mean of a binomially distributed random variable is given by
E[Y]=np

The variance of 𝑌 is

V[Y]=np(1-p)
The skewness of a binomial depends on 𝑝, with small values producing right-skewed distributions.
The normal distribution provides a convenient approximation to the binomial if both $np\geq 10$
and

$n(1-p)\geq 10$
The PMF of a B(n, p) is $f_Y(y)=\binom{n}{y}p^y(1-p)^{n-y}$
where

$\binom{n}{y}={}^nC_y=\frac{n!}{y!(n-y)!}$
The CDF is the sum of the cumulated PMF between 0 and y $F_Y(y)=\sum_{i=0}^{\lfloor y\rfloor}\binom{n}{i}p^i(1-p)^{n-i}$
The probability that each stock will go up during an expansion is 0.80. An investor has 9 such stocks and the event of going up for any particular stock is independent of the other stocks going up. If X denote the number stocks that go up, what is the mean, variance and standard deviation of X? What is the probability that zero, five, seven or all stocks will go up?

Poisson Distribution

Poisson random variables are used to measure counts of events over fixed time spans. For example, one application of a Poisson is to model the number of loan defaults that occur each month.
Poisson random variables are always non-negative and integer-valued. The Poisson distribution has a single parameter, which is called the hazard rate and expressed as λ, that signifies the average number of events per interval.
The mean and variance of $Y\sim\text{Poisson}(\lambda)$ are simply: $E[Y]=V[Y]=\lambda$
If the rate at which events occur over time is constant, and the probability of any one event occurring is independent of all other events, then we say that the events follow a Poisson process, where: $P[Y=n]=\frac{(\lambda t)^n}{n!}e^{-\lambda t}$
where t is the time elapsed.
A useful feature of the Poisson (one that is uncommon among distributions) is that it is infinitely divisible.
$X_1 \sim \text{Poisson}(\lambda_1), \quad X_2 \sim \text{Poisson}(\lambda_2)$
$Y = X_1 + X_2, \quad \text{then} \quad Y \sim \text{Poisson}(\lambda_1 + \lambda_2)$ This feature makes the Poisson well-suited to work with time series data, because summing the number of events in a sampling interval (e.g., a week, month, or quarter) does not change the distribution.
The PMF of a Poisson random variable is $f_Y(y)=\frac{\lambda^y e^{-\lambda}}{y!}$
The CDF of a Poisson is defined as the sum of the PMF for values less than the input $F_Y(y)=e^{-\lambda}\sum_{i=0}^{\lfloor y\rfloor}\frac{\lambda^i}{i!}$
Poisson random variables are used to measure counts of events over fixed time spans. For example, one application of a Poisson is to model the number of loan defaults that occur each month.
Poisson random variables are always non-negative and integer-valued. The Poisson distribution has a single parameter, which is called the hazard rate and expressed as λ, that signifies the average number of events per interval.The mean and variance of $Y\sim\text{Poisson}(\lambda)$ are simply:
$E[Y]=V[Y]=\lambda$
If the rate at which events occur over time is constant, and the probability of any one event occurring is independent of all other events, then we say that the events follow a Poisson process, where: $P[Y=n]=\frac{(\lambda t)^n}{n!}e^{-\lambda t}$
where t is the time elapsed.
A useful feature of the Poisson (one that is uncommon among distributions) is that it is infinitely divisible.
$X_1 \sim \text{Poisson}(\lambda_1), \quad X_2 \sim \text{Poisson}(\lambda_2)$
are independent, and $Y=X_1+X_2,\text{then}Y\sim\text{Poisson}(\lambda_1+\lambda_2).$

This feature makes the Poisson well-suited to work with timeseries data, because summing the number of events in a sampling interval (e.g., a week, month, or quarter) does not change the distribution.

Poisson Distribution – Example

If the defaults in a portfolio of bonds follow a Poisson distribution and the expected number of defaults each month is five, then find the probability that there are exactly four defaults during the next month? Also find the probability that there are exactly five defaults over the next two months?
$P[Y = n] = \frac{(\lambda t)^n}{n!} e^{-\lambda t}$

Normal Distribution

The normal distribution is the most commonly used distribution in risk management. It is also commonly referred to as a Gaussian distribution (after Carl Friedrich Gauss) or a bell curve (which reflects the shape of the PDF). The normal distribution is popular for many reasons.
- Many continuous random variables are approximately normally distributed.
- The distribution of many discrete random variables can be well approximated by a normal.
- The normal distribution plays a key role in the Central Limit Theorem (CLT), which is widely used in hypothesis testing (i.e., the process where data is used to determine the truthfulness of an objective statement).
- Normal random variables are infinitely divisible, which makes them suitable for simulating asset prices in models that assume that prices are continuously evolving.
- The normal is closely related to many other important distributions, including the Students’s t, the $\chi^{2}$ and the F
- The normal is closed (i.e., weighted sums of normal random variables are normally distributed) under linear operations.
- Estimators derived under the assumption that the underlying observations are normally distributed often have simple closed forms.
It is completely described by its mean, $\mu$ , and variance, $\sigma^2$ , stated as $Y\sim{N}(\mu,\sigma^2)$
Skewness=0, thus $P[X\leq\mu]=P[\mu\leq X]=0.5$
Kurtosis=3. This kurtosis is often used as a benchmark when assessing whether another distribution is heavy/fat-tailed.
The normal can generate any value in (-∞, ∞), although it is unlikely to observe values more than 3𝜎 away from the mean. In fact, values more than 3𝜎 away from the mean are expected in only one in 370 realizations of a normal random variable.
The PDF of a normal distribution is $f_Y(y)=\frac{1}{\sqrt{2\pi\sigma^2}}\exp\left(-\frac{(y-\mu)^2}{2\sigma^2}\right)$
The CDF of Y is
the sums of independent normally distributed random variables are also normally distributed. If $X_1 \sim N(\mu_1, \sigma_1^2) \text{ and } X_2 \sim N(\mu_2, \sigma_2^2) \text{ are independent }$ , and $Y=X_1+X_2,\text{ then }Y\sim N(\mu_1+\mu_2, \sigma_1^2+\sigma_2^2)$
This property simplifies describing log returns at different frequencies; if daily log returns are independent and normal, then weekly and monthly returns are as well.
Tails get thinner and thinner and extend up to infinity.Mean=Median=Mode

Normal Distribution – Confidence Intervals

- 68% confidence interval is μ-σ to μ+σ
- 90% confidence interval is μ-1.645σ to μ+1.645σ
- 95% confidence interval is μ-1.96σ to μ+1.96σ
- 99% confidence interval is μ-2.58σ to μ+2.58σ

Example –

The return on a stock is 12% per annum and the standard deviation of annual returns is 15%. Assuming that the returns follow a normal distribution, find the 90% confidence interval for the stock return over this year.

Standard Normal Distribution

A standard normal distribution (or a z-distribution) is a normal distribution which has been standardized to have a mean of zero and a standard deviation of one. It is stated as X~N(0,1 )
It is also common to use ϕ(z) to denote the standard normal PDF and Φ(z) to denote the standard normal CDF.PDF is
$\phi=\frac{1}{\sqrt{2\pi}}e^{-\frac{1}{2}x^2}$
The z-value is the number of standard deviations a given observation is away from the mean. Standardization is converting an observed value x to its z value by using the formula: $z=\frac{x-\mu}{\sigma}$

Standard Normal Distribution – Example

Consider out last example where the annual return on a stock is normally distributed with a mean of 12% per annum and a standard deviation of 15%. Find the z-values corresponding to the returns -3% and 19.5%

Standard Normal Distribution And Probabilities

A standard normal table, or z-table, is a mathematical table for the values of Φ, which are values of the cumulative distribution function of the normal distribution, denoted by $\Phi(Z)=P[Z\leq z]$ . By symmetry of the normal distribution $\Phi(-Z)=1-\Phi(Z)$
The values in the z-table are the probabilities of observing a z-value that is less than a given value (or P[Z<z]). It can also be used to find the probability that a z-value is observed above ( by using $P[Z\geq z]=1-P[Z<z]$ ), or between values on the standard normal distribution, and by extension, any normal distribution.

Standard Normal Table – Example 1

Consider out last example where the annual return on a stock is normally distributed with a mean of 12% per annum and a standard deviation of 15%. Find the probability that the return is less than or equal to 18%.

Standard Normal Table – Example 2

Standard Normal Table – Example 3

Approximating Discrete Random Variables

A normal distribution can approximate a binomial random variable if both
np ≥ 10 and n(1 – p) ≥ 10
When these conditions are satisfied, a binomial has either many independent experiments or a probability that is not extreme (or both), and so the PMF is nearly symmetric and well approximated by a
N(np, np(1-p))
The Poisson can also be approximated by a normal random variable. When λ is large, then a Poisson(λ) can be well approximated by a Normal(λ, λ). This approximation is commonly applied when λ≥1000.

Lognormal Distribution

A variable Y is said to be log-normally distributed if the natural logarithm of Y is normally distributed. In other words, if X=ln⁡ Y, then is log-normally distributed if and only if X is normally distributed. Alternatively, a log-normal can be defined $Y=e^X$ where $X\sim{N}(\mu,\sigma)$
An important property of the log-normal distribution is that Y can never be negative, whereas X can be negative because it is normally distributed. This log-normal property can be desirable when constructing certain models. For example, if stock prices are assumed to be normally distributed, there is a positive (although perhaps tiny) probability that the stock price becomes negative. This is impossible under a log-normal model.
A log-normal distribution can be denoted as $Y\sim\text{LogN}(\mu,\sigma^2),\text{or equivalently as}\ln(Y)\sim{N}(\mu,\sigma^2)$ where ln⁡(Y) is normally distributed with mean μ and variance $\sigma^2$ .
Here, 𝑌 is a non-linear transformation of a normal random variable
The mean of Y is $E[Y]=e^{\mu+\frac{1}{2}\sigma^2}$
The variance of Y is $V[Y]=\left(e^{\sigma^2}-1\right)e^{2\mu+\sigma^2}$
Lognormal distribution is positively skewed.
It peaks at $e^{\mu-\sigma^2}$
The PDF of a log-normal is given by: $f_Y(y)=\frac{1}{y\sqrt{2\pi\sigma^2}}e^{-\frac{(\ln y-\mu)^2}{2\sigma^2}}$
The CDF of Y is $F_Y(y)=\Phi\left(\frac{n y-\mu}{\sigma}\right)$

Chi Square Distribution

- The $\chi^2$ (chi-squared) distribution is frequently encountered when testing hypotheses about model parameters. It is also used when modeling variables that are always positive, (e.g., the VIX Index).
- A $\chi_{\nu}^{2}$ random variable is defined as the sum of the squares of ν (Greek nu) independent standard normal random variables

$Y=\sum_{i=1}^{\nu}Z_i^2$

Note that a $\chi^2$ distribution has 𝜈 degrees of freedom, a concept that arises when dealing with models that have 𝑘 parameters using 𝑛 data points. Degrees of freedom measure the amount of data that is available to test model parameters, because estimating model parameters requires a minimum number of observations (e.g., 𝑘). In many models, the degree of freedom used in testing is n-k.
The mean of $Y\sim\chi_{\nu}^{2}$ is $E[Y]=\nu$
The variance of Y is $V[Y]=2\nu$
The PDF of a $\chi _{\nu}^{2}$ random variable is

$f(y)=\frac{1}{2^{\frac{\nu}{2}}\Gamma\left(\frac{\nu}{2}\right)}y^{\frac{\nu}{2}-1}e^{-\frac{y}{2}}$

where Γ(x) is known as the Gamma function.
The CDF of Y is

$F_Y(y)=P\left(\frac{\nu}{2},\frac{y}{2}\right)$

where P is the regularized gamma function.

Student’s T Distribution

The Student’s t distribution is closely related to the normal, but it has heavier tails. It was originally developed for testing hypotheses using small samples.
A Student’s t is a one-parameter distribution. This parameter, denoted by $\nu$ , is also called the degrees of freedom parameter. While it affects many aspects of the distribution, the most important effect is on the shape of the tails of the distribution.The Student’s t is the distribution of

$Y=\frac{Z}{\sqrt{W/\nu}}$
where
- Z is a standard normal
- W is a $\chi_{\nu}^{2}$ random variable, and
- Z and W are independent.
Dividing a standard normal by another random variable produces heavier tails than the standard normal. This is true for all values of 𝜈, although a Student’s t converges to a standard normal as ν→∞.
If $Y \sim t_v$ , then the mean is
E[Y]=0

The mean is only finite if ν>1
The variance of Y is $V[Y]=\frac{v}{v-2}$
The variance is only finite if ν>2
The kurtosis of Y is $kurtosis(Y)=3\frac{v-2}{v-4}$
The kurtosis is defined for ν>4 and is always larger than 3
Using the basic result that $V[aY]=a^2V[Y]$
It is easy to see that

$V\left[\sqrt{\frac{v-2}{v}}Y\right]=1$

when $Y\sim t_{\nu}$ .

This distribution is known as a standardized Student’s t, because it has mean 0 and variance 1 for any value of ν.
The above distribution can be rescaled to have any variance and re-centered to have any mean if ν>2. For example, $X=\mu+\sigma S$ is known as a generalized Student’s t and has mean μ, variance $\sigma^2$ , and degrees of freedom parameter ν. It is parameterized with three parameters reflecting the mean, variance, and degrees of freedom, and is denoted as $Gen.t_\nu(\mu,\sigma^2)$
As compared to the normal distribution, the generalized 𝑡 is better suited to model the returns of many assets. It captures the heavy tails in asset returns while retaining the flexibility of a normal random variable to directly set the mean and variance.

F Distribution

The 𝐹 is another distribution that is commonly encountered when testing hypotheses about model parameters. The 𝐹 has two parameters, $\nu_1$ and $\nu_2$
An F distribution is defined as the ratio of two independent $\chi^2$ random variables where each has been divided by its degree of freedom $Y = \frac{\frac{W_1}{\nu_1}}{\frac{W_2}{\nu_2}}$
where $W_1 \sim \chi^2_{\nu_1}$ , and $W_2 \sim \chi^2_{\nu_2}$ , and

$W_1\;and\;W_2$ are independent.
If $Y \sim F_{\nu_1, \nu_2}$ , then the mean of Y is
$E[Y] = \frac{\nu_2}{\nu_2 - 2}$

which is only finite when $\nu_2$ is larger than 2.
The variance of Y is
$V[Y] = \frac{2\nu_2^2(\nu_1 + \nu_2 - 2)}{\nu_1(\nu_2 - 2)^2(\nu_2 - 4)}$

and is only finite for $\nu_2 > 4$ .
When using an F in hypothesis testing, $\nu_1$ ; is usually determined by the hypothesis being tested and is typically small (e.g., 1, 2, 3, …), while $\nu_2$ is related to the sample size (and so is relatively large).

Exponential Distribution

The exponential distribution uses a single parameter, β, that determines both the mean and variance.
If $Y \sim \text{Exponential}(\beta)$ , then mean of Y is
E[Y]=β
The variance of Y is
V[Y]=β
The exponential distribution is closely related to the Poisson distribution.
For example, suppose X is a random variable that measures the number of loan defaults per quarter.
If X is Poisson distributed with parameter β, then the time between each subsequent loan default has as an exponential distribution with parameter β.
Exponential variables are also memoryless, meaning that their distributions are independent
of their histories.
The PDF of an Exponential(β) is $f_Y(y) = \frac{1}{\beta} \exp\left(-\frac{y}{\beta}\right), \, y \geq 0$
The CDF is
$F_Y(y) = 1 - \exp\left(-\frac{y}{\beta}\right)$

Exponential Distribution Example

Assume that the time to default for a specific segment of credit card consumers is exponentially distributed with a β of 5 years. Find the probability that the customer will not default before year 6

Beta Distribution

The beta distribution applies to continuous random variables with outcomes between 0 and 1. It is commonly used to model probabilities that naturally fall into this range. The beta distribution has two parameters, α and β, that jointly determine the mean and variance of a beta-distributed random variable.
If $Y \sim Beta(\alpha, \beta)$ , then mean of Y is $E[Y] = \frac{\alpha}{\alpha + \beta}$
The variance of Y is $V[Y] = \frac{\alpha\beta}{(\alpha + \beta)^2(\alpha + \beta + 1)}$
The PDF of a beta(α, β) is $f_Y(y) = \frac{y^{\alpha-1}(1-y)^{\beta-1}}{B(\alpha, \beta)}$
where $B(\alpha, \beta) = \frac{\Gamma(\alpha)\Gamma(\beta)}{\Gamma(\alpha + \beta)}$ , and Γ(⋅) is the Gamma function.
The CDF is $F_Y(y) = I_y(\alpha, \beta)$
where $I_y(\alpha, \beta)$ is the regularized incomplete beta function.

Mixtures Of Distributions

Mixture distributions build new, complex distributions using two or more component distributions.
A two-component mixture first draws a value from a Bernoulli random variable. Then, depending on the value (0 or 1), draws from one of two component distributions. This structure makes it simple to compute the CDF of the mixture when the components are normal random variables.
Mixing components with different means and variances produces a distribution that is both skewed and heavy-tailed.

Contact us

Common Univariate Random Variables - FRM

Learning Objectives

Chapter Contents

Introduction

Bernoulli Distribution

Binomial Distribution

Poisson Distribution

Poisson Distribution – Example

Normal Distribution

Normal Distribution – Confidence Intervals

Standard Normal Distribution

Standard Normal Distribution – Example

Standard Normal Distribution And Probabilities

Standard Normal Table – Example 1

Standard Normal Table – Example 2

Standard Normal Table – Example 3

Approximating Discrete Random Variables

Lognormal Distribution

Chi Square Distribution

Student’s T Distribution

F Distribution

Exponential Distribution

Exponential Distribution Example

Beta Distribution

Mixtures Of Distributions

Previous Chapter

Next Chapter

Go to Syllabus

Courses Offered

By : Micky Midha

VIEW DETAILS

By : Micky Midha

VIEW DETAILS

By : Micky Midha

VIEW DETAILS

By : Micky Midha

VIEW DETAILS

By : Shubham Swaraj

VIEW DETAILS

FAQs

No comments on this post so far:

Add your Thoughts: