Random variables

STAT1003 – Statistical Techniques

Dr. Emi Tanaka

Australian National University

These slides are best viewed on a modern browser like Google Chrome on a desktop or laptop. Some interactive components may require some time to fully load.

Random variables

A random variable is a variable whose possible values are numerical outcomes of a random phenomenon.

There are two main types of random variables:

  • Discrete random variables: possible values are a countable number of distinct values
    • E.g., number of heads in 10 coin tosses.
  • Continuous random variables: any values within a given range
    • E.g., the height of students in a class.

Notation

  • Random variables are usually represented by uppercase letters (e.g., \(X\) or \(Y\)).
  • Specific outcomes (realised values) are represented by corresponding lowercase letters (e.g., \(x\) or \(y\)).
  • For example:
    • Let \(X\) be the number of heads in 3 coin tosses.
    • Then \(x = 0\) represents the outcome of getting 0 heads.

Probability distribution

A probability distribution describes how probabilities are assigned to the possible values of a random variable.

  • For a discrete random variable, the distribution is described by a probability mass function (pmf).
  • For a continuous random variable, the distribution is described by a probability density function (pdf).

By convention we denote:

  • the pmf of a discrete random variable \(X\) as \(P(X = x)\) or \(p_X(x)\), and
  • the pdf of a continuous random variable \(X\) as \(f_X(x)\).
  • Probability distribution summarises characteristics of the population.
  • Barplots for discrete data and histograms and density plots for continuous data are visualisations of estimates of the probability distribution of the underlying random variable.

Parametric distributions

  • Parametric distribution are defined by just a handful of parameters.

Bernoulli distribution


Binomial distribution

Poisson distribution


Negative binomial distribution

Normal distribution

t distribution


Uniform distribution

Gamma distribution