Random Variable
Given an experiment with sample space $S$, a random variable (r.v.) is a function $X$ from the sample space $S$ to the real numbers $\mathbb{R}$. It is common, but not required, to denote random variables by capital letters.
Thus, a random variable assigns a numerical value to each possible outcome of the experiment. The mapping itself is deterministic; the randomness comes from which outcome of the experiment occurs.
For example, consider an experiment where we toss a fair coin twice. The sample space consists of four possible outcomes, $S = \{HH, HT, TH, TT\}$. Then we can define $X$ to be the number of Heads:

$$X(HH) = 2, \quad X(HT) = X(TH) = 1, \quad X(TT) = 0$$
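The point that the mapping itself is deterministic can be made concrete with a short sketch (the outcome strings and the name `X` are illustrative choices, not from the text):

```python
# The r.v. X = "number of Heads" is a deterministic map from outcomes
# to numbers; only which outcome occurs is random, not the mapping.
sample_space = ["HH", "HT", "TH", "TT"]
X = {outcome: outcome.count("H") for outcome in sample_space}

print(X)  # {'HH': 2, 'HT': 1, 'TH': 1, 'TT': 0}
```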
Discrete Random Variable
A random variable $X$ is said to be discrete if there is a finite list of values $a_1, a_2, \dots, a_n$ or an infinite list of values $a_1, a_2, \dots$ such that $P(X = a_j \text{ for some } j) = 1$. If $X$ is a discrete r.v., then the finite or countably infinite set of values $x$ such that $P(X = x) > 0$ is called the support of $X$.
In contrast, a continuous r.v. can take on any real value in an interval (possibly even the entire real line).
Info
It is also possible to have an r.v. that is a hybrid of discrete and continuous, such as by flipping a coin and then generating a discrete r.v. if the coin lands Heads and a continuous r.v. if it lands Tails.
Indicator Random Variable
The indicator random variable of an event $A$ is the r.v. which equals $1$ if $A$ occurs and $0$ otherwise. We will denote the indicator r.v. of $A$ by $I_A$ or $I(A)$. See Indicator Random Variables.
Probability Mass Function
The probability mass function (PMF) of a discrete r.v. $X$ is the function $p_X$ given by $p_X(x) = P(X = x)$. Note that this is positive if $x$ is in the support of $X$, and $0$ otherwise.
Tip
In writing $P(X = x)$, we are using $X = x$ to denote an event, consisting of all outcomes $s$ to which $X$ assigns the number $x$.
Let $X$ be a discrete r.v. with support $x_1, x_2, \dots$. The PMF $p_X$ of $X$ must satisfy the following two criteria:
- Nonnegative: $p_X(x) > 0$ if $x = x_j$ for some $j$, and $p_X(x) = 0$ otherwise;
- Sums to $1$: $\sum_{j=1}^{\infty} p_X(x_j) = 1$
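Both criteria can be checked numerically for the two-coin example above; this is a sketch, and the use of exact arithmetic via `fractions` is my choice, not the text's:

```python
from collections import Counter
from fractions import Fraction

# PMF of X = number of Heads in two fair tosses, built by counting
# equally likely outcomes in the sample space.
sample_space = ["HH", "HT", "TH", "TT"]
counts = Counter(s.count("H") for s in sample_space)
pmf = {x: Fraction(c, len(sample_space)) for x, c in counts.items()}

assert all(p > 0 for p in pmf.values())  # nonnegative on the support
assert sum(pmf.values()) == 1            # sums to 1
print(sorted(pmf.items()))
```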
Cumulative Distribution Function
The cumulative distribution function (CDF) of an r.v. $X$ is the function $F_X$ given by $F_X(x) = P(X \le x)$. When there is no risk of ambiguity, we sometimes drop the subscript and just write $F$ for a CDF.
Any CDF has the following properties:
- Increasing: if $x_1 \le x_2$, then $F(x_1) \le F(x_2)$
- Right-continuous: wherever there is a jump, the CDF is continuous from the right. That is, for any $a$, we have $F(a) = \lim_{x \to a^+} F(x)$
- Convergence to $0$ and $1$ in the limits: $\lim_{x \to -\infty} F(x) = 0$ and $\lim_{x \to +\infty} F(x) = 1$
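For a discrete r.v., the CDF is a step function obtained by summing the PMF. A minimal sketch, reusing the two-coin PMF as an assumed example:

```python
# CDF F(x) = P(X <= x) for X = number of Heads in two fair tosses;
# a right-continuous step function built from the PMF.
pmf = {0: 0.25, 1: 0.5, 2: 0.25}

def cdf(x):
    """Sum the PMF over all support points <= x."""
    return sum(p for v, p in pmf.items() if v <= x)

values = [cdf(x) for x in (-1, 0, 0.5, 1, 2, 3)]
print(values)  # [0, 0.25, 0.25, 0.75, 1.0, 1.0]
assert all(a <= b for a, b in zip(values, values[1:]))  # increasing
```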
Tip
We often describe the distribution of a discrete r.v. by its PMF, and the distribution of a continuous r.v. by its CDF.
Functions of Random Variables
For an experiment with sample space $S$, an r.v. $X$, and a function $g : \mathbb{R} \to \mathbb{R}$, $g(X)$ is the r.v. that maps $s$ to $g(X(s))$ for all $s \in S$.
Let $X$ be a discrete r.v. and $g : \mathbb{R} \to \mathbb{R}$. Then the PMF of $Y = g(X)$ is

$$P(g(X) = y) = \sum_{x : g(x) = y} P(X = x)$$

In the continuous case, the distribution of $g(X)$ is instead found by working with the CDF, $P(g(X) \le y)$.
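The discrete formula sums $P(X = x)$ over all $x$ mapped to the same $y$. A sketch, where the choice $g(x) = (x - 1)^2$ and the two-coin PMF are illustrative assumptions:

```python
from collections import defaultdict

# PMF of Y = g(X): accumulate P(X = x) into the bucket for y = g(x).
# Note x = 0 and x = 2 collide at y = 1, so their probabilities add.
pmf_X = {0: 0.25, 1: 0.5, 2: 0.25}

def g(x):
    return (x - 1) ** 2

pmf_Y = defaultdict(float)
for x, p in pmf_X.items():
    pmf_Y[g(x)] += p

print(dict(pmf_Y))  # {1: 0.5, 0: 0.5}
```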
Example
Let $X$ be a continuous r.v. whose CDF $F$ is strictly increasing. Then $U = F(X) \sim \mathrm{Unif}(0, 1)$, the uniform distribution on $(0, 1)$. We can prove this by converting the CDF of $U$ into the CDF of $X$: for $u \in (0, 1)$,

$$P(U \le u) = P(F(X) \le u) = P(X \le F^{-1}(u)) = F(F^{-1}(u)) = u$$

Here $F^{-1}$ exists since $F$ is continuous and strictly increasing. See the universality of the uniform.
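The claim can also be checked by simulation. A sketch assuming $X \sim \mathrm{Expo}(1)$, whose CDF $F(x) = 1 - e^{-x}$ is continuous and strictly increasing:

```python
import math
import random

# Universality of the uniform, by simulation: if X ~ Expo(1) with
# CDF F(x) = 1 - e^{-x}, then U = F(X) should look Unif(0, 1).
random.seed(0)
xs = [random.expovariate(1.0) for _ in range(100_000)]
us = [1 - math.exp(-x) for x in xs]

assert all(0 <= u <= 1 for u in us)
print(round(sum(us) / len(us), 2))  # sample mean, close to 1/2
```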
Independence of Random Variables
Random variables $X$ and $Y$ are said to be independent if

$$P(X \le x, Y \le y) = P(X \le x)\,P(Y \le y)$$

for all $x, y \in \mathbb{R}$. In the discrete case, this is equivalent to the condition

$$P(X = x, Y = y) = P(X = x)\,P(Y = y)$$

for all $x, y$ with $x$ in the support of $X$ and $y$ in the support of $Y$.
Random variables $X_1, \dots, X_n$ are independent if

$$P(X_1 \le x_1, \dots, X_n \le x_n) = P(X_1 \le x_1) \cdots P(X_n \le x_n)$$

for all $x_1, \dots, x_n \in \mathbb{R}$. For infinitely many r.v.s, we say that they are independent if every finite subset of the r.v.s is independent.
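The discrete criterion can be verified mechanically on a small example. This sketch checks factorization of the joint PMF for two fair coin-toss indicators (an assumed example, not from the text):

```python
from fractions import Fraction
from itertools import product

# Outcomes of two fair tosses, encoded as pairs (X, Y) where X is the
# indicator of Heads on toss 1 and Y on toss 2; all equally likely.
outcomes = list(product([0, 1], repeat=2))
p_each = Fraction(1, 4)

def P(event):
    """Probability of an event, given as a predicate on outcomes."""
    return sum(p_each for o in outcomes if event(o))

# Check P(X = x, Y = y) = P(X = x) P(Y = y) over the whole support.
for x, y in product([0, 1], repeat=2):
    joint = P(lambda o: o == (x, y))
    assert joint == P(lambda o: o[0] == x) * P(lambda o: o[1] == y)
print("X and Y are independent")
```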
Tip
Note that this criterion is different from that for independence of events. But in fact, if $X_1, \dots, X_n$ are independent, then they are also pairwise independent, i.e., $X_i$ is independent of $X_j$ for $i \neq j$. The idea behind proving that $X_i$ and $X_j$ are independent is to let all the $x_k$ other than $x_i, x_j$ go to $\infty$ in the definition of independence, since we already know $X_k < \infty$ is true. But pairwise independence does not imply independence in general.
If $X$ and $Y$ are independent r.v.s, then any function of $X$ is independent of any function of $Y$.
Independent and Identically Distributed (i.i.d.)
We often work with random variables that are independent and have the same distribution. We call such r.v.s independent and identically distributed, or i.i.d. for short.
If some r.v.s are i.i.d., they provide no information about each other, and they all have the same PMF and CDF.