Preliminaries

Hypothesis

A statistical hypothesis is an assertion about a population parameter that can be tested. There are two main types of hypotheses: - Null Hypothesis ( $H_{0}$ ): The default assumption that there is no effect or no difference. For example, it might state that a new drug has no effect on patients. - Alternative Hypothesis ( $H_{1}$ ): This represents the opposite of the null hypothesis, suggesting that there is an effect or a difference

Two Types of Error

Fact \ Decision	Accept $H_{0}$	Reject $H_{0}$
$H_{0}$ is true	Good	Type I Error
$H_{1}$ is true	Type II Error	Good

Significance Level

Significance level, denoted $α$ , is the threshold for determining whether to reject the null hypothesis (i.e., the probability for Type I Error to occur). A common choice for $α$ is 0.05, meaning there is a $α$ % risk of concluding that a difference exists when there is none.

Rejection Region

The rejection region (or critical region) is defined as the area on the distribution of the test statistic where, if the calculated statistic falls within this area, you would reject $H_{0}$ in favor of the alternative hypothesis ( $H_{1}$ ).

The critical values are the boundaries that separate the critical region from the acceptance region:

The chosen significance level ( $α$ )
Whether the test is one-tailed or two-tailed.

We say that a rejection region has a size $α$ , if it ensures a significance level $α$ , i.e. the probability of making a Type I error should be lower than $α$ . In the plot of PDF plot, this $α$ is the total area of the critical region, as shown in the figure below

Definition

To generalize the hypothesis testing problem. Let $Θ_{0}$ and $Θ_{1}$ be the parameter spaces of $θ$ when $H_{0}$ and $H_{1}$ are true, respectively. The hypothesis testing problem is: $H_{0} : θ \in Θ_{0} ⟷ H_{1} : θ \in Θ_{1} .$ Let $X_{1}, X_{2}, \dots, X_{n}$ be a simple random sample, $T$ be the test statistic, $α$ be the significance level, and $W$ be the rejection region. Define the test $ϕ$ as:

ϕ = {1, 0, (X_{1}, X_{2}, \dots, X_{n}) \in W . (X_{1}, X_{2}, \dots, X_{n}) \in / W .

Test with Level $α$

P_{H_{0}} (ϕ = 1) = P_{H_{0}} ({reject H_{0}}) \leq α, \forall θ \in Θ_{0}

Then we call $ϕ$ a test with level $α$

Power of a Test

The power of a hypothesis test is a critical measure that reflects its ability to correctly reject a false null hypothesis ( $H_{0}$ ). That is $β_{ϕ} (θ) = P (ϕ = 1)$ , or

β (θ) = P (θ \in W), where W is the rejection region

When $H_{0}$ is true, $β_{ϕ} (θ)$ is the probability for type I error to occur; When $H_{1}$ is true, $1 - β_{ϕ} (θ)$ is the probability for type II error to occur. Therefore, if $H_{0}$ is true, then we would have $β_{ϕ} (θ) \leq α$ . And when $H_{1}$ is true, we hope $β_{ϕ} (θ)$ to be as big as possible. As a result, when $H_{1}$ is true, we call $β_{ϕ} (θ)$ is the power of test $ϕ$ at $θ$ .

Uniformly Most Powerful Test (UMPT)

A Uniformly Most Powerful (UMP) test is defined as a hypothesis test that maximizes the probability of correctly rejecting the null hypothesis across all possible alternative hypotheses, while maintaining a fixed significance level $α$ . This means that for any alternative hypothesis, the UMP test has the greatest statistical power $1 - β$

$p$ -value

For a test $ϕ$ at level $α$ , $T$ is the test statistic. After obtaining the sample, let the observed value of $T$ be $t$ . If the rejection region of the hypothesis is of the form $W = {T > c}$ , then the p-value is defined as

p -value = P_{H_{0}} (T > t) .

Additionally, for a rejection region of the form $W = {T < c}$ , the p-value is $P_{H_{0}} (T < t)$ ; for a rejection region of the form $W = {∣ T ∣ > c}$ , the p-value is $P_{H_{0}} (∣ T ∣ > ∣ t ∣)$ . According to the shape of the rejection region, the p-value is the sum of the probabilities of those data that are more extreme (with a smaller probability of occurrence) than the observed data under the assumption that the null hypothesis is true ().

Warning

if $T$ is a discrete random variable, the sum must include the probability of the observed value. That is, use $T \geq t$ instead of $T > t$

In other words, the p-value is the minimum significance level required to reject the null hypothesis based on the observed sample. That is

If $p > α$ , we accept $H_{0}$
If $p \leq α$ , we reject $H_{0}$

Note

If we can define a clear threshold for the hypothesis to be true, use the rejection region approach; if we can't, use the p-value approach

Common Testing Problems

Simple Tests: $H_{0} : θ = θ_{0} ⟷ H_{1} : θ = θ_{1}$
Two-tailed Tests: $H_{0} : θ = θ_{0} ⟷ H_{1} : θ \neq = θ_{0}$
One-tailed Tests:
- $H_{0} : θ = θ_{0} ⟷ H_{1} : θ > θ_{0}$ or $H_{0} : θ \leq θ_{0} ⟷ H_{1} : θ > θ_{0}$
- $H_{0} : θ = θ_{0} ⟷ H_{1} : θ < θ_{0}$ or $H_{0} : θ \geq θ_{0} ⟷ H_{1} : θ < θ_{0}$

General Steps

Below are the general steps for hypothesis testing, divided into four steps:

Set the significance level $α$ . Obtain a Point Estimation $\hat{θ} = \hat{θ} (X_{1}, X_{2}, \dots, X_{n})$ , which is usually the maximum likelihood estimate;
Based on $\hat{θ}$ , construct the test statistic $T = T (X_{1}, X_{2}, \dots, X_{n})$ such that when $θ = θ_{0}$ , the distribution of $T$ is known, such as $N (0, 1)$ , $χ_{n}^{2}$ , $t_{n}$ , $F_{m, n}$ , etc., and it is independent of $θ$ ;
Based on $T$ , determine the shape of the rejection region according to the practical meaning of the alternative hypothesis $H_{1}$ . It is an inequality or two inequalities about $T$ , containing one or two critical values;
Based on the significance level $α$ , calculate the rejection region of the test, i.e., determine the critical values of the inequality in step 3. Calculate the value of the test statistic $T$ based on the sample, and then determine whether the sample falls into the rejection region. If it does, reject $H_{0}$ ; otherwise, accept $H_{0}$ ; Alternatively, calculate the p-value of the test. If the p-value is less than $α$ , reject $H_{0}$ ; otherwise, accept $H_{0}$ .

Tables

Test Mean with Known Variance

Test	Statistics	Distribution	Rejection Region
$H_{0} : μ = μ_{0} \leftrightarrow H_{1} : μ \neq = μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{σ}$	$N (0, 1)$	${abs (T) > u_{1 - α /2}}$
$H_{0} : μ = μ_{0} \leftrightarrow H_{1} : μ > μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{σ}$	$N (0, 1)$	${T > u_{1 - α}}$
$H_{0} : μ = μ_{0} \leftrightarrow H_{1} : μ < μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{σ}$	$N (0, 1)$	${T < u_{α}}$
$H_{0} : μ \leq μ_{0} \leftrightarrow H_{1} : μ > μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{σ}$	$N (0, 1)$	${T > u_{1 - α}}$
$H_{0} : μ \geq μ_{0} \leftrightarrow H_{1} : μ < μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{σ}$	$N (0, 1)$	${T < u_{α}}$

Test Mean with Unknown Variance

Test	Statistics	Distribution	Rejection Region
$H_{0} : μ = μ_{0} \leftrightarrow H_{1} : μ \neq = μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{S}$	$t_{n - 1}$	${abs (T) > t_{n - 1} (1 - α /2)}$
$H_{0} : μ = μ_{0} \leftrightarrow H_{1} : μ > μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{S}$	$t_{n - 1}$	${T > t_{n - 1} (1 - α)}$
$H_{0} : μ = μ_{0} \leftrightarrow H_{1} : μ < μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{S}$	$t_{n - 1}$	${T < t_{n - 1} (α)}$
$H_{0} : μ \leq μ_{0} \leftrightarrow H_{1} : μ > μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{S}$	$t_{n - 1}$	${T > t_{n - 1} (1 - α)}$
$H_{0} : μ \geq μ_{0} \leftrightarrow H_{1} : μ < μ_{0}$	$T = \frac{n ( X ˉ - μ _{0} )}{S}$	$t_{n - 1}$	${T < t_{n - 1} (α)}$

Test Variance with Known Mean

Test	Statistics	Distribution	Rejection Region
$H_{0} : σ^{2} = σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} \neq = σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - μ)^{2}$	$χ_{n}^{2}$	${T > χ_{n}^{2} (1 - \frac{α}{2})} \cup {T < χ_{n}^{2} (\frac{α}{2})}$
$H_{0} : σ^{2} = σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} > σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - μ)^{2}$	$χ_{n}^{2}$	${T > χ_{n}^{2} (1 - α)}$
$H_{0} : σ^{2} = σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} < σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - μ)^{2}$	$χ_{n}^{2}$	${T < χ_{n}^{2} (α)}$
$H_{0} : σ^{2} \leq σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} > σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - μ)^{2}$	$χ_{n}^{2}$	${T > χ_{n}^{2} (1 - α)}$
$H_{0} : σ^{2} \geq σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} < σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - μ)^{2}$	$χ_{n}^{2}$	${T < χ_{n}^{2} (α)}$

Test Variance with Unknown Mean

Test	Statistics	Distribution	Rejection Region
$H_{0} : σ^{2} = σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} \neq = σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - \overset{ˉ}{X})^{2}$	$χ_{n - 1}^{2}$	${T > χ_{n - 1}^{2} (1 - \frac{α}{2})} \cup {T < χ_{n - 1}^{2} (\frac{α}{2})}$
$H_{0} : σ^{2} = σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} > σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - \overset{ˉ}{X})^{2}$	$χ_{n - 1}^{2}$	${T > χ_{n - 1}^{2} (1 - α)}$
$H_{0} : σ^{2} = σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} < σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - \overset{ˉ}{X})^{2}$	$χ_{n - 1}^{2}$	${T < χ_{n - 1}^{2} (α)}$
$H_{0} : σ^{2} \leq σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} > σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - \overset{ˉ}{X})^{2}$	$χ_{n - 1}^{2}$	${T > χ_{n - 1}^{2} (1 - α)}$
$H_{0} : σ^{2} \geq σ_{0}^{2} \leftrightarrow H_{1} : σ^{2} < σ_{0}^{2}$	$T = \frac{1}{σ _{0}^{2}} \sum_{i = 1}^{n} (X_{i} - \overset{ˉ}{X})^{2}$	$χ_{n - 1}^{2}$	${T < χ_{n - 1}^{2} (α)}$

Lin's Notes Garden

Explorer

Hypothesis Test

Preliminaries

Hypothesis

Two Types of Error

Significance Level

Rejection Region

Definition

Test with Level $α$

Power of a Test

Uniformly Most Powerful Test (UMPT)

$p$ -value

Common Testing Problems

General Steps

Tables

Test Mean with Known Variance

Test Mean with Unknown Variance

Test Variance with Known Mean

Test Variance with Unknown Mean

Graph View

Table of Contents

Backlinks

Lin's Notes Garden

Explorer

Hypothesis Test

Preliminaries

Hypothesis

Two Types of Error

Significance Level

Rejection Region

Definition

Test with Level α

Power of a Test

Uniformly Most Powerful Test (UMPT)

p-value

Common Testing Problems

General Steps

Tables

Test Mean with Known Variance

Test Mean with Unknown Variance

Test Variance with Known Mean

Test Variance with Unknown Mean

Graph View

Table of Contents

Backlinks

Test with Level $α$

$p$ -value