Sufficient Statistic

A statistic is said to be sufficient for a parameter if it captures all the information needed to estimate that parameter from the sample data.

More formally, a statistic is sufficient for a parameter is the conditional distribution of the sample data given does not depend on . For example, in the case of a normal distribution with known variance and unknown mean, the sample mean is a sufficient statistic for the mean.

Fisher-Neyman Factorization Theorem

A statistic is sufficient for if and only if the likelihood function can be factored as , where depends only on and , and does not depend on .

Complete Statistic

A statistic is complete of no non-trivial function of that statistic has an expected value of zero for all parameter values unless the function itself is almost zero.

In simper terms, if for any measurable function , for all values of , then it must be that . This means that the complete statistic contains all possible information about the parameter.

Rao-Blackwell Theorem

The conditional expected value of an unbiased estimator given a sufficient statistic is another unbiased estimator that's at least as good.

That is, if is a sufficient statistic, is an unbiased estimator of , then we have

  • is unbiased: .
  • At least as good:

Lehmann-Scheffé Theorem

The Lehmann–Scheffé theorem has an additional hypothesis that the sufficient statistic is complete, i.e. it admits no unbiased estimators of zero. It also has an additional conclusion: the estimator you get is the unique best unbiased estimator.

That is, if is a sufficient complete statistic, is an unbiased estimator of , then we have

  • is unbiased: .
  • Best: for any unbiased estimator , i.e. is the UMVUE of
  • Unique: is the only UMVUE of

Exponential Distribution Family

For the exponential family, which includes distributions like the normal, Poisson, and exponential distributions, a sufficient statistic can often be expressed in terms of a function of the data. For instance, if are i.i.d. r.v.s from an exponential family with PDF

where are functions of , then is a sufficient complete statistic of

For example, for i.i.d. , a sufficient complete statistic of is , then is also a sufficient complete statistic. Therefore is the UMVUE of , and is the UMVUE of .