Large deviations

12 minute read

Published:

In the previous entries we have studied the asymptotic behavior of the sums of iid random variables. The law of large numbers showed that almost surely, the averages converge to the mean, while the central limit theorem gave us a second order approximation, that is, it provided information of the size of the fluctuations around the expected value. In this entry, we discuss a third order result, namely, Cramer’s theorem on large deviations. Intuitively, Cramer’s theorem establishes asymptotic rates for the decay of the probability of observing very unlikely events.

Let {Xn} be a sequence of iid random variables with mean μ, and suppose that the variables have finite log-moment generating function

φ(λ)=logEeλX1<

for all λR.

Theorem (Cramer): Let Sn=X1++Xn and x>μ. Then

limn1nlogP{Snnx}=φ(x),

where φ is the Legendre transform of φ.

Recall that the Legendre transform of φ is given by

φ(x)=supλR{λxφ(λ)}.

We will prove the theorem in two parts: first the upper bound, which uses a standard Chernoff bound technique, then the lower bound, which uses a measure tilting argument.

Proof: (Upper bound). For λ0, we have by Markov’s inequality

P{Snnx}=P{eλSneλnx}eλnxE(eλSn).

This implies that

1nlogP{1nSnx}λx+1nlogE(eλSn)=λx+φ(λ)

and consequently,

lim supn1nlogP{1nSnx}supλ0{λxφ(λ)}.

By Hölder’s inequality, φ is convex with φ(0)=μ<x. Then the expression in the brackets is negative for for λ<0, and so we can take the supremum over all real λ.

(Lower bound). Fix ϵ>0. We will change the law of the random variables X so that the event {Snnx} is not rare under the new law, and we can use the old limit laws on it. If P is the law of X, consider the new law Q given by

dQ(X)=eφ(λ)+λXdP(X).

Then we have that for iid X1,,Xn,

dQ=dQdQ=enφ(X)+λSndP(X1,,Xn).

Note now that

φ(λ)=eφ(λ)E(XeλX)=EQ(X).

If μ<x<esssupX, we can find λ such that φ(λ)=x+ϵ2. This implies that

Q{x+ϵ>1nSnx}1,

as EQ(X)=x+ϵ2. With this we can see that

lim infn1nlogP{1nSnx}lim infn1nlogP{x+ϵ21nSnx}=lim infn1nlogEQ(enφ(λ)λSn1{x+ϵ21nSnx})φ(λ)λ(x+ϵ)+lim infn1nlogQ{x+ϵ>1nSnx}=φ(λ)λ(x+ϵ)φ(x+ϵ)

from which the result follows from letting ϵ0.

Leave a Comment