Large deviations
Published:
In the previous entries we have studied the asymptotic behavior of the sums of iid random variables. The law of large numbers showed that almost surely, the averages converge to the mean, while the central limit theorem gave us a second order approximation, that is, it provided information of the size of the fluctuations around the expected value. In this entry, we discuss a third order result, namely, Cramer’s theorem on large deviations. Intuitively, Cramer’s theorem establishes asymptotic rates for the decay of the probability of observing very unlikely events.
Let {Xn} be a sequence of iid random variables with mean μ, and suppose that the variables have finite log-moment generating function
φ(λ)=logEeλX1<∞for all λ∈R.
Theorem (Cramer): Let Sn=X1+⋯+Xn and x>μ. Then
limn→∞1nlogP{Sn≥nx}=−φ∗(x),where φ∗ is the Legendre transform of φ.
Recall that the Legendre transform of φ is given by
φ∗(x)=supλ∈R{λx−φ(λ)}.We will prove the theorem in two parts: first the upper bound, which uses a standard Chernoff bound technique, then the lower bound, which uses a measure tilting argument.
Proof: (Upper bound). For λ≥0, we have by Markov’s inequality
P{Sn≥nx}=P{eλSn≥eλnx}≤e−λnxE(eλSn).This implies that
1nlogP{1nSn≥x}≤−λx+1nlogE(eλSn)=−λx+φ(λ)and consequently,
lim supn→∞1nlogP{1nSn≥x}≤−supλ≥0{λx−φ(λ)}.By Hölder’s inequality, φ is convex with φ′(0)=μ<x. Then the expression in the brackets is negative for for λ<0, and so we can take the supremum over all real λ.
(Lower bound). Fix ϵ>0. We will change the law of the random variables X so that the event {Sn≥nx} is not rare under the new law, and we can use the old limit laws on it. If P is the law of X, consider the new law Q given by
dQ(X)=e−φ(λ)+λXdP(X).Then we have that for iid X1,…,Xn,
dQ=dQ⊗⋯⊗dQ=e−nφ(X)+λSndP(X1,…,Xn).Note now that
φ′(λ)=e−φ(λ)E(XeλX)=EQ(X).If μ<x<esssupX, we can find λ∗ such that φ′(λ∗)=x+ϵ2. This implies that
Q{x+ϵ>1nSn≥x}→1,as EQ(X)=x+ϵ2. With this we can see that
lim infn→∞1nlogP{1nSn≥x}≥lim infn→∞1nlogP{x+ϵ21nSn≥x}=lim infn→∞1nlogEQ(e−nφ(λ)−λSn1{x+ϵ21nSn≥x})≥φ(λ)−λ(x+ϵ)+lim infn→∞1nlogQ{x+ϵ>1nSn≥x}=φ(λ)−λ(x+ϵ)≥−φ∗(x+ϵ)from which the result follows from letting ϵ→0.
Leave a Comment