Properties of Expectation, Variance, and Covariance

This is a quick summary of the properties of expectation, variance, and covariance.

For details:

Table of contents
  1. Expectation
  2. Conditional Expectation
  3. Variance
  4. Conditional Variance
  5. Covariance

Expectation

(Expectated value of a constant)E[c]=c

For the same reason E[E[X]]=E[X]

(Linearity of expectation)E[aX+bY]=aE[X]+bE[Y] (Expectation of joint RVs)E[XY]=E[X]E[Y]+Cov(X,Y)

Independence

When X and Y are independent,

Cov(X,Y)=0E[XY]=E[X]E[Y]

Conditional Expectation

(Law of total expectation)E[X]=E[E[X|Y]]

Conditional Expectation as RV

E[X|Y] is a random variable of Y.

Hence E[E[X|Y]] is aggregating over all possible values of Y, that’s why we are left with E[X].

E[a|X]=aE[aX+bY|Z]=aE[X|Z]+bE[Y|Z]

The above are linearity.

E[X|X]=XE[g(X)|X]=g(X)

The above are obvious. If you have X, you’re expected to get X.

E[X|Y,g(Y)]=E[X|Y]

The above is also obvious: if you already know Y, knowing g(Y) doesn’t change anything.

E[Xg(Y)|Y]=g(Y)E[X|Y] E[E[X|Y,Z]|Y]=E[X|Y]

These need a bit more thought.


Variance

Var(X)=E[(XE[X])2]=E[X2]E[X]2

Derivation

Just remember that E[E[X]]=E[X] and E[E[X]2]=E[X]2, because E[X] is a constant.

Var(X)=E[(XE[X])2]=E[X22XE[X]+E[X]2]=E[X2]2E[X]E[X]+E[X]2=E[X2]E[X]2

Var(aX+b)=a2Var(X)Var(aX+bY)=a2Var(X)+b2Var(Y)+2abCov(X,Y)Var(aXbY)=a2Var(X)+b2Var(Y)2abCov(X,Y)


Conditional Variance

Var(X|Y)=E[(XE[X|Y])2|Y]=E[X2|Y]E[X|Y]2

Derivation Var(X|Y)=E[(XE[X|Y])2|Y]=E[X22XE[X|Y]+E[X|Y]2|Y]()=E[X2|Y]2E[XE[X|Y]|Y]+E[E[X|Y]2|Y]=E[X2|Y]2E[X|Y]E[X|Y]+E[X|Y]2=E[X2|Y]E[X|Y]2

In (), remember that E[X|Y] is a RV of Y. See above:

E[Xg(Y)|Y]=g(Y)E[X|Y]E[g(Y)|Y]=g(Y)

(Law of total variance)Var(X)=E[Var(X|Y)]+Var(E[X|Y])

Derivation E[Var(X|Y)]=E[E[X2|Y]E[X|Y]2]=E[E[X2|Y]]E[E[X|Y]2]=E[X2]E[E[X|Y]2]

and,

Var(E[X|Y])=E[E[X|Y]2]E[E[X|Y]]2=E[E[X|Y]2]E[X]2

Then,

E[Var(X|Y)]+Var(E[X|Y])=E[X2]E[X]2=Var(X)

Covariance

Cov(X,Y)=E[(XE[X])(YE[Y])]=E[XY]E[X]E[Y]