Covariance
Def) Covariance of X X X & Y Y Y
C o v ( X , Y ) = E [ ( X − E ( X ) ) ( Y − E ( Y ) ) ] Cov(X,Y)=E[(X-E(X))(Y-E(Y))] C o v ( X , Y ) = E [ ( X − E ( X ) ) ( Y − E ( Y ) ) ]
Properties)
① C o v ( X , Y ) = E ( X Y ) − E ( X ) E ( Y ) Cov(X,Y) = E(XY)-E(X)E(Y) C o v ( X , Y ) = E ( X Y ) − E ( X ) E ( Y )
② C o v ( X , Y ) = C o v ( Y , X ) Cov(X,Y) = Cov(Y,X) C o v ( X , Y ) = C o v ( Y , X )
③ C o v ( X , X ) = V a r ( X ) Cov(X,X) = Var(X) C o v ( X , X ) = V a r ( X )
④ C o v ( a X + b , c Y + d ) = a c C o v ( X , Y ) Cov(aX+b, cY+d) = ac Cov(X,Y) C o v ( a X + b , c Y + d ) = a c C o v ( X , Y )
⑤ C o v ( a X + b Y + c , d X + e Y + f ) = a d V a r ( X ) + a e C o v ( X , Y ) + b d C o v ( X , Y ) + b e V a r ( Y ) Cov(aX+bY+c, dX+eY+f) = adVar(X) + aeCov(X,Y) + bdCov(X,Y) + beVar(Y) C o v ( a X + b Y + c , d X + e Y + f ) = a d V a r ( X ) + a e C o v ( X , Y ) + b d C o v ( X , Y ) + b e V a r ( Y )
✔︎ If X , Y X,Y X , Y are indep., C o v ( X , Y ) Cov(X,Y) C o v ( X , Y ) = 0
(역은 X , Y X,Y X , Y 가 정규분포를 따를 때만 성립)
✔︎ Covariance는 Variance에 의존하기 때문에 두 변수 간 선형관계를 판단할 수 없다.
Correlation Coefficient
Def) Correlation Coefficient of X , Y X,Y X , Y
ρ X , Y = C o v ( X , Y ) V a r ( X ) V a r ( Y ) \rho_{X,Y} = \frac{Cov(X,Y)}{\sqrt{Var(X)} \sqrt{Var(Y)}} ρ X , Y = V a r ( X ) V a r ( Y ) C o v ( X , Y )
Properties)
① C o r r ( X , X ) = 1 Corr(X,X) = 1 C o r r ( X , X ) = 1
② C o r r ( a X + b , c Y + d ) = s i g n ( a c ) C o r r ( X , Y ) Corr(aX+b, cY+d) = sign(ac)Corr(X,Y) C o r r ( a X + b , c Y + d ) = s i g n ( a c ) C o r r ( X , Y )
s i g n ( a c ) = { 1 a c > 0 − 1 a c < 0 sign(ac) = \left\{\begin{matrix} 1 & ac>0\\ -1 & ac<0 \end{matrix}\right. s i g n ( a c ) = { 1 − 1 a c > 0 a c < 0
③ − 1 ≤ C o r r ( X , Y ) ≤ 1 -1 \leq Corr(X,Y) \leq 1 − 1 ≤ C o r r ( X , Y ) ≤ 1
Variance - Covariance Matrix
Def)
∑ = C o v ( X ‾ ) = E [ ( X ‾ − E ( X ‾ ) ) ( X ‾ − E ( X ‾ ) ) T ] = [ V a r ( X 1 ) C o v ( X 1 , X 2 ) ⋯ C o v ( X 1 , X n ) C o v ( X 2 , X 1 ) V a r ( X 2 ) ⋯ C o v ( X 2 , X n ) ⋮ ⋮ ⋮ ⋮ C o v ( X n , X 1 ) C o v ( X n , X 2 ) ⋯ V a r ( X n ) ] \sum=Cov(\underline{X})=E[(\underline{X}-E(\underline{X}))(\underline{X}-E(\underline{X}))^T]=\begin{bmatrix} Var(X_1)& Cov(X_1,X_2) & \cdots & Cov(X_1,X_n) \\ Cov(X_2,X_1)& Var(X_2) & \cdots & Cov(X_2, X_n)\\ \vdots& \vdots & \vdots & \vdots \\ Cov(X_n,X_1)& Cov(X_n,X_2)& \cdots & Var(X_n) \\ \end{bmatrix} ∑ = C o v ( X ) = E [ ( X − E ( X ) ) ( X − E ( X ) ) T ] = ⎣ ⎢ ⎢ ⎢ ⎢ ⎡ V a r ( X 1 ) C o v ( X 2 , X 1 ) ⋮ C o v ( X n , X 1 ) C o v ( X 1 , X 2 ) V a r ( X 2 ) ⋮ C o v ( X n , X 2 ) ⋯ ⋯ ⋮ ⋯ C o v ( X 1 , X n ) C o v ( X 2 , X n ) ⋮ V a r ( X n ) ⎦ ⎥ ⎥ ⎥ ⎥ ⎤
Theorem)
Suppose X ‾ = ( X 1 , . . . , X n ) T , A ‾ = m × n m a t r i x , b ‾ = m × 1 v e c t o r \underline{X}=(X_1,...,X_n)^T,\underline{A}=m \times n \space matrix, \underline{b}= m \times 1 \space vector X = ( X 1 , . . . , X n ) T , A = m × n m a t r i x , b = m × 1 v e c t o r
① E ( A ‾ X ‾ + b ‾ ) = A ‾ E ( X ‾ ) + b ‾ E(\underline{A} \underline{X} + \underline{b}) = \underline{A}E(\underline{X}) + \underline{b} E ( A X + b ) = A E ( X ) + b
② C o v ( X ‾ ) = E ( X ‾ X ‾ T ) − E ( X ‾ ) E T ( X ‾ ) Cov(\underline{X}) = E(\underline{X} \underline{X}^T) - E(\underline{X})E^T(\underline{X}) C o v ( X ) = E ( X X T ) − E ( X ) E T ( X )
③ C o v ( A ‾ X ‾ + b ‾ ) = A ‾ C o v ( X ‾ ) A ‾ T Cov(\underline{A} \underline{X} + \underline{b}) = \underline{A} Cov(\underline{X}) \underline{A}^T C o v ( A X + b ) = A C o v ( X ) A T