Correlation Coefficient

deejayosamu·2025년 7월 2일

통계 기본 개념

목록 보기
8/20

Covariance

Def) Covariance of XX & YY
Cov(X,Y)=E[(XE(X))(YE(Y))]Cov(X,Y)=E[(X-E(X))(Y-E(Y))]

Properties)
Cov(X,Y)=E(XY)E(X)E(Y)Cov(X,Y) = E(XY)-E(X)E(Y)
Cov(X,Y)=Cov(Y,X)Cov(X,Y) = Cov(Y,X)
Cov(X,X)=Var(X)Cov(X,X) = Var(X)
Cov(aX+b,cY+d)=acCov(X,Y)Cov(aX+b, cY+d) = ac Cov(X,Y)
Cov(aX+bY+c,dX+eY+f)=adVar(X)+aeCov(X,Y)+bdCov(X,Y)+beVar(Y)Cov(aX+bY+c, dX+eY+f) = adVar(X) + aeCov(X,Y) + bdCov(X,Y) + beVar(Y)

✔︎ If X,YX,Y are indep., Cov(X,Y)Cov(X,Y) = 0
(역은 X,YX,Y가 정규분포를 따를 때만 성립)
✔︎ Covariance는 Variance에 의존하기 때문에 두 변수 간 선형관계를 판단할 수 없다.

Correlation Coefficient

Def) Correlation Coefficient of X,YX,Y
ρX,Y=Cov(X,Y)Var(X)Var(Y)\rho_{X,Y} = \frac{Cov(X,Y)}{\sqrt{Var(X)} \sqrt{Var(Y)}}

Properties)
Corr(X,X)=1Corr(X,X) = 1
Corr(aX+b,cY+d)=sign(ac)Corr(X,Y)Corr(aX+b, cY+d) = sign(ac)Corr(X,Y)
sign(ac)={1ac>01ac<0sign(ac) = \left\{\begin{matrix} 1 & ac>0\\ -1 & ac<0 \end{matrix}\right.
1Corr(X,Y)1-1 \leq Corr(X,Y) \leq 1

Variance - Covariance Matrix

Def)
=Cov(X)=E[(XE(X))(XE(X))T]=[Var(X1)Cov(X1,X2)Cov(X1,Xn)Cov(X2,X1)Var(X2)Cov(X2,Xn)Cov(Xn,X1)Cov(Xn,X2)Var(Xn)]\sum=Cov(\underline{X})=E[(\underline{X}-E(\underline{X}))(\underline{X}-E(\underline{X}))^T]=\begin{bmatrix} Var(X_1)& Cov(X_1,X_2) & \cdots & Cov(X_1,X_n) \\ Cov(X_2,X_1)& Var(X_2) & \cdots & Cov(X_2, X_n)\\ \vdots& \vdots & \vdots & \vdots \\ Cov(X_n,X_1)& Cov(X_n,X_2)& \cdots & Var(X_n) \\ \end{bmatrix}

Theorem)
Suppose X=(X1,...,Xn)T,A=m×n matrix,b=m×1 vector\underline{X}=(X_1,...,X_n)^T,\underline{A}=m \times n \space matrix, \underline{b}= m \times 1 \space vector
E(AX+b)=AE(X)+bE(\underline{A} \underline{X} + \underline{b}) = \underline{A}E(\underline{X}) + \underline{b}
Cov(X)=E(XXT)E(X)ET(X)Cov(\underline{X}) = E(\underline{X} \underline{X}^T) - E(\underline{X})E^T(\underline{X})
Cov(AX+b)=ACov(X)ATCov(\underline{A} \underline{X} + \underline{b}) = \underline{A} Cov(\underline{X}) \underline{A}^T

0개의 댓글