ch 4. random variables and expectations

원준식 · September 19, 2022

Probability and Statistics

Introduction to Probability and Statistics for Engineers and Scientists (Sheldon Ross), 6th Edition


These are notes taken while following the lecture for this chapter.

random variable (RV): a function from the sample space to the real numbers

continuous RV
Discrete RV

cumulative distribution function (cdf) of RV $X$

$F(x) = P(X≤x)$

4.2 types of RV

1. Discrete RV

probability mass function(pmf): $P_X(a) = P(X=a)$

$Σ_iP(x_i)=1$

2. Continuous RV

probability density function(pdf): $f(x)=\frac{dF(x)}{dx}$ (the derivative of the cdf)

$P(a<X≤b)=∫^b_af(x)dx$ → $f(x)$ is a probability density, not a probability!

$P(X=a)=0$
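As a rough numerical check of the cdf/pdf relations above, the sketch below uses an exponential distribution with rate 2 as an assumed example (it is not from the lecture). Its cdf is $F(x)=1-e^{-2x}$ for $x≥0$, so its pdf is $f(x)=2e^{-2x}$. The helper `integrate` is a hypothetical midpoint-rule routine written only for this illustration.

```python
import math

def cdf(x):
    """cdf of an Exponential(rate=2) RV (assumed example): F(x) = 1 - e^{-2x}."""
    return 1.0 - math.exp(-2.0 * x) if x >= 0 else 0.0

def pdf(x):
    """pdf of the same RV: f(x) = dF/dx = 2 e^{-2x} for x >= 0."""
    return 2.0 * math.exp(-2.0 * x) if x >= 0 else 0.0

def integrate(f, a, b, n=10_000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

a, b = 0.5, 2.0
prob_from_pdf = integrate(pdf, a, b)   # integral of f over (a, b]
prob_from_cdf = cdf(b) - cdf(a)        # F(b) - F(a)

# P(X = a) is the integral over a zero-width interval, which is 0.
prob_single_point = integrate(pdf, a, a)

# The density itself may exceed 1, so it cannot be a probability.
density_at_zero = pdf(0.0)

print(prob_from_pdf, prob_from_cdf, prob_single_point, density_at_zero)
```

The two ways of computing $P(a<X≤b)$ should agree up to the integration error, while the density at 0 is larger than 1.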

4.3 jointly distributed random variables

joint cumulative distribution function(joint cdf) $F_{XY}(x,y)=P(X≤x, Y≤y)$

marginal cdf $F_{X}(x)=P(X≤x) = P(X≤x, Y<∞) = F_{XY}(x,∞)$

$F_Y(y)=F_{XY}(∞,y)$

If X and Y are discrete RVs:
- joint pmf $P(x_i, y_j)=P(X=x_i, Y=y_j)$
- marginal pmf $P_X(x_i)=Σ_jP(x_i, y_j)$ , $P_Y(y_j)=Σ_iP(x_i, y_j)$
  - Given the joint pmf, the marginal pmfs can always be computed.
  - Can the joint pmf be recovered from the marginal pmfs? In general no; it is possible only when X and Y are independent.
If X and Y are continuous RVs:
- joint pdf $f_{XY}(x,y) = \frac{∂^2F_{XY}(x,y)}{∂x∂y}$ → $F_{XY}(a, b)=∫^b_{-∞}∫^a_{-∞}f_{XY}(x, y)dxdy$
- marginal pdf $f_X(x)=∫^∞_{-∞}f_{XY}(x, y)dy$ , $f_Y(y)=∫^∞_{-∞}f_{XY}(x, y)dx$
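To make the joint/marginal relationship concrete, here is a minimal sketch for the discrete case. The two joint pmfs are hypothetical examples chosen for illustration; they differ from each other but produce identical marginals, which is why the marginals alone do not determine the joint pmf.

```python
from collections import defaultdict

def marginals(joint):
    """Return (P_X, P_Y) from a joint pmf given as {(x, y): probability}."""
    p_x, p_y = defaultdict(float), defaultdict(float)
    for (x, y), p in joint.items():
        p_x[x] += p   # sum over y for fixed x
        p_y[y] += p   # sum over x for fixed y
    return dict(p_x), dict(p_y)

# Two hypothetical joint pmfs on {0, 1} x {0, 1}.
joint_independent = {(0, 0): 0.25, (0, 1): 0.25, (1, 0): 0.25, (1, 1): 0.25}
joint_dependent = {(0, 0): 0.5, (0, 1): 0.0, (1, 0): 0.0, (1, 1): 0.5}

print(marginals(joint_independent))
print(marginals(joint_dependent))
```

Both calls give the same pair of marginals even though the joint pmfs are different.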

4.3.1 independent random variables

$F_{XY}(a,b) = F_X(a)F_Y(b) \quad \text{for all } a, b$

Two RVs $X$ and $Y$ are independent when the joint cdf factors into the product of the marginal cdfs.

discrete RV
- $P_{XY}(x,y)=P_X(x)P_Y(y)$
continuous RV
- $f_{XY}(x,y)=f_X(x)f_Y(y)$
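The factorization condition for discrete RVs can be checked mechanically. The function `is_independent` below is a hypothetical helper written for this note, and the two pmfs are assumed examples: two separate fair coin flips, and one flip copied into both coordinates.

```python
import math

def is_independent(joint, tol=1e-12):
    """Check P(x, y) == P_X(x) * P_Y(y) for every pair of a joint pmf dict."""
    xs = sorted({x for x, _ in joint})
    ys = sorted({y for _, y in joint})
    # Marginals; pairs missing from the dict are treated as probability 0.
    p_x = {x: sum(joint.get((x, y), 0.0) for y in ys) for x in xs}
    p_y = {y: sum(joint.get((x, y), 0.0) for x in xs) for y in ys}
    return all(
        math.isclose(joint.get((x, y), 0.0), p_x[x] * p_y[y], abs_tol=tol)
        for x in xs for y in ys
    )

# Hypothetical examples: two fair coin flips vs. two copies of the same flip.
two_flips = {(x, y): 0.25 for x in (0, 1) for y in (0, 1)}
same_flip = {(0, 0): 0.5, (1, 1): 0.5}

print(is_independent(two_flips), is_independent(same_flip))
```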

4.3.2 conditional distributions

discrete RV
- conditional pmf $P_{X|Y}(x|y)=P(X=x|Y=y)=\frac{P(X=x, Y=y)}{P(Y=y)} = \frac{P(x, y)}{P_Y(y)}$
continuous RV
- conditional pdf $f_{X|Y}(x|y) = \frac{f_{XY}(x, y)}{f_Y(y)}$
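A small sketch of the conditional pmf formula for the discrete case follows. The joint pmf is an assumed example (X is a fair coin flip and Y is X plus a second fair flip), and `conditional_pmf_x_given_y` is a hypothetical helper name.

```python
def conditional_pmf_x_given_y(joint, y):
    """Return P_{X|Y}(. | y) from a joint pmf dict {(x, y): probability}."""
    p_y = sum(p for (_, yj), p in joint.items() if yj == y)   # marginal P_Y(y)
    if p_y == 0:
        raise ValueError("conditioning on an event of probability zero")
    return {x: p / p_y for (x, yj), p in joint.items() if yj == y}

# Hypothetical joint pmf: X = first fair coin flip, Y = X + second flip.
joint = {(0, 0): 0.25, (0, 1): 0.25, (1, 1): 0.25, (1, 2): 0.25}

cond_given_1 = conditional_pmf_x_given_y(joint, 1)
cond_given_2 = conditional_pmf_x_given_y(joint, 2)
print(cond_given_1, cond_given_2)
```

Dividing by $P_Y(y)$ renormalizes the slice of the joint pmf at $Y=y$ so that it sums to 1.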

4.4 expectation

mean of X

expectation of X

$E[X]= \sum_ix_iP(X=x_i)$ (discrete RV)

$E[X]= \int^\infty_{-\infty}xf_X(x)dx = \lim\sum xP(x<X<x+dx)$ (continuous RV)
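The two expectation formulas can be tried out numerically. This sketch assumes a fair six-sided die for the discrete case and a uniform RV on $[0,1]$ for the continuous case; neither example comes from the lecture, and `expectation` is a hypothetical Riemann-sum helper that mirrors the limit-of-sums form above.

```python
from fractions import Fraction

# Discrete case: a fair six-sided die (assumed example).
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}
mean_die = sum(x * p for x, p in die_pmf.items())   # sum_i x_i P(X = x_i)

# Continuous case: X uniform on [0, 1], so f_X(x) = 1 there and 0 elsewhere.
def uniform_pdf(x):
    return 1.0 if 0.0 <= x <= 1.0 else 0.0

def expectation(pdf, lo, hi, n=100_000):
    """Riemann-sum approximation of the integral of x * f(x) over [lo, hi]."""
    dx = (hi - lo) / n
    total = 0.0
    for i in range(n):
        x = lo + (i + 0.5) * dx
        total += x * pdf(x) * dx   # x * P(x < X < x + dx), approximately
    return total

mean_uniform = expectation(uniform_pdf, 0.0, 1.0)
print(mean_die, mean_uniform)
```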

For a discrete RV $X$ with $P(X=x_i)=P_i$ ( $i=1,2,…,n$ ):

amount of information of $x_i$ = $\log_2\frac{1}{P_i}$

average information of RV $X$ (entropy): $H(X)=\sum^n_{i=1}P_i\log_2\frac{1}{P_i}$
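A short sketch of the entropy formula, using a few assumed distributions (a fair coin, a biased coin, and a fair die). The function name `entropy_bits` is hypothetical; outcomes with probability 0 are skipped, following the usual convention that they contribute nothing.

```python
import math

def entropy_bits(probs):
    """H(X) = sum_i P_i * log2(1 / P_i); terms with P_i = 0 contribute 0."""
    return sum(p * math.log2(1.0 / p) for p in probs if p > 0)

fair_coin = [0.5, 0.5]
biased_coin = [0.9, 0.1]
fair_die = [1 / 6] * 6

for name, probs in [("fair coin", fair_coin),
                    ("biased coin", biased_coin),
                    ("fair die", fair_die)]:
    print(name, entropy_bits(probs))
```

Rare outcomes carry more information individually, but the biased coin has lower entropy than the fair coin because its outcome is more predictable on average.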

4.5 properties of the expected value

$E[aX+b] = aE[X]+b$

$E[X^n]$ = nth moment of $X$

4.5.1 expected value of sums of random variables

$E[aX+bY] = aE[X]+bE[Y]$

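Predicting the value of an RV by its mean ($µ$) minimizes the mean square error (MSE, the expected value of the squared prediction error).

Proof)

$c$ = prediction of $X$

$µ=E[X]$

$E[(X-c)^2] = E[(X-µ+µ-c)^2] = E[(X-µ)^2] + 2(µ-c)E[X-µ] + (µ-c)^2 = E[(X-µ)^2] + (µ-c)^2 ≥ E[(X-µ)^2]$

The cross term vanishes because $E[X-µ]=0$, and equality holds exactly when $c=µ$.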
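The claim can also be checked by brute force on a small example. The pmf below is hypothetical, and exact fractions are used so the comparison has no rounding error; the candidate predictions `c` are an arbitrary grid chosen for illustration.

```python
from fractions import Fraction

# Hypothetical discrete RV given by its pmf.
pmf = {0: Fraction(1, 2), 1: Fraction(1, 4), 4: Fraction(1, 4)}

def mse(c):
    """E[(X - c)^2] for the pmf above."""
    return sum(p * (x - c) ** 2 for x, p in pmf.items())

mu = sum(x * p for x, p in pmf.items())   # E[X]
variance = mse(mu)                        # E[(X - mu)^2]

# Candidate predictions c = 0, 1/4, 2/4, ..., 4.
candidates = [Fraction(k, 4) for k in range(17)]
best_c = min(candidates, key=mse)

# Each MSE should equal E[(X - mu)^2] + (mu - c)^2.
decomposition_holds = all(mse(c) == variance + (mu - c) ** 2 for c in candidates)

print(mu, best_c, decomposition_holds)
```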

4.6 variance

$E[X]=µ$

$Var(X)=E[(X-µ)^2] = E[X^2]-µ^2$

4.7 covariance and variance of sums of RVs

When there are several RVs, we are often interested in the distribution of their sum.

In general, $Var(X+Y) ≠ Var(X) + Var(Y)$

So what is the variance of a sum? The answer involves the covariance.

covariance of $X$ and $Y$

$Cov(X, Y) = E[(X-µ_X)(Y-µ_Y)]$

$= E[XY - µ_X Y - µ_Y X + µ_Xµ_Y]$

$= E[XY]-µ_Xµ_Y-µ_Xµ_Y+µ_Xµ_Y$

$= E[XY] - E[X]E[Y]$

If $E[XY]=E[X]E[Y]$, then $X$ and $Y$ are uncorrelated ( $Cov(X,Y)=0$ ).
If $X$ and $Y$ are independent, then they are uncorrelated; the converse does not hold in general.

correlation of $X$ and $Y$ = $E[XY]$

correlation coefficient of $X$ and $Y$ = $\frac{Cov(X, Y)}{\sqrt{Var(X)Var(Y)}}$

$Var(X+Y)=Var(X)+Var(Y)+2Cov(X,Y)$
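The variance-of-a-sum identity can be verified exactly on a small joint pmf. The pmf below is a hypothetical example of two dependent RVs, and `expect` is a helper name introduced only for this sketch.

```python
from fractions import Fraction

# Hypothetical joint pmf {(x, y): probability} of two dependent RVs.
joint = {
    (0, 0): Fraction(1, 2),
    (1, 0): Fraction(1, 4),
    (1, 1): Fraction(1, 4),
}

def expect(g):
    """E[g(X, Y)] under the joint pmf above."""
    return sum(p * g(x, y) for (x, y), p in joint.items())

mean_x = expect(lambda x, y: x)
mean_y = expect(lambda x, y: y)
var_x = expect(lambda x, y: (x - mean_x) ** 2)
var_y = expect(lambda x, y: (y - mean_y) ** 2)
cov_xy = expect(lambda x, y: x * y) - mean_x * mean_y   # E[XY] - E[X]E[Y]

# Var(X + Y) computed directly from the definition.
mean_sum = mean_x + mean_y
var_sum = expect(lambda x, y: (x + y - mean_sum) ** 2)

print(var_sum, var_x + var_y, var_x + var_y + 2 * cov_xy)
```

Here the covariance is nonzero, so $Var(X)+Var(Y)$ alone does not match $Var(X+Y)$, while adding $2Cov(X,Y)$ does.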

4.8 moment generating functions(MGF)

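$Φ(t)=E[e^{tX}] = \sum_xe^{tx}p(x)$ (discrete RV)

$Φ(t)=E[e^{tX}] = \int^{\infty}_{-\infty}e^{tx}f(x)dx$ (continuous RV)

$Φ'(t)= \frac{d}{dt}E[e^{tX}] = E[\frac{d}{dt}e^{tX}] = E[Xe^{tX}]$

$Φ'(0)= E[X]$

Once the MGF is known, $E[X]$ is easy to obtain: differentiate the MGF and evaluate at $t=0$. (Computing $E[X]$ from the pdf requires an integral, whereas the MGF only requires a derivative, and differentiation is usually easier than integration.)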
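As a quick sanity check of $Φ'(0)=E[X]$, the sketch below builds the MGF of a fair six-sided die (an assumed example) and differentiates it numerically. The central-difference helper `derivative_at_zero` is hypothetical and only approximates the derivative.

```python
import math

# Fair six-sided die as an assumed example pmf.
pmf = {face: 1 / 6 for face in range(1, 7)}

def mgf(t):
    """Phi(t) = E[e^{tX}] = sum_x e^{tx} p(x)."""
    return sum(math.exp(t * x) * p for x, p in pmf.items())

def derivative_at_zero(f, h=1e-5):
    """Central-difference approximation of f'(0)."""
    return (f(h) - f(-h)) / (2 * h)

mean_from_mgf = derivative_at_zero(mgf)                # approximately Phi'(0)
mean_direct = sum(x * p for x, p in pmf.items())       # E[X] from the pmf

print(mean_from_mgf, mean_direct)
```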

4.9 Chebyshev’s inequality and the weak law of large numbers

Markov’s inequality

If $X$ is a nonnegative RV and $a>0$, then

$P(X>a) ≤ \frac{E[X]}{a}$

So $E[X]$ alone already says a lot about the distribution.

Proof) (for a continuous $X$ with pdf $f$)

$E[X] = \int^{\infty}_0xf(x)dx = \int^a_0xf(x)dx + \int^{\infty}_axf(x)dx$

$≥ \int^{\infty}_axf(x)dx ≥ \int^{\infty}_aaf(x)dx = a\int^{\infty}_af(x)dx = aP(X>a)$

Chebyshev’s inequality

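$E[X] = µ$ , $Var(X)=σ^2$

$P(|X-µ| ≥ k) ≤ \frac{σ^2}{k^2}$ for any $k>0$

Proof)

Apply Markov’s inequality (which holds with $≥$ as well as $>$) with $(X-µ)^2$ in place of $X$ and $k^2$ in place of $a$: $P(|X-µ| ≥ k) = P((X-µ)^2 ≥ k^2) ≤ \frac{E[(X-µ)^2]}{k^2} = \frac{σ^2}{k^2}$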
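Both inequalities can be checked exactly on a small example. The sketch assumes a fair six-sided die as the nonnegative RV, with $a=4$ and $k=2$ chosen arbitrarily; `prob` is a hypothetical helper name.

```python
from fractions import Fraction

# Fair six-sided die as an assumed example of a nonnegative RV.
pmf = {face: Fraction(1, 6) for face in range(1, 7)}

def prob(event):
    """P(event(X)) under the pmf above."""
    return sum(p for x, p in pmf.items() if event(x))

mu = sum(x * p for x, p in pmf.items())                # E[X] = 7/2
var = sum((x - mu) ** 2 * p for x, p in pmf.items())   # Var(X) = 35/12

a, k = 4, 2

# Markov: P(X > a) versus E[X] / a.
markov_lhs = prob(lambda x: x > a)
markov_rhs = mu / a

# Chebyshev: P(|X - mu| >= k) versus sigma^2 / k^2.
cheb_lhs = prob(lambda x: abs(x - mu) >= k)
cheb_rhs = var / k ** 2

print(markov_lhs, markov_rhs, cheb_lhs, cheb_rhs)
```

In this example both bounds hold with a lot of slack, which is typical: they use only the mean (and variance) and must cover every distribution with those moments.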

The weak law of large numbers

$X_1, …, X_n$ : iid (independent and identically distributed) RVs

Since they all share the same distribution, $E[X_i] = µ$ for every $i$.

For any $ε>0$,

$\lim_{n\to\infty}P\left(\left|\frac{X_1 + \dots + X_n}{n}- µ\right| > ε\right)=0$
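A simulation can illustrate (not prove) the weak law. The sketch below assumes iid fair-die rolls with $µ=3.5$ and $ε=0.25$, and estimates the probability of a large deviation by repeating the experiment many times. The function name, trial count, and seed are arbitrary choices for this illustration.

```python
import random

def deviation_frequency(n, eps, trials=500, seed=0):
    """Estimate P(|sample mean - mu| > eps) for n iid fair-die rolls (mu = 3.5)."""
    rng = random.Random(seed)   # fixed seed so the run is reproducible
    mu = 3.5
    count = 0
    for _ in range(trials):
        sample_mean = sum(rng.randint(1, 6) for _ in range(n)) / n
        if abs(sample_mean - mu) > eps:
            count += 1
    return count / trials

# Estimated deviation probability for increasing sample sizes.
frequencies = {n: deviation_frequency(n, eps=0.25) for n in (10, 100, 1000)}
for n, freq in frequencies.items():
    print(n, freq)
```

The estimated frequency should shrink toward 0 as $n$ grows, in line with the limit above.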
