[Linear Algebra] Lecture 17: Orthogonal matrices and Gram-Schmidt

이재호 · March 11, 2025


https://ocw.mit.edu/courses/18-06-linear-algebra-spring-2010/video_galleries/video-lectures/

The lecture begins by summarizing the following.

\text{<orthonormal vectors>} \\ q_i^Tq_j= \begin{cases} 0, & \text{if } i \ne j \\ 1, & \text{if } i = j \end{cases}

\text{orthonormal basis} : q_1,q_2,\dots,q_n \\ \text{orthogonal matrix} : Q

\text{how can $A \rightarrow Q$ ?} \rightarrow \text{``Gram-Schmidt''}

That is, orthonormal vectors satisfy the condition above, and the matrix $Q$ whose columns form such an orthonormal basis can be obtained from $A$ by the Gram-Schmidt process.

Let's look at this concretely.

Q= \begin{bmatrix} | & | & | & | \\ q_1 & q_2 & ... & q_n \\ | & | & | & | \\ \end{bmatrix}

Q^TQ = \begin{bmatrix} -(q_1)^T- \\ -(q_2)^T- \\ \vdots \\ -(q_n)^T- \\ \end{bmatrix} \begin{bmatrix} | & | & | & | \\ q_1 & q_2 & \cdots & q_n \\ | & | & | & | \\ \end{bmatrix} = \begin{bmatrix} 1 & 0 & \cdots & 0 \\ 0 & 1 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & 1 \\ \end{bmatrix} = I

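The computation above shows that $Q^TQ=I$: entry $(i,j)$ of the product is $q_i^Tq_j$, which is 1 on the diagonal and 0 elsewhere.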

This leads to the following.

\text{If $Q$ is square then $Q^TQ=I$ tells us $Q^T=Q^{-1}$}

\text{Ex.} : Q=\begin{bmatrix}0 & 0 & 1 \\1 & 0 & 0 \\0 & 1 & 0 \\\end{bmatrix}, Q=\begin{bmatrix}\cos\theta & -\sin\theta \\ \sin\theta & \cos\theta\end{bmatrix}, Q=\frac{1}{\sqrt{2}}\begin{bmatrix}1 & 1 \\ 1 & -1\end{bmatrix} \\ Q=\frac{1}{2}\begin{bmatrix}1 & 1 & 1 & 1\\ 1 & -1 & 1 & -1 \\ 1 & 1 & -1 & -1\\ 1 & -1 & -1 & 1\\ \end{bmatrix}
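
As a quick numerical sanity check, the sketch below verifies $Q^TQ=I$ for the example matrices above. The helper names (`transpose`, `matmul`, `is_identity`), the angle `theta`, and the tolerance are illustrative assumptions, not part of the lecture; plain Python lists stand in for matrices.

```python
import math

def transpose(M):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*M)]

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def is_identity(M, tol=1e-12):
    """Check that M is numerically the identity matrix."""
    return all(abs(M[i][j] - (1.0 if i == j else 0.0)) <= tol
               for i in range(len(M)) for j in range(len(M[0])))

theta = 0.7            # arbitrary angle, chosen only for illustration
s = 1 / math.sqrt(2)
hadamard4 = [[1, 1, 1, 1], [1, -1, 1, -1], [1, 1, -1, -1], [1, -1, -1, 1]]

examples = {
    "permutation": [[0, 0, 1], [1, 0, 0], [0, 1, 0]],
    "rotation": [[math.cos(theta), -math.sin(theta)],
                 [math.sin(theta), math.cos(theta)]],
    "2x2 scaled by 1/sqrt(2)": [[s, s], [s, -s]],
    "4x4 scaled by 1/2": [[0.5 * v for v in row] for row in hadamard4],
}

# For each example, test whether Q^T Q equals the identity
results = {name: is_identity(matmul(transpose(Q), Q)) for name, Q in examples.items()}
for name, ok in results.items():
    print(f"{name}: Q^T Q = I is {ok}")
```

Each of these matrices is square, so the same check also confirms $Q^T=Q^{-1}$.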

What if $Q$ is rectangular (not square)? How do we obtain such a $Q$? (The method for this is Gram-Schmidt.)

Q= \frac{1}{3} \begin{bmatrix} 1 & -2 \\ 2 & -1 \\ 2 & 2 \\ \end{bmatrix} \rightarrow \frac{1}{3} \begin{bmatrix} 1 & -2 & 2 \\ 2 & -1 & -2 \\ 2 & 2 & 1 \\ \end{bmatrix}

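The step above, extending the two orthonormal columns with a third one, gives a rough picture of what Gram-Schmidt does.
Let's look at it in more detail.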

\text{Suppose : $Q$ has orthonormal columns.} \\ \text{Project onto its column space.}

P=Q(Q^TQ)^{-1}Q^T=QI^{-1}Q^T=QQ^T \quad \text{($=I$ if $Q$ is square)}

(QQ^T)(QQ^T)=QIQ^T=QQ^T
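
To make this concrete, the following sketch takes the $3 \times 2$ matrix $Q$ from the rectangular example (columns $\frac{1}{3}(1,2,2)$ and $\frac{1}{3}(-2,-1,2)$), forms $QQ^T$, and checks three things: $Q^TQ=I$, $(QQ^T)^2=QQ^T$, and $QQ^T \ne I$ because $Q$ is not square. The helper names and the tolerance are assumptions made for illustration.

```python
def transpose(M):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*M)]

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def close(X, Y, tol=1e-12):
    """Entrywise comparison of two matrices of the same shape."""
    return all(abs(x - y) <= tol for rx, ry in zip(X, Y) for x, y in zip(rx, ry))

# Rectangular Q with orthonormal columns (the 3x2 example, scaled by 1/3)
Q = [[1 / 3, -2 / 3],
     [2 / 3, -1 / 3],
     [2 / 3, 2 / 3]]

I2 = [[1.0, 0.0], [0.0, 1.0]]
I3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

QtQ = matmul(transpose(Q), Q)   # 2x2
P = matmul(Q, transpose(Q))     # 3x3 projection onto the column space of Q

print("Q^T Q == I :", close(QtQ, I2))
print("P P == P   :", close(matmul(P, P), P))
print("P == I     :", close(P, I3))
```

The last check fails by design: with only two columns, $QQ^T$ projects onto a plane in $\mathbb{R}^3$ rather than acting as the identity.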

A^T A \hat x=A^Tb \\ A \rightarrow Q

Q^TQ\hat x=Q^Tb

I\hat x=Q^Tb

\therefore \hat x=Q^Tb

\therefore \hat x_i=q_i^Tb
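
The payoff is that each component of $\hat x$ is a single dot product, with no system to solve. The sketch below illustrates this with the orthonormal columns $\frac{1}{3}(1,2,2)$ and $\frac{1}{3}(-2,-1,2)$; the right-hand side $b=(3,0,3)$ and the helper `dot` are assumptions chosen for illustration. It also checks that the error $b-Q\hat x$ is perpendicular to both columns.

```python
def dot(u, v):
    """Dot product of two vectors stored as lists."""
    return sum(x * y for x, y in zip(u, v))

# Orthonormal columns q_1, q_2 (the 3x2 example, scaled by 1/3)
q1 = [1 / 3, 2 / 3, 2 / 3]
q2 = [-2 / 3, -1 / 3, 2 / 3]
b = [3.0, 0.0, 3.0]   # arbitrary right-hand side, chosen for illustration

# x_hat_i = q_i^T b: one dot product per component
x_hat = [dot(q1, b), dot(q2, b)]

# Projection p = Q x_hat and error e = b - p
p = [x_hat[0] * u + x_hat[1] * v for u, v in zip(q1, q2)]
e = [bi - pi for bi, pi in zip(b, p)]

print("x_hat =", x_hat)
print("q1.e =", dot(q1, e), " q2.e =", dot(q2, e))
```

Up to floating-point rounding, this gives $\hat x=(3,0)$, and both dot products with the error vanish.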

Now let's use these results to walk through the Gram-Schmidt procedure.

\text{Gram-Schmidt}

\text{(independent vectors) $a, b$} \rightarrow \text{(orthogonal vectors) $A,B$} \\ \rightarrow \text{orthonormal vectors $q_1=\frac{A}{||A||}$, $q_2=\frac{B}{||B||}$}

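Geometrically, we start from two independent vectors $a$ and $b$.
First, we can simply take $A=a$ as the first orthogonal vector. What remains is to make $B$ perpendicular to $A$. So $B$ plays the role of the error vector $e$ from projection: the part of $b$ left over after removing its projection onto $A$.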

Let's find $B$ using the projection formula from the previous lecture.

B=b-\frac{A^Tb}{A^TA}A

We can also verify directly that $A$ and $B$ are orthogonal.

A^TB=A^T(b-\frac{A^Tb}{A^TA}A)=A^Tb - \frac{A^Tb}{A^TA}A^TA=A^Tb-A^Tb=0

\therefore A^TB=0 \rightarrow \text{$A \perp B$}

Once the orthogonal vectors are found, the orthonormal vectors follow from $q_1=\frac{A}{||A||}$ and $q_2=\frac{B}{||B||}$, and we are done.

So far we have handled only two vectors. What if there are more? For example, suppose we apply Gram-Schmidt to three vectors.

independent vectors $a,b,c$
orthogonal vectors
- $A=a$
- $B=b-\frac{A^Tb}{A^TA}A$
- $C=c - \frac{A^Tc}{A^TA}A - \frac{B^Tc}{B^TB}B$
orthonormal vectors
- $q_1=\frac{A}{||A||}$
- $q_2=\frac{B}{||B||}$
- $q_3=\frac{C}{||C||}$

As shown above, $B$ is computed as before, and $C$ is obtained from $c$ by subtracting its components along both $A$ and $B$.
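
The pattern generalizes to any number of independent vectors: subtract from each new vector its projection onto every orthogonal vector found so far, then normalize. Below is a minimal sketch of this classical Gram-Schmidt procedure in plain Python. The function name `gram_schmidt` and the third vector `c = (1, 0, 0)` are assumptions for illustration, and the code assumes the inputs are linearly independent (otherwise a zero vector appears and normalization divides by zero).

```python
import math

def dot(u, v):
    """Dot product of two vectors stored as lists."""
    return sum(x * y for x, y in zip(u, v))

def gram_schmidt(vectors):
    """Classical Gram-Schmidt: return orthonormal vectors q_1, ..., q_n.

    Assumes the input vectors are linearly independent.
    """
    orthogonal = []  # holds A, B, C, ...
    for v in vectors:
        w = list(v)
        for u in orthogonal:
            coeff = dot(u, v) / dot(u, u)   # e.g. A^T b / A^T A
            w = [wi - coeff * ui for wi, ui in zip(w, u)]
        orthogonal.append(w)
    # Normalize each orthogonal vector to unit length
    return [[x / math.sqrt(dot(w, w)) for x in w] for w in orthogonal]

a = [1.0, 1.0, 1.0]
b = [1.0, 0.0, 2.0]
c = [1.0, 0.0, 0.0]   # assumed third vector, independent of a and b

qs = gram_schmidt([a, b, c])
for q in qs:
    print([round(x, 4) for x in q])
```

This version follows the formulas above literally. In floating-point practice, the "modified" variant, which projects the partially reduced vector `w` instead of the original `v`, is usually preferred for numerical stability.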

Let's apply Gram-Schmidt to an example.

a=\begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix}, b=\begin{bmatrix} 1 \\ 0 \\ 2 \end{bmatrix}

A=\begin{bmatrix}1 \\ 1 \\ 1\end{bmatrix}

B=b-\frac{A^Tb}{A^TA}A=\begin{bmatrix} 1 \\ 0 \\ 2 \end{bmatrix}-\frac{3}{3}\begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix} = \begin{bmatrix} 0 \\ -1 \\ 1 \end{bmatrix}

q_1=\frac{A}{||A||}=\frac{1}{\sqrt3}\begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix}

q_2=\frac{B}{||B||}=\frac{1}{\sqrt{2}}\begin{bmatrix} 0 \\ -1 \\ 1 \end{bmatrix}

\therefore Q= \begin{bmatrix}1/\sqrt3 & 0 \\ 1/\sqrt3 & -1/\sqrt2\\ 1/\sqrt3 & 1/\sqrt2 \end{bmatrix}
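
As a quick check (a worked verification added here, not taken from the lecture), the two columns of this $Q$ are indeed orthonormal:

q_1^Tq_2=\frac{1}{\sqrt3}\cdot\frac{1}{\sqrt2}\left(1\cdot 0+1\cdot(-1)+1\cdot 1\right)=0, \quad q_1^Tq_1=\frac{1}{3}(1+1+1)=1, \quad q_2^Tq_2=\frac{1}{2}(0+1+1)=1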

The Gram-Schmidt process can also be expressed in matrix form, as follows.

A=QR \sim A=LU

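Just as elimination is summarized by $A=LU$, Gram-Schmidt is summarized by $A=QR$, where $R$ is an upper triangular matrix. Why is it upper triangular? Consider the following.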

A= \begin{bmatrix} | & | \\ a & b \\ | & | \end{bmatrix} = \underbrace{ \begin{bmatrix} | & | \\ q_1 & q_2 \\ | & | \end{bmatrix}}_Q R

Since $Q^TQ=I$, multiplying $A=QR$ by $Q^T$ gives $R=Q^TA$, so $R$ takes the following form.

\begin{bmatrix} a & b \\ \end{bmatrix} = \begin{bmatrix} q_1 & q_2 \end{bmatrix} \underbrace{ \begin{bmatrix} q_1^Ta & q_1^Tb\\ q_2^Ta & q_2^Tb \end{bmatrix} }_R

Here $q_2^Ta$ is 0, because $q_2$ is perpendicular (orthogonal) to $a=A$.
(In contrast, $q_1^Tb$ is generally not 0, because nothing forces $b$ to be perpendicular to the line through $A$ on which $q_1$ lies.)
Therefore the $R$ in $A=QR$ is upper triangular, like $U$.
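
As a concrete illustration, the sketch below computes $R=Q^TA$ for the worked example $a=(1,1,1)$, $b=(1,0,2)$, using the $q_1$, $q_2$ found earlier, and then rebuilds $a$ and $b$ from $Q$ and $R$. The variable names and the helper `dot` are assumptions for illustration only.

```python
import math

def dot(u, v):
    """Dot product of two vectors stored as lists."""
    return sum(x * y for x, y in zip(u, v))

# Columns of A from the worked example
a = [1.0, 1.0, 1.0]
b = [1.0, 0.0, 2.0]

# Orthonormal vectors produced by Gram-Schmidt for this example
q1 = [1 / math.sqrt(3)] * 3
q2 = [0.0, -1 / math.sqrt(2), 1 / math.sqrt(2)]

# R = Q^T A: entry (i, j) is q_i^T times column j of A
R = [[dot(q1, a), dot(q1, b)],
     [dot(q2, a), dot(q2, b)]]

# Rebuild the columns of A as combinations of q_1 and q_2 (A = QR)
a_rebuilt = [R[0][0] * u + R[1][0] * v for u, v in zip(q1, q2)]
b_rebuilt = [R[0][1] * u + R[1][1] * v for u, v in zip(q1, q2)]

print("R =", R)
print("a rebuilt:", a_rebuilt)
print("b rebuilt:", b_rebuilt)
```

Up to rounding, $R=\begin{bmatrix}\sqrt3 & \sqrt3 \\ 0 & \sqrt2\end{bmatrix}$: the entry below the diagonal vanishes, and the diagonal entries are the lengths $||A||$ and $||B||$.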
