Backpropagation in a 1D CNN, Understood Through Equations

Bean · May 7, 2025


This post walks through backpropagation in a 1D CNN step by step, using equations and a simple example.

1. A simple 1D CNN model

Setup

Input: $x = [x_1, x_2]$ (1 channel, length 2, so the kernel fits at exactly one position)
Kernel: $w = [w_1, w_2]$, bias: $b$
Output: $z = w_1 x_1 + w_2 x_2 + b$
Activation function: ReLU
Loss function: MSE (Mean Squared Error)
Target: $y$

2. Forward pass

Conv1D + Bias

Input: $x = [x_1, x_2]$,
weights: $w = [w_1, w_2]$,
bias: $b$

Convolution result:

$$z = w_1 x_1 + w_2 x_2 + b$$

ReLU

$$a = \text{ReLU}(z) = \max(0, z)$$

Output layer (identity, for simplicity):

$$\hat{y} = a$$
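The forward pass above can be sketched in plain Python. This is a minimal illustration, not code from the post: the function names `relu` and `forward` and the numeric values are assumptions chosen for the example.

```python
def relu(z: float) -> float:
    """ReLU activation: max(0, z)."""
    return max(0.0, z)


def forward(x, w, b):
    """Two-tap convolution at a single position, followed by ReLU.

    Returns both the pre-activation z and the prediction y_hat,
    because backpropagation needs z later.
    """
    z = w[0] * x[0] + w[1] * x[1] + b  # z = w_1 x_1 + w_2 x_2 + b
    a = relu(z)                        # a = ReLU(z)
    y_hat = a                          # identity output layer
    return z, y_hat


# Illustrative values only (assumed, not taken from the post).
z, y_hat = forward(x=[1.0, 2.0], w=[0.5, -0.25], b=0.5)
print(z, y_hat)  # 0.5 0.5
```

Keeping `z` alongside `y_hat` mirrors what frameworks do internally: intermediate values from the forward pass are cached for the backward pass.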

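3. Loss function (MSE)

For a single sample, the squared error is scaled by $\frac{1}{2}$, which cancels the factor of 2 that appears when differentiating:

$$\mathcal{L} = \frac{1}{2} (\hat{y} - y)^2$$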
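As a quick numeric illustration of this loss, here is a small sketch. The helper name `mse_loss` and the values are assumptions made for the example.

```python
def mse_loss(y_hat: float, y: float) -> float:
    """Single-sample squared error with the 1/2 convention."""
    return 0.5 * (y_hat - y) ** 2


# Illustrative values only: prediction 0.5, target 1.0.
loss = mse_loss(y_hat=0.5, y=1.0)
print(loss)  # 0.125
```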

4. Backpropagation

Step 1: Loss w.r.t. the output

$$\frac{\partial \mathcal{L}}{\partial \hat{y}} = \hat{y} - y$$

Step 2: Output w.r.t. the ReLU activation

$$\frac{\partial \hat{y}}{\partial a} = 1 \quad \Rightarrow \quad \frac{\partial \mathcal{L}}{\partial a} = \frac{\partial \mathcal{L}}{\partial \hat{y}} \cdot \frac{\partial \hat{y}}{\partial a} = \hat{y} - y$$

Step 3: ReLU w.r.t. the convolution result $z$

$$\frac{\partial a}{\partial z} = \begin{cases} 1 & \text{if } z > 0 \\ 0 & \text{otherwise} \end{cases} \quad \Rightarrow \quad \frac{\partial \mathcal{L}}{\partial z} = (\hat{y} - y) \cdot \mathbf{1}_{z > 0}$$
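The effect of the ReLU mask in Step 3 is easiest to see with two cases, one with $z > 0$ and one with $z \le 0$. The sketch below uses assumed helper names (`relu_grad`, `grad_wrt_z`) and illustrative values; treating the derivative at exactly $z = 0$ as 0 follows the "otherwise" branch above.

```python
def relu_grad(z: float) -> float:
    """Derivative of ReLU; at z == 0 the value 0 is used by convention."""
    return 1.0 if z > 0 else 0.0


def grad_wrt_z(z: float, y: float) -> float:
    """dL/dz = (y_hat - y) * 1[z > 0], with y_hat = ReLU(z)."""
    y_hat = max(0.0, z)
    return (y_hat - y) * relu_grad(z)


# Active unit (z > 0): the error signal passes through unchanged.
print(grad_wrt_z(z=0.5, y=1.0))  # -0.5

# Inactive unit (z <= 0): the gradient is blocked, so nothing upstream learns.
assert grad_wrt_z(z=-3.0, y=1.0) == 0.0
```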

Step 4: Convolution result w.r.t. the weights, inputs, and bias

$\frac{\partial z}{\partial w_1} = x_1$
$\frac{\partial z}{\partial w_2} = x_2$
$\frac{\partial z}{\partial b} = 1$
$\frac{\partial z}{\partial x_1} = w_1$
$\frac{\partial z}{\partial x_2} = w_2$

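Therefore, by the chain rule:

$\frac{\partial \mathcal{L}}{\partial w_1} = \frac{\partial \mathcal{L}}{\partial z} \cdot x_1$
$\frac{\partial \mathcal{L}}{\partial w_2} = \frac{\partial \mathcal{L}}{\partial z} \cdot x_2$
$\frac{\partial \mathcal{L}}{\partial b} = \frac{\partial \mathcal{L}}{\partial z}$
$\frac{\partial \mathcal{L}}{\partial x_1} = \frac{\partial \mathcal{L}}{\partial z} \cdot w_1$
$\frac{\partial \mathcal{L}}{\partial x_2} = \frac{\partial \mathcal{L}}{\partial z} \cdot w_2$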
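One way to sanity-check these formulas is to compare them against a finite-difference estimate. The sketch below is an illustration under assumed names (`loss_fn`, `analytic_grads`, `numeric_grad_w`) and assumed values; the input gradient uses $\frac{\partial z}{\partial x_i} = w_i$ from Step 4.

```python
def loss_fn(x, w, b, y):
    """L = 0.5 * (ReLU(w_1 x_1 + w_2 x_2 + b) - y)^2."""
    z = w[0] * x[0] + w[1] * x[1] + b
    return 0.5 * (max(0.0, z) - y) ** 2


def analytic_grads(x, w, b, y):
    """Gradients obtained from the chain rule in Steps 1 to 4."""
    z = w[0] * x[0] + w[1] * x[1] + b
    y_hat = max(0.0, z)
    dz = (y_hat - y) * (1.0 if z > 0 else 0.0)  # dL/dz
    dw = [dz * x[0], dz * x[1]]                 # dL/dw_i = dL/dz * x_i
    db = dz                                     # dL/db   = dL/dz
    dx = [dz * w[0], dz * w[1]]                 # dL/dx_i = dL/dz * w_i
    return dw, db, dx


def numeric_grad_w(x, w, b, y, i, h=1e-6):
    """Central finite difference estimate of dL/dw_i."""
    w_plus, w_minus = list(w), list(w)
    w_plus[i] += h
    w_minus[i] -= h
    return (loss_fn(x, w_plus, b, y) - loss_fn(x, w_minus, b, y)) / (2 * h)


# Illustrative values only (assumed, not taken from the post).
x, w, b, y = [1.0, 2.0], [0.5, -0.25], 0.5, 1.0
dw, db, dx = analytic_grads(x, w, b, y)
print(dw, db, dx)  # [-0.5, -1.0] -0.5 [-0.25, 0.125]

# The analytic weight gradients should agree with the numeric estimates.
for i in range(2):
    assert abs(numeric_grad_w(x, w, b, y, i) - dw[i]) < 1e-6
```

The finite-difference check is only reliable away from $z = 0$, where ReLU has a kink; the values here give $z = 0.5$.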

5. One-line summary

For $i = 1, 2$:

$$\frac{\partial \mathcal{L}}{\partial w_i} = (\hat{y} - y) \cdot \mathbf{1}_{z > 0} \cdot x_i$$

$$\frac{\partial \mathcal{L}}{\partial b} = (\hat{y} - y) \cdot \mathbf{1}_{z > 0}$$

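In this way, the **chain rule** multiplies the derivatives of each stage to obtain the gradient of the loss with respect to each weight, and that gradient is then used to update the weights (for example, by gradient descent).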
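To make the update step concrete, here is a minimal sketch of plain gradient descent on this one-sample model. The post does not specify an optimizer or learning rate, so the function name `sgd_step`, the learning rate of 0.1, the step count, and the data values are all assumptions for illustration.

```python
def sgd_step(x, w, b, y, lr=0.1):
    """One gradient-descent step using the summary formulas.

    lr is an assumed learning rate, not a value from the post.
    Returns the updated weights, updated bias, and the loss before the update.
    """
    z = w[0] * x[0] + w[1] * x[1] + b
    y_hat = max(0.0, z)
    dz = (y_hat - y) * (1.0 if z > 0 else 0.0)  # (y_hat - y) * 1[z > 0]
    new_w = [w[0] - lr * dz * x[0], w[1] - lr * dz * x[1]]
    new_b = b - lr * dz
    return new_w, new_b, 0.5 * (y_hat - y) ** 2


# Illustrative data and initial parameters (assumed).
x, y = [1.0, 2.0], 1.0
w, b = [0.5, -0.25], 0.5

losses = []
for _ in range(20):
    w, b, loss = sgd_step(x, w, b, y)
    losses.append(loss)

print(losses[0], losses[-1])  # the loss shrinks toward zero
```

If the initial parameters give $z \le 0$, the mask $\mathbf{1}_{z > 0}$ zeroes every gradient and this loop makes no progress, which is the "dead ReLU" situation.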
