Generative Adversarial Network

Roh's warehouse · September 22, 2025

Introduction to DL

Generative Adversarial Network (GAN)

Whereas the variational autoencoder (VAE) is an explicit model that fixes the form of the sample distribution in advance and then learns it, the generative adversarial network (GAN) is an implicit model that needs no assumption about the density function.

A GAN consists of the following two components.

Generator: produces fake samples, with the goal of fooling the discriminator
Discriminator: aims to tell real samples apart from fake ones

A GAN trains the generator and the discriminator together, which pushes the generator to produce increasingly realistic samples.

Min-max Objective Function

GAN is built on the idea of a two-player game from game theory, and its training is likewise carried out through a min-max objective function.

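Let $G_{\theta_g}(z)$ denote the fake sample produced by the generator with parameters $\theta_g$, where $z$ is random noise that is easy to sample. Let $D_{\theta_d}(\cdot)$ denote the output of the discriminator with parameters $\theta_d$. Its value lies between 0 and 1, where 1 means the input is judged to be real and 0 means it is judged to be fake.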
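To make the notation concrete, here is a minimal sketch of the two functions in plain Python. The affine generator, the logistic discriminator, and the parameter values are all assumptions chosen for illustration. A real GAN would use neural networks for both.

```python
import math
import random

def generator(z, theta_g):
    """G_{theta_g}(z): map noise z to a fake sample (toy affine model, an assumption)."""
    a, b = theta_g
    return a * z + b

def discriminator(x, theta_d):
    """D_{theta_d}(x): estimated probability that x is real (toy logistic model)."""
    w, c = theta_d
    return 1.0 / (1.0 + math.exp(-(w * x + c)))

random.seed(0)
theta_g = (1.0, 0.0)   # hypothetical generator parameters
theta_d = (0.5, -1.0)  # hypothetical discriminator parameters

z = random.gauss(0.0, 1.0)            # z: noise that is easy to sample
fake = generator(z, theta_g)          # G(z): a fake sample
score = discriminator(fake, theta_d)  # D(G(z)): always strictly between 0 and 1
print(f"G(z) = {fake:.3f}, D(G(z)) = {score:.3f}")
```

The only property the rest of the discussion relies on is that the discriminator returns a value between 0 and 1, so that its logarithm is well defined.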

In this setting, the min-max objective function of a GAN is as follows.

$$\min_{\theta_g} \max_{\theta_d} \left[ \mathbb{E}_{x\sim p_{data}} \log D_{\theta_d}(x) + \mathbb{E}_{z\sim p(z)} \log (1-D_{\theta_d}(G_{\theta_g}(z))) \right]$$

Looking at this expression, the discriminator tries to push $D(x)$ toward 1 and $D(G(z))$ toward 0 (that is, to classify real and fake samples correctly), while the generator tries to push $D(G(z))$ toward 1 (that is, to pass fake samples off as real).
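A small numerical check can make this tug-of-war visible. The sketch below estimates the bracketed term of the objective by averaging over a handful of discriminator outputs. The function name `value_function` and the output values are hypothetical and chosen only for illustration.

```python
import math

def value_function(d_real, d_fake):
    """Estimate E[log D(x)] + E[log(1 - D(G(z)))] from discriminator outputs.

    d_real holds D(x) on real samples and d_fake holds D(G(z)) on fake samples.
    All values are assumed to lie strictly between 0 and 1.
    """
    real_term = sum(math.log(p) for p in d_real) / len(d_real)
    fake_term = sum(math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return real_term + fake_term

# Hypothetical discriminator outputs, not results from a trained model.
sharp = value_function(d_real=[0.9, 0.95], d_fake=[0.1, 0.05])   # D separates well
fooled = value_function(d_real=[0.9, 0.95], d_fake=[0.9, 0.95])  # G fools D
chance = value_function(d_real=[0.5, 0.5], d_fake=[0.5, 0.5])    # D only guesses

print(f"sharp D: {sharp:.3f}, guessing D: {chance:.3f}, fooled D: {fooled:.3f}")
```

The value is highest when the discriminator separates real from fake, which is what the inner max rewards, and lowest when the generator fools it, which is what the outer min rewards. A discriminator that outputs 0.5 everywhere gives $-2\log 2 \approx -1.386$.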

Training GANs

A GAN is trained by repeatedly alternating the following two steps.

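Apply gradient ascent to the discriminator $\theta_d$ (with $\theta_g$ fixed)
$\max_{\theta_d} \left[ \mathbb{E}_{x\sim p_{data}} \log D_{\theta_d}(x) + \mathbb{E}_{z\sim p(z)} \log (1-D_{\theta_d}(G_{\theta_g}(z))) \right]$
Apply gradient ascent to the generator $\theta_g$ (with $\theta_d$ fixed)
$\max_{\theta_g} \mathbb{E}_{z\sim p(z)} \log (D_{\theta_d}(G_{\theta_g}(z)))$

Note that the min-max objective itself would have the generator minimize $\mathbb{E}_{z\sim p(z)} \log (1-D_{\theta_d}(G_{\theta_g}(z)))$. Maximizing $\log D_{\theta_d}(G_{\theta_g}(z))$ instead pursues the same goal of pushing $D(G(z))$ toward 1, and it gives stronger gradients early in training, when the discriminator rejects fake samples easily.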
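The alternating procedure can be sketched on a one-dimensional toy problem using only the Python standard library. Everything specific here is an assumption made for illustration: real data drawn from $N(0, 1)$, a generator $G(z) = z + b$ that learns only a shift $b$, a logistic discriminator $D(x) = \sigma(wx + c)$, hand-derived gradients, and the hypothetical function name `train_toy_gan`.

```python
import math
import random

def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))

def train_toy_gan(steps=1000, batch=64, lr=0.1, seed=0):
    """Alternate the two gradient-ascent steps on a 1-D toy problem.

    Assumptions (not from the text): real data ~ N(0, 1), noise z ~ N(0, 1),
    G(z) = z + b with theta_g = b, D(x) = sigmoid(w * x + c) with theta_d = (w, c).
    """
    rng = random.Random(seed)
    b = 3.0          # generator starts far from the real mean of 0
    w, c = 0.0, 0.0  # discriminator starts at D(x) = 0.5 everywhere

    for _ in range(steps):
        # Step 1: gradient ascent on theta_d with theta_g fixed.
        real = [rng.gauss(0.0, 1.0) for _ in range(batch)]
        fake = [rng.gauss(0.0, 1.0) + b for _ in range(batch)]
        grad_w = grad_c = 0.0
        for x in real:  # gradient of log D(x) with respect to (w, c)
            err = 1.0 - sigmoid(w * x + c)
            grad_w += err * x
            grad_c += err
        for y in fake:  # gradient of log(1 - D(G(z))) with respect to (w, c)
            p = sigmoid(w * y + c)
            grad_w -= p * y
            grad_c -= p
        w += lr * grad_w / batch
        c += lr * grad_c / batch

        # Step 2: gradient ascent on theta_g with theta_d fixed.
        fake = [rng.gauss(0.0, 1.0) + b for _ in range(batch)]
        # Gradient of log D(z + b) with respect to b is (1 - D(z + b)) * w.
        grad_b = sum((1.0 - sigmoid(w * y + c)) * w for y in fake) / batch
        b += lr * grad_b

    return b, w, c

b, w, c = train_toy_gan()
print(f"learned shift b = {b:.3f} (real mean is 0), w = {w:.3f}, c = {c:.3f}")
```

In this toy setup the shift $b$ should drift from its starting value of 3 toward the real mean of 0, at which point the discriminator can no longer tell the two distributions apart. Because a linear discriminator can only compare means, this sketch says nothing about matching richer distributions.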