[Paper Review] Generative Adversarial Nets (GAN)

No Markov chains or unrolled approximate inference networks are needed during either training or sample generation.
The paper demonstrates the potential of the framework through qualitative and quantitative evaluation of the generated samples.

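: A GAN trains two models simultaneously: a generative model G and a discriminative model D.
The generative model captures the data distribution, and the discriminative model returns the probability that an incoming sample came from the training data rather than from the generative model.
In other words, it decides whether a sample is real or a generated fake.
The generative model is trained to maximize the probability that the discriminative model makes a mistake.
Once training has converged to the ideal solution, the discriminator outputs 1/2 for every input.
That is, it can no longer tell real from fake, so each sample is judged real with probability 1/2.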

3. Adversarial nets

Properties of adversarial nets

The framework is most straightforward to apply when both models are multilayer perceptrons.

What are adversarial nets

To learn the generator's distribution $p_g$ over data $x$:

First, we define a prior on input noise variables, $p_z(z)$
$p_z(z)$ : the noise distribution; a noise sample $z$ is drawn from it and fed to the generative model as input
Then we represent a mapping to data space as $G(z; \theta_g)$
$G(z; \theta_g)$ : the output of the generative model with weights $\theta_g$ when noise $z$ is given as input
: What is G? A differentiable function represented by a multilayer perceptron with parameters $\theta_g$
Second, we define a second multilayer perceptron $D(x; \theta_d)$
$D(x; \theta_d)$ : the output of the discriminative model with weights $\theta_d$ when $x$ is given as input
D's output is a single scalar
$D(x)$ represents the probability that $x$ came from the data rather than from $p_g$
We train D to maximize the probability of assigning the correct label to both training examples and samples from G
Simultaneously, we train G to minimize $\log(1 - D(G(z)))$
: In other words, D and G play a two-player minimax game with value function $V(D, G)$
The expression for $V(D, G)$ is:
$$\min_G \max_D V(D, G) = \mathbb{E}_{x \sim p_{\text{data}}(x)}[\log D(x)] + \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z)))]$$
Next, the paper presents a theoretical analysis of adversarial nets.
It shows that the training criterion allows one to recover the data generating distribution as G and D are given enough capacity, i.e., in the non-parametric limit.

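Summary

The most straightforward way to implement a GAN is to build both the discriminative and the generative model as MLPs.
To learn the generator's distribution $p_g$ over data $x$, we first define a noise prior $p_z(z)$. We then represent a mapping to data space as $G(z; \theta_g)$, and define a second model $D(x; \theta_d)$. $D(x)$ is the probability that $x$ came from the data rather than from $p_g$.
D is trained to maximize the probability of assigning the correct label to both training examples and samples from G. At the same time, G is trained to minimize $\log(1 - D(G(z)))$.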
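
To make the value function concrete, here is a minimal sketch that estimates V(D, G) by Monte Carlo on a 1-D toy problem. Everything specific in it is an assumption made for illustration and does not come from the paper: the real data is N(3, 1), G is an affine map of the noise, D is a logistic regression, and the helper names are hypothetical.

```python
import math
import random

def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))

def generator(z, theta_g):
    """Toy G(z; theta_g): an affine map from noise space to data space (assumption)."""
    a, b = theta_g
    return a * z + b

def discriminator(x, theta_d):
    """Toy D(x; theta_d): logistic regression, returns a single scalar in (0, 1)."""
    w, c = theta_d
    return sigmoid(w * x + c)

def value_function(real_samples, noise_samples, theta_d, theta_g):
    """Monte Carlo estimate of E_x[log D(x)] + E_z[log(1 - D(G(z)))]."""
    real_term = sum(math.log(discriminator(x, theta_d))
                    for x in real_samples) / len(real_samples)
    fake_term = sum(math.log(1.0 - discriminator(generator(z, theta_g), theta_d))
                    for z in noise_samples) / len(noise_samples)
    return real_term + fake_term

rng = random.Random(0)
real = [rng.gauss(3.0, 1.0) for _ in range(1000)]   # assumed toy data distribution
noise = [rng.gauss(0.0, 1.0) for _ in range(1000)]  # samples from the prior p_z

theta_g = (1.0, 0.0)  # G(z) = z, so generated samples follow N(0, 1)

# A D that ignores its input outputs 1/2 everywhere, so V = 2 * log(1/2) = -log 4.
v_blind = value_function(real, noise, (0.0, 0.0), theta_g)

# For N(3, 1) vs N(0, 1) the log density ratio is 3x - 4.5, so this D is a much
# better discriminator and reaches a higher value.
v_sharp = value_function(real, noise, (3.0, -4.5), theta_g)

print(v_blind, v_sharp)
```

D tries to push this number up, while G tries to push the second term down by moving its samples toward regions where D outputs values close to 1.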

4. Theoretical Results

: The generator G implicitly defines a probability distribution $p_g$ as the distribution of the samples $G(z)$ obtained when $z \sim p_z$
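
"Implicitly defines" means that the density of p_g is never written down; it can only be sampled by pushing noise through G. A small sketch under assumed choices (a standard normal prior and a hypothetical affine generator G(z) = 2z + 1, for which p_g happens to be N(1, 4)):

```python
import random
import statistics

def generator(z, theta_g=(2.0, 1.0)):
    """Hypothetical G(z) = a*z + b; with z ~ N(0, 1) its samples follow N(b, a^2)."""
    a, b = theta_g
    return a * z + b

rng = random.Random(0)

# Sampling from p_g: draw z from the prior p_z, then map it through G.
samples = [generator(rng.gauss(0.0, 1.0)) for _ in range(20000)]

sample_mean = statistics.fmean(samples)
sample_std = statistics.pstdev(samples)
print(sample_mean, sample_std)  # close to 1 and 2 for this toy generator
```

For a multilayer perceptron G the same two steps apply, but p_g no longer has a simple closed form, which is why the paper works with samples instead of densities.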

Algorithm 1

Algorithm 1 converges to a good estimator of $p_{\text{data}}$ if given enough capacity and training time.
It is minibatch stochastic gradient descent training of generative adversarial nets.
The number of steps to apply to the discriminator, k, is a hyperparameter.
The authors used k = 1 in their experiments, the least expensive option.
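
The alternating structure of Algorithm 1 (k discriminator ascent steps, then one generator descent step) can be sketched on a 1-D toy problem with hand-derived gradients. This is only an illustration, not the paper's setup: the data distribution N(3, 1), the affine G, the logistic D, the plain gradient steps, and all hyperparameter values are assumptions.

```python
import math
import random

def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))

def G(z, theta_g):
    a, b = theta_g
    return a * z + b

def D(x, theta_d):
    w, c = theta_d
    return sigmoid(w * x + c)

def value(xs, zs, theta_d, theta_g):
    """Minibatch estimate of V(D, G)."""
    real = sum(math.log(D(x, theta_d)) for x in xs) / len(xs)
    fake = sum(math.log(1.0 - D(G(z, theta_g), theta_d)) for z in zs) / len(zs)
    return real + fake

def discriminator_step(xs, zs, theta_d, theta_g, lr):
    """One gradient ASCENT step on V with respect to theta_d."""
    w, c = theta_d
    m = len(xs)
    grad_w = grad_c = 0.0
    for x, z in zip(xs, zs):
        g = G(z, theta_g)
        d_real, d_fake = D(x, theta_d), D(g, theta_d)
        # d/dw log D(x) = (1 - D(x)) * x ; d/dw log(1 - D(g)) = -D(g) * g
        grad_w += ((1.0 - d_real) * x - d_fake * g) / m
        grad_c += ((1.0 - d_real) - d_fake) / m
    return (w + lr * grad_w, c + lr * grad_c)

def generator_step(zs, theta_d, theta_g, lr):
    """One gradient DESCENT step on mean log(1 - D(G(z))) with respect to theta_g."""
    a, b = theta_g
    w, _ = theta_d
    m = len(zs)
    grad_a = grad_b = 0.0
    for z in zs:
        d_fake = D(G(z, theta_g), theta_d)
        # d/dg log(1 - D(g)) = -D(g) * w ; chain rule with dg/da = z, dg/db = 1
        grad_a += (-d_fake * w * z) / m
        grad_b += (-d_fake * w) / m
    return (a - lr * grad_a, b - lr * grad_b)

def train(iterations=200, k=1, m=64, lr=0.05, seed=0):
    rng = random.Random(seed)
    theta_d, theta_g = (0.1, 0.0), (1.0, 0.0)
    for _ in range(iterations):
        for _ in range(k):  # k discriminator updates per generator update
            xs = [rng.gauss(3.0, 1.0) for _ in range(m)]  # minibatch of (toy) data
            zs = [rng.gauss(0.0, 1.0) for _ in range(m)]  # minibatch of noise
            theta_d = discriminator_step(xs, zs, theta_d, theta_g, lr)
        zs = [rng.gauss(0.0, 1.0) for _ in range(m)]      # fresh noise for G
        theta_g = generator_step(zs, theta_d, theta_g, lr)
    return theta_d, theta_g

theta_d, theta_g = train()
print(theta_d, theta_g)
```

The sketch makes no claim about how well this toy run converges; its purpose is to show where k enters the loop and that D ascends on V while G descends on the second term only.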

4.1 Global Optimality of $p_g = p_{\text{data}}$

First, consider the optimal discriminator D for any given generator G.
Proposition 1. For G fixed, the optimal discriminator is $D^*_G(x) = \frac{p_{\text{data}}(x)}{p_{\text{data}}(x) + p_g(x)}$
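
The reasoning behind Proposition 1 fits in a few lines. The sketch below follows the argument given in the paper; the shorthand $a$, $b$, $y$ stands for $p_{\text{data}}(x)$, $p_g(x)$, and $D(x)$ at a fixed point $x$.

$$
\begin{aligned}
V(G, D) &= \int_x \big[\, p_{\text{data}}(x) \log D(x) + p_g(x) \log(1 - D(x)) \,\big]\, dx \\
f(y) &= a \log y + b \log(1 - y), \qquad (a, b) \neq (0, 0),\; y \in [0, 1] \\
f'(y) &= \frac{a}{y} - \frac{b}{1 - y} = 0 \;\Longrightarrow\; y = \frac{a}{a + b}
\end{aligned}
$$

Because $f$ is concave, this stationary point is its maximum, so maximizing the integrand pointwise gives $D^*_G(x) = \frac{p_{\text{data}}(x)}{p_{\text{data}}(x) + p_g(x)}$. When $p_g = p_{\text{data}}$ this equals $1/2$ for every $x$.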

4.2 Convergence of Algorithm 1

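If the generative and discriminative models have enough capacity (that is, they are expressive enough), and at each step of Algorithm 1 the discriminator is allowed to reach its optimum for the given generative model, and $p_g$ is updated so as to improve the criterion below, then $p_g$ converges to $p_{\text{data}}$:
$$\mathbb{E}_{x \sim p_{\text{data}}}[\log D^*_G(x)] + \mathbb{E}_{x \sim p_g}[\log(1 - D^*_G(x))]$$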
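
This criterion can be checked numerically on small discrete distributions. The paper's Theorem 1 shows that it equals -log 4 + 2 · JSD(p_data ‖ p_g), so its minimum over p_g is -log 4, reached only at p_g = p_data. The sketch below verifies both facts; the three-outcome distributions are arbitrary values chosen for illustration.

```python
import math

def optimal_discriminator(p_data, p_g):
    """D*_G(x) = p_data(x) / (p_data(x) + p_g(x)), evaluated per outcome."""
    return [pd / (pd + pg) for pd, pg in zip(p_data, p_g)]

def criterion(p_data, p_g):
    """E_{x~p_data}[log D*_G(x)] + E_{x~p_g}[log(1 - D*_G(x))]."""
    d_star = optimal_discriminator(p_data, p_g)
    return sum(pd * math.log(d) + pg * math.log(1.0 - d)
               for pd, pg, d in zip(p_data, p_g, d_star))

def kl(p, q):
    """Kullback-Leibler divergence KL(p || q) for discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def jsd(p, q):
    """Jensen-Shannon divergence between p and q."""
    mid = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, mid) + 0.5 * kl(q, mid)

p_data = [0.5, 0.3, 0.2]   # illustrative data distribution (assumption)
p_g_bad = [0.2, 0.2, 0.6]  # a generator distribution that does not match it

c_matched = criterion(p_data, p_data)  # generator has recovered p_data
c_bad = criterion(p_data, p_g_bad)

print(c_matched, -math.log(4))
print(c_bad, -math.log(4) + 2 * jsd(p_data, p_g_bad))
```

Each printed pair should agree up to floating-point error, and the mismatched generator gives a strictly larger criterion than the matched one.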

6. Advantages and disadvantages

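Advantages

Markov chains are never needed; the model is trained with backpropagation alone.
What is a Markov chain? A stochastic process in which the next state depends only on the current state.
No inference is needed during learning.

Disadvantages

There is no explicit representation of the generative model's distribution $p_g(x)$.
The discriminative model must be kept well synchronized with the generative model during training.
In particular, the generative model must not be trained too much before the discriminative model has been sufficiently trained.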

Painters with distinctive styles

Picasso

Representative works: Guernica, Les Demoiselles d'Avignon
https://oylee.tistory.com/entry/%ED%94%BC%EC%B9%B4%EC%86%8C-%EA%B7%B8%EB%A6%BC-%EB%AA%A8%EC%9D%8C

Monet

Representative work: Women in the Garden
https://m.blog.naver.com/PostView.nhn?blogId=flower7644s&logNo=220656802797&proxyReferer=https:%2F%2Fwww.google.com%2F

Dalí

Representative work: The Persistence of Memory
https://m.cafe.daum.net/yunwhd8932/qAb6/42

Rembrandt

http://blog.daum.net/gallerystore/18353860

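Van Gogh, Munch

Things to consider
: Painters tend to split into those who mainly painted portraits and those who mainly painted landscapes. For an actual implementation, focusing on landscapes rather than portraits will probably be easier.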



Paper review

Ian J. Goodfellow, [Generative Adversarial Nets]

Table of contents

Functions, variables, and symbols to know before the review

D : discriminative model

0. Abstract

What are the models G and D

A unique solution exists

GAN's properties

Overall summary