Test Time Train Landscape Image using [Flickr Segmenter & Flickr SPADE]

temp·2021년 11월 27일

AI Deep Learning Generative Model SPADE segmentation

Generative Model

목록 보기

26/30

$ train.py --name Flickr 
--dataset_mode custom --label_dir gm_TTTT/test_label 
--image_dir gm_TTTT/test_img 
--continue_train --load_from_opt_file --gpu_ids=1 --niter 1000 
--save_epoch_freq 100 
--batchSize 1 --lr 0.0005 
--preprocess_mode resize_and_crop --which_epoch 50 --print_freq 10

50epoch : base.

초당 2 update.

학습 전(use pre-trained model)

그림자 반영 x

결과

순서대로 25초, 75초, 125초, 175초, ... (최적화 전)

최적화:
Training Rootine 수정
learning rate 수정
optimizer 수정
Loss함수 수정
...
(빠른 reconstruction에 초점)

개별 실행(Using Modified Label)

image : 001.jpg
label : 001.png + 001_01.png ~ 001_06.png (total 7)
(단, SPADE 실행 시에는 image와 label의 파일 명이 같아야 한다.)

나중에 모듈 만들어서 조금 더 간편하게 inference 하는걸로.

Point

(Epoch이 지남에 따라)
1. 원본 이미지를 얼마나 잘 유지하는가
2. SPADE 모델의 성능이 얼마나 잘 유지되는가

원본 이미지에 없는 Object도 잘 생성해야한다.
다양한 형태의 Segmentation mask도 잘 아우러야 한다.

Inference(pre-trained model 원본)

$ python test.py --name Flickr --dataset_mode custom --load_from_opt_file --gpu_ids -1 --which_epoch 50 --image_dir gm_TTTT/test_img --label_dir gm_TTTT/test_label

x 7 반복

inference time : model load 포함 4초 가량(cpu)

원본

Inference(300 epoch)

$ python test.py --name Flickr --dataset_mode custom --load_from_opt_file --gpu_ids -1 --which_epoch 300 --image_dir gm_TTTT/test_img --label_dir gm_TTTT/test_label

원본

Inference(700 epoch)

$ python test.py --name Flickr --dataset_mode custom --load_from_opt_file --gpu_ids -1 --which_epoch 700 --image_dir gm_TTTT/test_img --label_dir gm_TTTT/test_label

원본