시리즈

Capstone

1.[Capstone #1] RAG : Retrieval-Augmented Generation

출처 : https://eugeneyan.com/writing/llm-patterns/

2024년 1월 27일

2.[Capstone #2] E3 TTS: Easy end-to-end Diffusion-based Text to Speech

Paper : https://arxiv.org/abs/2311.00945

2024년 3월 8일

3.[Capstone #3] Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Paper : https://arxiv.org/abs/2205.11487

2024년 3월 9일

4.[Capstone #4] OverFlow: Putting flows on top of neural transducers for better TTS

Paper : https://arxiv.org/abs/2211.06892

2024년 3월 15일

5.[Capstone #5] Matcha-TTS: A fast TTS architecture with conditional flow matching

Paper : https://arxiv.org/abs/2309.03199

2024년 3월 22일

6.[Capstone #6] VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Paper : https://arxiv.org/abs/2307.16430

2024년 3월 29일

7.[Capstone #7] Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Paper : https://arxiv.org/abs/2106.06103

2024년 4월 19일

8.[Capstone #8] SK TECH SUMMIT 2023: VITS2

Youtube : https://www.youtube.com/watch?v=Abov0q9T4jU:

2024년 4월 26일

9.[Capstone #9] Sound Design Strategies for Latent Audio Space Explorations Using Deep Learning Architectures

Paper : https://arxiv.org/abs/2305.15571

2024년 5월 10일

10.[Capstone #10] Encoding Speaker-Specific Latent Speech Feature for Speech Synthesis

Paper : https://arxiv.org/abs/2311.11745

2024년 5월 18일