Speech (ASR/TTS)

1.[Speech] Automatic Speech Recognition Overview

post-thumbnail

2.[Speech] Acoustic Phonetics

post-thumbnail

3.[Speech] Feature Extraction- Fourier Transform

post-thumbnail

4.[Speech] Feature Extraction - MFCCs

post-thumbnail

5.[Speech] Korean Phonology

post-thumbnail

6.[ASR]SincNet: Speaker Recognition From Raw waveform with SincNet

post-thumbnail

7.[ASR] Baum-Welch algorithm

post-thumbnail

8.[ASR] 은닉마코프모델(Hidden Markov Models)

post-thumbnail

9.[ASR]wav2vec: Unsupervised Pre-training for Speech Recognition (2019)

post-thumbnail

10.[ASR]PASE: Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks & PASE+

post-thumbnail

11.[ASR] Gaussian Mixture Model

post-thumbnail

12.[ASR]Listen, Attend and Spell (2015)

post-thumbnail

13.[ASR] Speech Model Pre-training for End-to-End Spoken Language Understanding

post-thumbnail

14.[TTS] Viterbi Algorithm

post-thumbnail

15.[TTS] Tacotron1, Tacotron2

post-thumbnail