SPEECH PROCESSING

1.[기본] 오디오 형식을 변환하는 다양한 방법들

post-thumbnail

2.[기본] Transformers - Tokenizer

post-thumbnail

3.모르는 언어가 훈련대상일 때 사전조사하기

post-thumbnail

4.[datasets] 300G 넘는 오디오 파일들 빠르게 로딩하기

post-thumbnail

6.[Kaldi] 오류들 간단 해결

post-thumbnail

7.[speaker verification] basics

post-thumbnail

8.[SV] Exploring wav2vec 2.0 on speaker verification and language identification

post-thumbnail

9.[SV] VoxCeleb: a large-scale speaker identification dataset

post-thumbnail

10.microphones

post-thumbnail

11.[speaker diarization] basics

post-thumbnail

12.Google STT + Langchain

post-thumbnail

13.Google TTS

post-thumbnail