train.log
로그인
train.log
로그인
[speaker verification] basics
Willow
·
2024년 1월 16일
팔로우
0
개인공부
0
SPEECH PROCESSING
목록 보기
7/13
high performance necessary under 'real world' conditions
difficulties
intrinsic: age, emotion, manner of speaking
extrinsic: background noise, reverberation, channel/mic
speaker identification: mapping a given utterance to a speaker (open set vs. closed set)
makes "closed set" a "multi-class classification"
classification loss
speaker verification: mapping a given utterance to a target model
contrastive loss (learn the embedding, rather than computing distance, e.g. Siamese)
a portion of data (=test set) left for unseen POIs
practice
model
average pooling
fixed input length
metric
EER
terms:
POI: person of interest
Willow
Speech Processing/AI/Linguistics/CS/etc.
팔로우
이전 포스트
[Kaldi] 오류들 간단 해결
다음 포스트
[SV] Exploring wav2vec 2.0 on speaker verification and language identification
0개의 댓글
댓글 작성