๐Ÿ”ฎ ๊ฐ์ • ์Œ์„ฑ ๋ฐ์ดํ„ฐ ๋ถ„์„: ๊ณผ์ ํ•ฉ ๋ฐฉ์ง€ ๊ณ ์ฐฐ

Theo Kimยท2023๋…„ 9์›” 26์ผ
0
post-thumbnail

์‹ค์‹œ๊ฐ„ ๊ฐ์ •๋ถ„์„ ํ™”์ƒํšŒ์˜ ๋ฐ ๊ฐ์ •์ƒํ™ฉ ํšŒ์˜๋ก ์„œ๋น„์Šค

ํ”„๋กœ์ ํŠธ ๊ด€๋ จ ๋งํฌ

๋ฐœํ‘œ์šฉ ์Šฌ๋ผ์ด๋“œ: https://docs.google.com/presentation/d/1ysadmKWzAK9TvJ8QxTdkTegd7T5-nAVor1_x6cmYPDI/edit#slide=id.p

โœจ ํ”„๋กœ์ ํŠธ ์†Œ๊ฐœ

์Œ์„ฑ๋ฐ์ดํ„ฐ์—์„œ ํŠน์ง•์„ ์ถ”์ถœํ•˜์—ฌ ๊ธฐ์จ, ๋‹นํ™ฉ, ๋ถ„๋…ธ, ๋ถˆ์•ˆ, ์Šฌํ”” ์ด 5๊ฐœ์˜ ๊ฐ์ •์„ ๋ถ„๋ฅ˜ํ•˜๋Š” ๋ชจ๋ธ์„ ์„ค๊ณ„

  1. mfcc์™€ mel spectrogram์„ ์ด์šฉํ•˜์—ฌ ์Œ์„ฑ๋ฐ์ดํ„ฐ์—์„œ ํŠน์ง•์„ ์ถ”์ถœ

  1. LSTM ๋ชจ๋ธ ์‚ฌ์šฉ

-> ๊ณผ์ ํ•ฉ ๋ฐœ์ƒ !

  1. ์Œ์„ฑ๋ฐ์ดํ„ฐ๋ฅผ 8๋“ฑ๋ถ„ ํ›„ ๋žœ๋ค์œผ๋กœ ๋ณ‘ํ•ฉ

https://user-images.githubusercontent.com/58973535/228573752-ca3aaa49-0efd-4321-a439-94151d6f7fba.mp4

  1. ๋žœ๋ค์œผ๋กœ ์ด์–ด๋ถ™์ธ ์Œ์„ฑ๋ฐ์ดํ„ฐ์—์„œ mfcc์™€ mel spectrogram์„ ์ด์šฉํ•˜์—ฌ ํŠน์ง• ์ถ”์ถœ

  2. LSTM, ResNet, Efficient Net, Random Forest ๋ชจ๋ธ์„ ์‚ฌ์šฉ

๋ชจ๋ธ๋ช…train accuracytest accuracytop-2 accuracy
LSTM0.60870.40730.7120
ResNet0.62130.46530.6967
EfficientNet0.51700.44870.6947
RandomForset--0.4107--

๐Ÿ“œ ๊ธฐ์ˆ  ์Šคํƒ

์ฝ”๋“œ๋Š” Github

https://github.com/taeho8271/speech_data_emotions_recog/blob/master/final_project_for_paper.ipynb

profile
THEO's velog

0๊ฐœ์˜ ๋Œ“๊ธ€