Naive Bayes - Count vectorization, TF-IDF

최혜주 · December 20, 2024

Using raw string text in machine learning models
= "Natural Language Processing" (NLP): supervised learning tasks on text

1. Bayes' Theorem

Naive Bayes -> a supervised learning classifier built on Bayes' Theorem

Bayes' Theorem can be converted into a machine learning model.
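
For reference, Bayes' Theorem in its classification form is usually written as below. The symbols are assumptions chosen for this note: C_k is a class label and x = (x_1, ..., x_n) is the feature vector (for text, typically word counts).

```latex
P(C_k \mid \mathbf{x}) = \frac{P(C_k)\, P(\mathbf{x} \mid C_k)}{P(\mathbf{x})}
```

The denominator P(x) does not depend on the class, so only the numerator matters when comparing classes.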

Here, the numerator is equal to the joint probability model -> by the chain rule, it can be expressed as a product of conditional probabilities (assuming, of course, that all features x are independent of one another given the class)
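
A sketch of that step, using assumed notation (C_k for the class, x_1, ..., x_n for the features): the chain rule expands the joint probability exactly, and the "naive" independence assumption then drops the extra conditioning terms.

```latex
\begin{aligned}
P(C_k, x_1, \dots, x_n)
  &= P(C_k)\, P(x_1 \mid C_k)\, P(x_2 \mid C_k, x_1) \cdots P(x_n \mid C_k, x_1, \dots, x_{n-1}) \\
  &= P(C_k) \prod_{i=1}^{n} P(x_i \mid C_k)
  \qquad \text{(if each } x_i \text{ is independent of the others given } C_k\text{)}
\end{aligned}
```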

Therefore, the final joint model (the full Naive Bayes Model) is the class prior multiplied by the product of the per-feature conditional probabilities.
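
Written out with assumed notation (C_k for the class, x_i for the i-th feature, K classes in total), the model and the usual decision rule look roughly like this:

```latex
P(C_k \mid x_1, \dots, x_n) \;\propto\; P(C_k) \prod_{i=1}^{n} P(x_i \mid C_k),
\qquad
\hat{y} = \underset{k \in \{1, \dots, K\}}{\arg\max}\; P(C_k) \prod_{i=1}^{n} P(x_i \mid C_k)
```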

The Naive Bayes Model has several variations, including Multinomial, Gaussian, Complement, Categorical, and Bernoulli.
Of these, the most commonly used is the Multinomial Naive Bayes Model.
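
To make the multinomial variant concrete, here is a minimal pure-Python sketch, not a reference implementation. It assumes documents are already tokenized into word lists and uses add-one (Laplace) smoothing. The function names `train_multinomial_nb` and `predict`, the `alpha` parameter, and the toy spam/ham data are all made up for illustration.

```python
import math
from collections import Counter, defaultdict


def train_multinomial_nb(docs, labels, alpha=1.0):
    """Estimate log priors and smoothed log likelihoods from tokenized docs."""
    vocab = sorted({word for doc in docs for word in doc})
    class_doc_counts = Counter(labels)
    word_counts = defaultdict(Counter)  # class -> word -> count
    for doc, label in zip(docs, labels):
        word_counts[label].update(doc)

    log_prior = {c: math.log(n / len(docs)) for c, n in class_doc_counts.items()}
    log_likelihood = {}
    for c in class_doc_counts:
        # Additive smoothing keeps unseen (word, class) pairs from getting probability 0.
        total = sum(word_counts[c].values()) + alpha * len(vocab)
        log_likelihood[c] = {
            w: math.log((word_counts[c][w] + alpha) / total) for w in vocab
        }
    return log_prior, log_likelihood


def predict(doc, log_prior, log_likelihood):
    """Return the class with the highest log prior plus summed log likelihoods."""
    scores = {}
    for c in log_prior:
        # Words outside the training vocabulary are skipped (an assumption of this sketch).
        scores[c] = log_prior[c] + sum(
            log_likelihood[c][w] for w in doc if w in log_likelihood[c]
        )
    return max(scores, key=scores.get)


# Toy, made-up data purely for illustration.
train_docs = [
    "free prize click now".split(),
    "win free money now".split(),
    "meeting schedule for monday".split(),
    "project meeting notes".split(),
]
train_labels = ["spam", "spam", "ham", "ham"]

log_prior, log_likelihood = train_multinomial_nb(train_docs, train_labels)
prediction = predict("free money".split(), log_prior, log_likelihood)
print(prediction)
```

Working in log space turns the product of conditional probabilities into a sum, which avoids numerical underflow on longer documents.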

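In TF-IDF, the IDF (inverse document frequency) term reweights each word by how common it is:
Rare word -> weight is increased
Common word -> weight is decreased (the closer the IDF value is to 0, the more common the word)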

For example, if 'run' appears in all 100 of 100 documents, its IDF is
-> log(100/100) = log 1 = 0
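
The same arithmetic can be checked with a small sketch. It assumes the plain, unsmoothed definition IDF = log(N / df), where N is the number of documents and df is the number of documents containing the term. The helper name `idf` and the three toy documents are hypothetical.

```python
import math


def idf(term, documents):
    """Unsmoothed IDF: log(N / df), with df = number of documents containing the term."""
    n_docs = len(documents)
    df = sum(1 for doc in documents if term in doc)
    if df == 0:
        raise ValueError(f"term {term!r} does not appear in any document")
    return math.log(n_docs / df)


# Toy, made-up corpus: 'run' occurs in every document, 'marathons' in only one.
docs = [
    ["i", "run", "daily"],
    ["they", "run", "fast"],
    ["we", "run", "marathons"],
]

idf_run = idf("run", docs)              # log(3 / 3) = log 1 = 0
idf_marathons = idf("marathons", docs)  # log(3 / 1), a larger weight for the rarer word
print(idf_run, idf_marathons)
```

Libraries often apply smoothed variants of this formula, so the exact values they report may differ from this plain version.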

🍮