Natural Language Preprocessing

1. Machine Learning Workflow

2. Tokenization

3. Cleaning and Normalization

4. Stemming and Lemmatization

5. Stopwords and Regular Expressions

6. Integer Encoding

7. Padding

8. Text Preprocessing Tools for Korean Text
