[NLP] Language Model

김선형·2025년 9월 20일

NLP

목록 보기

2/5

개요

Language Model (LM)은 이전 단어(/문자/subword)를 기반으로 문장에서 다음에 올 단어(/문자/subword)를 예측하는 모델이다.
단어 시퀀스 $w_1, w_2, \dots, w_{i-1}$ 이 주어졌을 때, 다음 단어 $w_i$ 의 확률 분포 $P\left(w_i|w_{i-1},\dots,w_1\right)$ 을 계산한다.
텍스트 자동완성, 검색어 입력 시 추천 등에서 사용한다.

n-grams

n-gram은 연속된 $n$ 개의 단어 조각을 의미한다. 예를 들어 'Students opened their notebooks'라는 문장이 있을 때, unigrams, bigrams, trigrams는 각각 다음과 같다.

unigrams: Students, opened, their, notebooks
bigrams: Students opened, opened their, their notebooks
trigrams: Students opened their, opened their notebooks
일반적으로 $n$ 은 5 이하의 값을 사용한다.

n-gram Language Modeling

전체 시퀀스가 아닌 마지막 몇 개의 단어만 사용해 다음 단어를 예측한다. n-gram LM은 마지막 $n-1$ 개의 단어로 다음 단어를 예측한다. 예를 들어, Trigram LM은 아래와 같이 확률 분포를 계산한다.

P\left(w_i|w_1,\dots,w_{i-1}\right)=P\left(w_i|w_{i-1},w_{i-2}\right)

김선형

선형의 비선형적 기록 🐜

이전 포스트

[NLP] Tokenization과 Embedding

다음 포스트

[NLP] Language Model

NLP

개요

n-grams

n-gram Language Modeling

[NLP] Tokenization과 Embedding

[NLP] RNN

0개의 댓글