Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation (preprint)

박상우 · July 23, 2024

Paper Review


LLMs have emerged as a strong option for the Conversational Recommender System (CRS) task
- In particular, they have been shown to be strong in contextual knowledge
We treat using an LLM for CRS as a Differentiable Search Index (DSI) task
- This view reveals both an ability and a limitation
- Ability: LLMs can index most popular movies and can understand complex conversation contexts
- Limitation: the data distribution is misaligned (in the figure, the populations differ)
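We try to fix the misalignment by adjusting the LLM's target distribution
- This improves CRS accuracy and brings controllability and fairness
  * However, applying this to an LLM raises several problems: unlike in a traditional RecSys model, an item's logit is not a single entry of one vector but is spread across the multiple word tokens of its title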
To address this, the Reindex-Then-Adapt (RTA) framework is proposed
- Reindex step: convert multi-token item titles into single tokens
- Adapt step: transform the distribution based on the single-token logits
The reported performance is strong

DSI showed that the Transformer architecture is well suited to retrieval tasks
- Here, its Learn to Index (L2I) and Learn to Retrieve (L2R) steps are adapted for recommendation

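LLMs have indexing capability even without any special fine-tuning
The table shows that warm and popular items make up the majority of recommendations
- The current indexing ability alone is enough for recommendation
- Cold items are left as future work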

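In Section 1, we noted that varying token counts make it challenging to adjust the recommendation distribution
- To handle this, we assume the existing indexing ability is sufficient and reindex each item to a single token
- The logits of the reindexed LLM are then adjusted toward the target distribution (using a logit vector transformation and a gating mechanism)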
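
To see why varying token counts get in the way, here is a toy sketch. It is not the paper's code: the titles, log-probabilities, logits, and the `bias` correction are all made-up values for illustration. With multi-token titles an item's score is a sum over a variable number of token log-probabilities, so there is no single per-item logit to shift. With one token per item, a per-item adjustment becomes a simple addition before the softmax.

```python
import math

# Hypothetical per-token log-probabilities for two generated titles.
# The two titles have different token counts (1 vs 3).
token_log_probs = {
    "Titanic": [-0.5],
    "The Dark Knight": [-0.4, -0.3, -0.2],
}

def title_log_prob(log_probs):
    """Score of a multi-token title: the sum of its token log-probabilities."""
    return sum(log_probs)

scores = {title: title_log_prob(lp) for title, lp in token_log_probs.items()}

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Assumed state after reindexing: exactly one logit per item.
item_logits = {"Titanic": 2.0, "The Dark Knight": 1.0}
# Hypothetical per-item correction (e.g. to damp an over-recommended item).
bias = {"Titanic": -1.5, "The Dark Knight": 0.0}

titles = list(item_logits)
probs = softmax([item_logits[t] + bias[t] for t in titles])
adjusted = dict(zip(titles, probs))
```

In the single-token case the adjustment touches one number per item, which is what makes the later Adapt step tractable.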

The core of reindexing is compressing an item's multi-token embeddings into a single-token item embedding while preserving the original semantics in LLM generation
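
A minimal sketch of the compression idea, assuming plain mean pooling as the aggregator. These notes do not say which aggregator the paper uses, and a learned one is equally plausible, so `aggregate_item_embedding` and the toy vectors below are hypothetical.

```python
def aggregate_item_embedding(token_embeddings):
    """Mean-pool a list of d-dimensional token embeddings into one vector.

    Mean pooling is an assumption made for illustration only.
    """
    if not token_embeddings:
        raise ValueError("an item needs at least one token embedding")
    dim = len(token_embeddings[0])
    if any(len(vec) != dim for vec in token_embeddings):
        raise ValueError("all token embeddings must share the same dimension")
    n = len(token_embeddings)
    return [sum(vec[i] for vec in token_embeddings) / n for i in range(dim)]

# Toy 2-dimensional embeddings for a three-token title.
title_tokens = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
item_embedding = aggregate_item_embedding(title_tokens)
```

Whatever the aggregator, the output has a fixed dimension regardless of how many tokens the title had, so it can serve as the embedding of one new item token.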

Negative samples are prepared and a contrastive loss is used
$q \in \mathbb{R}^d$
- The context embedding at the last position, which was used to generate the first token of the originally indexed item
- Here we use the aggregated item embedding
Two corpora are used for training
- L2R: (query, target item) pairs, drawn from conversations
- L2I: (content, target item) pairs, where the content comes from textual descriptions such as metadata
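
The contrastive objective above can be sketched as an InfoNCE-style loss. This is an assumed form, not the paper's exact loss: the dot-product similarity, the `temperature` parameter, and the toy vectors are illustrative choices. `q` plays the role of the context embedding and `positive` the aggregated embedding of the target item.

```python
import math

def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def contrastive_loss(q, positive, negatives, temperature=1.0):
    """InfoNCE-style loss: negative log-softmax score of the positive item.

    The positive is placed at index 0 of the logits, followed by the negatives.
    """
    logits = [dot(q, positive) / temperature]
    logits += [dot(q, neg) / temperature for neg in negatives]
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum_exp - logits[0]

q = [1.0, 0.0]                          # context embedding (toy values)
target = [1.0, 0.0]                     # aggregated embedding of the target item
negatives = [[0.0, 1.0], [-1.0, 0.0]]   # sampled negative items

loss_good = contrastive_loss(q, target, negatives)
# Swapping in a poorly aligned "positive" gives a larger loss.
loss_bad = contrastive_loss(q, negatives[1], [target, negatives[0]])
```

The same function would apply to both corpora, with `q` computed from a conversation (L2R) or from an item's textual description (L2I).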

This targets the collaborative information that LLMs lack
$\tilde{g}$ is a vector obtained from a traditional RecSys model
$\alpha$ is a tunable value between 0 and 1
In the experiments it is set as a learnable scalar, but an MLP or similar could be placed inside to use a learnable contextual embedding instead
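
One way to picture the gating is a convex combination of the two logit vectors. The form $(1-\alpha)\,g + \alpha\,\tilde{g}$ is an assumption, since these notes do not reproduce the paper's formula. Passing an unconstrained scalar `gate_param` through a sigmoid to keep $\alpha$ in $(0, 1)$ is likewise just one common way to parameterize a learnable gate.

```python
import math

def sigmoid(x):
    """Map an unconstrained scalar to the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def gated_logits(llm_logits, recsys_logits, gate_param):
    """Mix item logits as (1 - alpha) * g + alpha * g_tilde.

    alpha = sigmoid(gate_param); this mixing form is an assumption.
    """
    if len(llm_logits) != len(recsys_logits):
        raise ValueError("both logit vectors must cover the same items")
    alpha = sigmoid(gate_param)
    return [
        (1.0 - alpha) * g + alpha * g_tilde
        for g, g_tilde in zip(llm_logits, recsys_logits)
    ]

llm_logits = [2.0, 0.0, -1.0]      # g: single-token item logits from the LLM
recsys_logits = [0.0, 2.0, -1.0]   # g_tilde: scores from a traditional RecSys model
mixed = gated_logits(llm_logits, recsys_logits, gate_param=0.0)  # alpha = 0.5
```

Replacing the scalar `gate_param` with the output of a small network over the conversation context would give the contextual variant mentioned above.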