논문 리뷰

1.[논문 리뷰] Direct Preference Optimization: Your Language Model is Secretly a Reward Model

post-thumbnail

2.[논문 리뷰] SimPO: Simple Preference Optimization with a Reference-Free Reward

post-thumbnail

3.[논문 리뷰] A Simple Framework for Contrastive Learning of Visual Representations

post-thumbnail

4.[논문 리뷰] Positive Unlabeled Contrastive Learning

post-thumbnail