LLM

1. [Easy Review] Properly Understanding DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

2. [Easy Review] Properly Understanding Direct Preference Optimization: Your Language Model is Secretly a Reward Model

3. [Easy Review] Properly Understanding Learning to summarize from human feedback

4. [Easy Review] Properly Understanding the Qwen3 Technical Report
