논문

1.Attention Is All You Need

post-thumbnail

3.On the Biology of a Large Language Model

post-thumbnail

4.In-N-Out: A Parameter-Level API Graph Dataset for Tool Agent

post-thumbnail

5.PPO

post-thumbnail

6.Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

post-thumbnail