Large Language Model

6. OFA (One For All)

7. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

8. Generative Agents: Interactive Simulacra of Human Behavior

9. Instruction Tuning

10. Instruction Tuning - Supplementary Explanation

11. RLHF: Reinforcement Learning from Human Feedback
