paper-study

1.Who's Harry Potter? Approximate Unlearning in LLMs

2.BitNet: Scaling 1-bit Transformers for LLMs

3.Rephrase and Respond: Let LLMs Ask Better Questions for Themselves

4.Direct Preference Optimization: Your Language Model is Secretly a Reward Model

5.SOLAR 10.7B: Scaling LLMs with Simple yet Effective Depth Up-Scaling

6.Sparse Upcycling: Training MoE from Dense Checkpoints

7.Spotting LLMs with Binoculars: Zero-Shot Detection of Machine-Generated Text

8.Adaptation with Self-Evaluation to Improve Selective Prediction in LLMs

9.Repeat After Me: Transformers are Better than State Space Models at Copying

10.Self-Discover: LLMs Self-Compose Reasoning Structure

11.The Era of 1-bit LLMs: All LLMs are in 1.58 bits

12.Beyond Language Models: Byte Models are Digital World Simulators

13.Is Cosine-Similarity of Embeddings Really About Similarity?

14.Training Neural Networks from Scratch with Parallel Low-Rank Adapters

15.Can large language models explore in-context?

16.Octopus v4: Graph of Language Models

17.LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

18.Elements of World Knowledge (EWoK)

19.Computational analysis of 140 years of US political speeches reveals more positive but increasingly polarized framing of immigration

20.Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

21.A Multi-Task Benchmark for Korean Legal Language Understanding and Judgement Prediction

22.SELF-EXPERTISE: Knowledge-Based Instruction Dataset Augmentation for a Legal Expert Language Model

23.GPTScore: Evaluate as You Desire
