Paper Reviews

1.Few-Shot Learning(Siamese, Triplet, Relation Neural Network)

3.End-to-end Neural Coreference Resolution

4.Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

6.Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

7.GPT2 : Language Models are Unsupervised Multitask Learners

8.GPT1 : Improving Language Understanding by Generative Pre-Training

9.XLNet: Generalized Autoregressive Pretraining for Language Understanding

10.MT-DNN: Multi-Task Deep Neural Networks for Natural Language Understanding

11.Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy

12.ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

13.ELECTRA : Pre-training Text Encoders as Discriminators Rather Than Generators

14.GPT-3 : Language Models are Few-Shot Learners

15.InstructGPT

16.LoRA : Low-Rank Adaptation of Large Language Models

17.Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

18.Extrapolating Large Language Models to Non-English by Aligning Languages

19.DPO : Direct Preference Optimization: Your Language Model is Secretly a Reward Model

20.GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints

21.NEFTune: Noisy Embeddings Improve Instruction Finetuning

22.Multiscale Positive-Unlabeled Detection of AI-Generated Texts

23.RLAIF

24.KCTS - Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection

25.Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

26.Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models

27.DayDreamer: World Models for Physical Robot Learning

28.Decision Transformer: Reinforcement Learning via Sequence Modeling

29.Active Retrieval Augmented Generation

30.Query2doc: Query Expansion with Large Language Models

31.Self-ask: Measuring and Narrowing the Compositionality Gap in Language Models

32.RaR: Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

33.Self-Rewarding Language Models

34.Unifying Large Language Models and Knowledge Graphs: A Roadmap

35.MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models

36.Grape: Knowledge Graph Enhanced Passage Reader for Open-domain Question Answering

37.Learning to Tokenize for Generative Retrieval

38.Multimodal Diffusion: A Summary of 10 Papers

39.Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts

40.Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding

41.Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL

42.LongEmbed: Extending Embedding Models for Long Context Retrieval

43.Mathematical Language Models: A Survey

44.Let's Verify Step by Step

45.Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

46.Local Interpretations for Explainable Natural Language Processing: A Survey

47.Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Maps

48.Sparse Autoencoders Find Highly Interpretable Features in Language Models

49.Scaling and evaluating sparse autoencoders

50.Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning

51.Towards Automated Circuit Discovery for Mechanistic Interpretability

52.Controlling Large Language Model Agents with Entropic Activation Steering

53.Representation Engineering: A Top-Down Approach to AI Transparency

54.DRAGIN: Dynamic Retrieval Augmented Generation Based on the Real-Time Information Needs of Large Language Models

55.Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

56.RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

57.Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations

58.Visual Instruction Tuning

59.MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and Text
