Paper Review

1.[RL] DQN

post-thumbnail

2.[RL] AlpahGo Zero

post-thumbnail

3.[DL] Deep Residual Learning for Image Recognition

post-thumbnail

4.[DL] Attention Is All You Need

post-thumbnail

5.[DL] Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

post-thumbnail

6.[DL] Layer Normalization

post-thumbnail

7.[DL] Dropout: A Simple Way to Prevent Neural Networks from Overfitting

post-thumbnail

8.[DL] Adam: A Method for Stochastic Optimization

post-thumbnail

9.[NLP] TermDiffuSum: A Term-guided Diffusion Model for Extractive Summarization of Legal Documents

post-thumbnail

10.[NLP] Integrating Extractive and Abstractive Summarization: A Hybrid Approach

post-thumbnail

11.[NLP] On Extractive and Abstractive Neural Document Summarization with Transformer Language Models

post-thumbnail

12.[RLHF] Deep Reinforcement Learning from Human Preferences

post-thumbnail

13.[DPO] Direct Preference Optimization: Your Language Model is Secretly a Reward Mode

post-thumbnail