Quantization[논문핵심]

1.[핵심][20.04]AdaRound

post-thumbnail

2.[핵심][22.06]ZeroQuant

post-thumbnail

3.[핵심] [22.08]Optimal Brain Quantizer

post-thumbnail

4.[핵심][22.08]LLM.int8()

post-thumbnail

5.[핵심][22.11]SmoothQuant

post-thumbnail

6.[핵심][23.03] GPTQ

post-thumbnail

7.[핵심][23.06]OWQ

post-thumbnail

8.[24.08]The Impact of Quantization on Retrieval-Augmented Generation: An Analysis of Small LLMs

post-thumbnail

9.[핵심]QLLM

post-thumbnail

10.[핵심][24.ICLR] RETHINKING CHANNEL DIMENSIONS TO ISOLATE OUTLIERS FOR LOW-BIT WEIGHT QUANTIZATION OF LARGE LANGUAGE MODELS

post-thumbnail