(Source: Zhou Zixuan, et al., A Survey on Efficient Inference for Large Language Models, 2024)
[1] https://arxiv.org/pdf/2404.14294