A Survey on Efficient Inference for Large Language Models (Zhou et al., 2024) 논문 리뷰
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization(Deng et al., 2025) 논문 리뷰