[Paper Review] Model Compression - Quantization, etc.

1. [Simple Review] First-Generation Inference Accelerator Deployment at Facebook

2. A survey of model compression strategies for object detection

3. Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

4. [ICML 2021] How Do Adam and Training Strategies Help BNNs Optimization?

5. [ICLR 2020] Learned Step Size Quantization
