https://huggingface.co/papers
https://arxiv.org/list/cs.CV/recent
https://arxiv.org/list/cs.AI/recent
Key topics: Quantization, Pruning, Object detection, Transformer, Mamba
Quantization
One-Index Vector Quantization Based Adversarial Attack on Image Classification
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers
MobileQuant: Mobile-friendly Quantization for On-device Language Models
(Samsung AI Center, Cambridge)
Pruning
LLM Pruning and Distillation in Practice: The Minitron Approach
(NVIDIA)
Mamba
ReMamba: Equip Mamba with Effective Long-Sequence Modeling