시리즈

EfficientML

1.[EfficientML ai] Lecture 3: Pruning and Sparsity (1)

Neural Network Pruning Determine the Pruning Ratio What should target sparsity be for each layer? Fine-tune/Train Pruned Neural Network pruning rati

2024년 9월 5일

2.[EfficientML ai] Lecture 4: Pruning and Sparsity (2)

Neural Network Pruning 모든 layer에 동일한 비율로 pruning을 수행하는 것 보다 각 layer별로 영향력에 따라 비율을 달리하는 것이 더 성능이 좋다. 그래프에서 좌상단에 가까울 수록 좋은 것이다. Determine the Pruning

2024년 9월 5일

3.[EfficientML ai] Lecture 5: Quantization Part I

연속적으로 큰 값인 input을 이산적인 값의 집합으로 변환하는 프로세스임. 이렇게 변환할때에 input과 quantization결과의 차이를 quantization error라고 하며, 이 error를 최소화하는 quantization을 하는 것이 목표이다.가중치 q

2025년 12월 2일

4.[MIT] TinyML and Efficient Deep Learning Computing - Lab1

VGG모델, CIFAR10 datasetdense model has accuracy=92.95%dense model has size=35.20 MiBQuestion 1) What are the common characteristics of the weight distr

2026년 1월 6일