Sign in
Accelerating Sparse CNN Inference on GPUs with Performance-Aware Weight Pruning
Conference proceeding

Accelerating Sparse CNN Inference on GPUs with Performance-Aware Weight Pruning

Masuma Akter Rumi, Xiaolong Ma, Yanzhi Wang and Peng Jiang
PACT '20: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, pp.267-278
International Conference on Parallel Architectures and Compilation Techniques
01/01/2020
DOI: 10.1145/3410463.3414648

View Online

Abstract

Computer Science Technology Computer Science, Hardware & Architecture Computer Science, Software Engineering Computer Science, Theory & Methods Science & Technology

Details

Metrics