Conference proceeding
Lossless Compression of Deep Neural Networks
INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, CPAIOR 2020, Vol.12296, pp.417-430
Lecture Notes in Computer Science
01/01/2020
DOI: 10.1007/978-3-030-58942-4_27
Abstract
Deep neural networks have been successful in many predictive modeling tasks, such as image and language recognition, where large neural networks are often used to obtain good accuracy. Consequently, it is challenging to deploy these networks under limited computational resources, such as in mobile devices. In this work, we introduce an algorithm that removes units and layers of a neural network while not changing the output that is produced, which thus implies a lossless compression. This algorithm, which we denote as LEO (Lossless Expressiveness Optimization), relies on Mixed-Integer Linear Programming (MILP) to identify Rectified Linear Units (ReLUs) with linear behavior over the input domain. By using l(1) regularization to induce such behavior, we can benefit from training over a larger architecture than we would later use in the environment where the trained neural network is deployed.
Details
- Title: Subtitle
- Lossless Compression of Deep Neural Networks
- Creators
- Thiago Serra - Bucknell UniversityAbhinav Kumar - University of UtahSrikumar Ramalingam - University of Utah
- Contributors
- E Hebrard (Editor)N Musliu (Editor)
- Resource Type
- Conference proceeding
- Publication Details
- INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, CPAIOR 2020, Vol.12296, pp.417-430
- Series
- Lecture Notes in Computer Science
- DOI
- 10.1007/978-3-030-58942-4_27
- ISSN
- 0302-9743
- eISSN
- 1611-3349
- Publisher
- Springer Nature
- Number of pages
- 14
- Language
- English
- Date published
- 01/01/2020
- Academic Unit
- Business Analytics
- Record Identifier
- 9984696656402771
Metrics
12 Record Views