Fast Exploration of Weight Sharing Opportunities for CNN Compression

Etienne Dupuis, David Novo, Ian O'Connor, Alberto Bosio

The computational workload involved in Convolutional Neural Networks (CNNs) is typically out of reach for low-power embedded devices. A large number of approximation techniques exist to address this problem. These methods have hyper-parameters that need to be optimized for each CNN using design space exploration (DSE). The goal of this work is to demonstrate that the time spent in the DSE phase can easily explode for state-of-the-art CNNs. We therefore propose an optimized exploration process that drastically reduces the exploration time without sacrificing the quality of the output.
