Research on Model Compression Based on Convolutional Neural Network

Minglan Xu

DOI: 10.25236/cseem.2020.026

Author(s)

Minglan Xu

Corresponding Author

Minglan Xu

Abstract

Deep learning methods, in their many variants, have achieved remarkable success in a variety of applications. Perhaps the most popular instance is the convolutional neural network (CNN), which stacks numerous convolutional layers to process image-based input and yield the desired output. CNNs typically contain an enormous number of parameters and require a huge number of floating-point operations for inference. Hence, filtering out redundant parameters has become increasingly necessary. In this paper, we study how to compress CNN architectures using sparsity-inducing regularization. We validate the method on one benchmark architecture, VGG16, and one dataset, MNIST.
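
To make the idea of sparsity-inducing regularization concrete, the following is a minimal sketch only. The abstract does not state which regularizer or optimizer is used, so the L1 penalty, the soft-thresholding (proximal) step, the function names, and the example weights below are all assumptions chosen for illustration.

```python
def soft_threshold(weights, threshold):
    """Proximal operator of threshold * ||w||_1 (assumed regularizer).

    Each weight is shrunk toward zero by `threshold`; weights whose
    magnitude is at most `threshold` become exactly zero.
    """
    shrunk = []
    for w in weights:
        if w > threshold:
            shrunk.append(w - threshold)
        elif w < -threshold:
            shrunk.append(w + threshold)
        else:
            shrunk.append(0.0)
    return shrunk


def sparsity(weights):
    """Fraction of weights that are exactly zero."""
    return sum(1 for w in weights if w == 0.0) / len(weights)


# Hypothetical flattened filter weights, not taken from the paper.
weights = [0.75, -0.03, 0.02, -0.5, 0.01, 0.25]
pruned = soft_threshold(weights, threshold=0.05)

print(pruned)
print(sparsity(pruned))
```

In this sketch, small-magnitude weights are set exactly to zero while large ones are only slightly shrunk, which is what allows redundant parameters to be removed after training. A larger threshold would zero more weights at the cost of distorting the remaining ones.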

Keywords

Convolutional neural network, Deep learning, Benchmark architecture VGG16, Dataset MNIST