Memory efficient filtering algorithms for convolutional neural networks

buir.advisor: Morgül, Ömer
dc.contributor.author: Çakır, Bahadır Alp
dc.date.accessioned: 2021-01-08T08:50:19Z
dc.date.available: 2021-01-08T08:50:19Z
dc.date.copyright: 2020-12
dc.date.issued: 2020-12
dc.date.submitted: 2021-01-07
dc.department: Department of Electrical and Electronics Engineering
dc.description: Cataloged from PDF version of article.
dc.description: Thesis (Master's): İhsan Doğramacı Bilkent University, Department of Electrical and Electronics Engineering, 2020.
dc.description: Includes bibliographical references (leaves 51-56).
dc.description.abstract: Deployment of state-of-the-art CNN architectures such as Xception, ResNet, and GoogLeNet on resource-limited devices is a major challenge. These architectures consist of many layers and millions of parameters, and they require billions of floating-point operations to perform inference on a single image. The memory needed to store the parameters and the computation needed to execute them are therefore the main constraints on efficient convolutional neural network architectures. In this thesis, we examine Winograd's minimal filtering algorithms to reduce the number of floating-point operations performed in convolutional layers. We reduce the number of multiplications by a factor of 2.25 without any accuracy loss. Moreover, we investigate sparse and quantized Winograd algorithms to make the conventional Winograd algorithm more memory efficient. We propose a linear quantization scheme to quantize the weights of the networks to more than 1 bit. We use the ReLU activation function and Targeted Dropout, a variant of Dropout, to prune the transformed inputs of the Winograd algorithm. We binarize the weights so that most arithmetic operations are converted into bit-wise operations. We conduct several experiments on the CIFAR-10 and CIFAR-100 datasets and discuss the classification performance of both the conventional and the modified Winograd minimal filtering algorithms. The classification error of our ReLU-ed Winograd CNN stays within 1.9% of that of the conventional Winograd CNN. We reduce memory requirements by up to a factor of 32 by binarizing the weights of the ReLU-ed Winograd CNN, at the cost of roughly 2% accuracy. Lastly, for applications that are less tolerant to accuracy loss, we quantize the weights to 2, 4, and 8 bits rather than binarizing them. Our quantized ReLU-ed Winograd CNNs reach the same accuracy levels as the ReLU-ed Winograd CNN.
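The factor of 2.25 quoted in the abstract comes from the standard F(2x2, 3x3) Winograd transform, which produces a 2x2 output tile with 16 multiplications instead of the 36 needed by direct convolution (36/16 = 2.25). The Python sketch below is illustrative only and is not code from the thesis; it shows the 1-D building block F(2, 3), where two outputs of a 3-tap filter are computed with 4 multiplications instead of 6, together with a check against direct convolution.

# Illustrative sketch (assumed, not from the thesis) of Winograd's F(2, 3):
# two outputs of a 3-tap filter using 4 multiplications instead of 6.
def winograd_f2_3(d, g):
    d0, d1, d2, d3 = d          # four input samples
    g0, g1, g2 = g              # three filter taps
    # Filter-side transforms (can be precomputed once per filter).
    gw0 = g0
    gw1 = (g0 + g1 + g2) / 2.0
    gw2 = (g0 - g1 + g2) / 2.0
    gw3 = g2
    # Element-wise products: the only 4 multiplications.
    m1 = (d0 - d2) * gw0
    m2 = (d1 + d2) * gw1
    m3 = (d2 - d1) * gw2
    m4 = (d1 - d3) * gw3
    # Output transform: additions only.
    return [m1 + m2 + m3, m2 - m3 - m4]

def direct_conv(d, g):
    # Direct computation of the same two outputs: 6 multiplications.
    return [sum(d[i + k] * g[k] for k in range(3)) for i in range(2)]

if __name__ == "__main__":
    d = [1.0, 2.0, -3.0, 0.5]
    g = [0.25, -1.0, 0.75]
    w, r = winograd_f2_3(d, g), direct_conv(d, g)
    assert all(abs(a - b) < 1e-9 for a, b in zip(w, r))
    print(w)  # matches the direct computation

Nesting the same transform over both spatial dimensions yields F(2x2, 3x3) and the 16-versus-36 multiplication count above; the thesis additionally prunes and quantizes the transformed operands, which this sketch does not attempt to show.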
dc.description.degree: M.S.
dc.description.statementofresponsibility: by Bahadır Alp Çakır
dc.format.extent: xii, 56 leaves : some color, charts ; 30 cm.
dc.identifier.itemid: B130916
dc.identifier.uri: http://hdl.handle.net/11693/54874
dc.language.iso: English
dc.publisher: Bilkent University
dc.rights: info:eu-repo/semantics/openAccess
dc.subject: Winograd's minimal filtering algorithms
dc.subject: ReLU
dc.subject: Targeted dropout
dc.subject: Binary weights
dc.subject: Quantized weights
dc.subject: Memory efficiency
dc.title: Memory efficient filtering algorithms for convolutional neural networks
dc.title.alternative: Evrişimli yapay sinir ağları için bellek verimli filtreleme algoritmaları
dc.type: Thesis

Files

Original bundle
Name: 10373488.pdf
Size: 1.22 MB
Format: Adobe Portable Document Format
Description: Full printable version

License bundle
Name: license.txt
Size: 1.71 KB
Description: Item-specific license agreed upon to submission