Understanding how orthogonality of parameters improves quantization of neural networks

Eryılmaz, Şükrü Burç; Dündar, Ayşegül

Files

Understanding_How_Orthogonality_of_Parameters_Improves_Quantization_of_Neural_Networks.pdf (1.81 MB)

Date

2022-05-10

Authors

Eryılmaz, Şükrü Burç

Dündar, Ayşegül

Abstract

We analyze why the orthogonality penalty improves quantization in deep neural networks. Using results from perturbation theory and extensive experiments with Resnet50, Resnet101, and VGG19 models, we show mathematically and experimentally that the improved quantization accuracy resulting from the orthogonality constraint stems primarily from reduced condition numbers (the condition number being the ratio of the largest to the smallest singular value of a weight matrix), more so than from reduced spectral norms, in contrast to the explanations in previous literature. We also show that the orthogonality penalty improves quantization even in the presence of a state-of-the-art quantized retraining method. When the orthogonality penalty is used with quantized retraining, the ImageNet Top5 accuracy loss from 4- to 8-bit quantization is reduced by up to 7% for Resnet50 and up to 10% for Resnet101, compared with quantized retraining with no orthogonality penalty.
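
The quantities named in the abstract can be illustrated with a minimal NumPy sketch. It is not taken from the paper: the function names, the 64x64 matrix size, and the soft penalty form ||W^T W - I||_F^2 are assumptions chosen for illustration, and the authors' exact penalty may differ. The sketch computes the spectral norm (largest singular value) and the condition number (ratio of largest to smallest singular value) of a weight matrix, and shows that an orthogonal matrix has both equal to 1 and a penalty of approximately zero.

```python
import numpy as np


def spectral_norm(w):
    """Largest singular value of the weight matrix w."""
    # Singular values are returned in descending order.
    return float(np.linalg.svd(w, compute_uv=False)[0])


def condition_number(w):
    """Ratio of the largest to the smallest singular value of w."""
    s = np.linalg.svd(w, compute_uv=False)
    return float(s[0] / s[-1])


def orthogonality_penalty(w):
    """Soft orthogonality penalty ||W^T W - I||_F^2.

    Assumption: this is one common form of the penalty; the paper's
    exact formulation is not given in the abstract.
    """
    gram = w.T @ w
    identity = np.eye(w.shape[1])
    return float(np.linalg.norm(gram - identity, ord="fro") ** 2)


# Hypothetical example weights: a random Gaussian matrix and an
# orthogonal matrix obtained from its QR decomposition.
rng = np.random.default_rng(0)
w_random = rng.standard_normal((64, 64))
w_orth, _ = np.linalg.qr(w_random)

for name, w in [("random", w_random), ("orthogonal", w_orth)]:
    print(
        f"{name:>10}: spectral norm = {spectral_norm(w):.3f}, "
        f"condition number = {condition_number(w):.3f}, "
        f"penalty = {orthogonality_penalty(w):.3e}"
    )
```

Driving the penalty toward zero pushes all singular values toward 1, which lowers the spectral norm and the condition number together; the abstract's claim is that the reduced condition number is the more important of the two effects for quantization accuracy.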

Source Title

IEEE Transactions on Neural Networks and Learning Systems

Publisher

IEEE

Keywords

Deep neural networks, Orthogonality regularization, Perturbation theory, Quantization

Permalink

http://hdl.handle.net/11693/111435

Published Version (Please cite this version)

https://www.doi.org/10.1109/TNNLS.2022.3171297

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article
