Author(s):
1. Zahra Waheed Awan:
Bahria University Islamabad,Pakistan
2. Shehzad Khalid:
Bahria University Islamabad,Pakistan
3. Sajid Gul:
National University of Sciences and Technology Islamabad,Pakistan
Abstract:
Convolutional Neural Network (CNN) is considered as one of the most significant algorithms of deep learning that has made impressive achievements in many areas of computer vision and natural language processing. In the current times of big data, input data dimensions keep on increasing which leads to the development of complex CNN models for processing such big data. This has made CNN computationally intensive and limits its practical application to some extent. To address the aforementioned issue, this paper presents a detailed review of various network compression methods existing in literature. Two most commonly deployed network compression methods have been discussed including pruning and quantization which can be coupled with CNN to increase its applicability. The main goal of presenting this comprehensive review of the state-of-the-art pruning and quantization-based network compression schemes is to significantly improve trade-off between CNN architectural size and computational cost versus its performance in resource restricted environments. Additionally, this paper also exploits the challenges posed by these techniques when implemented for large-scale CNNs. In this context, paper also presents a novel framework to perform network compression of CNN to meet the requirements of resource-restricted devices.
Page(s):
1-1
DOI:
DOI not available
Published:
Journal: IEEE International Conference on Digital Futures and Transformative Technologies (ICoDT2) May 24-26, 2022 (Book of Abstracts), Volume: 1, Issue: 1, Year: 2022
Keywords:
natural language processing
,
Computer vision
,
CNN Compression Framework
References:
References are not available for this document.
Citations
Citations are not available for this document.