New Convolutional Neural Network Topology with Compressed Information to Enhance Accuracy for Image Classification Task

Jiang, Yanbing

New Convolutional Neural Network Topology with Compressed Information to Enhance Accuracy for Image Classification Task

dc.contributor.advisor	En-Hui, Yang
dc.contributor.author	Jiang, Yanbing
dc.date.accessioned	2019-09-23T17:46:50Z
dc.date.available	2019-09-23T17:46:50Z
dc.date.issued	2019-09-23
dc.date.submitted	2019-09-19
dc.description.abstract	Source coding and deep learning are two major branches in the field of information processing. Source coding encodes information that can be summarised with patterns into certain representation without semantic consideration. On the other hand, deep learning utilises multi-layers of representations with increasing levels of abstraction to learn the patterns that cannot be summarised easily. What is interesting is that source coding itself makes great contributions to the field of deep learning. The key that makes deep learning successful is the inclusion of cascading non-linear layers that help the network to abstract multi-level features. Source coding, such as image compression, contains fundamental non-linear operations including quantisation and rounding. How the non-linearity from the compression could further help deep learning is the inspiration of this research even though common sense tells us that compression usually results a worse ability to do recognition. This paper proposes the idea of integrating source coding and deep learning to have better accuracy performance in image classification. Image classification is one of the most popular tasks in the field of deep learning. Based on human vision’s perception to classify object(s) in images, when the images are compressed, such as by JPEG, the human’s recognition ability deteriorates. Nonetheless, it is not usually the case in machine's perspective. Compressed images may be recognised better by machine based on our observation. In order to improve the accuracy of image recognition, this study focuses on improving the pre-processing operation before image input into the neural network. At the meantime, we proposed a new Convolutional Neural Network (CNN) topology, which absorbs original input along with its various compressed versions. JPEG image compression is friendly for human when the images are compressed with higher quality. However, what level of the compressed image is machine friendly is uncertain. This topology facilitates the compressed information across the compression inputs from low to high qualities and lets the machine to learn from all potential compressed information by itself. We trained the topology with proposed Block-by-block training method and were able to increase the accuracy of state-of-art CNN for image classification: 0.374% increase in Top-1 accuracy, 0.346% increase in Top-5 accuracy in terms of Inception V3 model and 0.39% increase in Top-1 accuracy and 0.228% increase in Top-5 accuracy in terms of ResNet-50 V2 model. What's more, we can state that compression can highlight the contrast of the objects and discard interference information which helps our topology improve the accuracy of image classification based on visual observations. Furthermore, we believe the accuracy performance could be even more outstanding if our topology is applied to the state-of-art EfficientNet (published May 2019).	en
dc.identifier.uri	http://hdl.handle.net/10012/15127
dc.language.iso	en	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.subject	Image Compression	en
dc.subject	Machine Learning	en
dc.subject	Deep Learning	en
dc.subject	GPU Utilization	en
dc.title	New Convolutional Neural Network Topology with Compressed Information to Enhance Accuracy for Image Classification Task	en
dc.type	Master Thesis	en
uws-etd.degree	Master of Applied Science	en
uws-etd.degree.department	Electrical and Computer Engineering	en
uws-etd.degree.discipline	Electrical and Computer Engineering	en
uws-etd.degree.grantor	University of Waterloo	en
uws.contributor.advisor	En-Hui, Yang
uws.contributor.affiliation1	Faculty of Engineering	en
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Jiang_Yanbing.pdf
Size:: 3.69 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.08 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Electrical and Computer Engineering