dc.contributor.author | Shaterian Bidgoli, Sepideh | |
dc.date.accessioned | 2021-01-18 20:57:04 (GMT) | |
dc.date.available | 2021-01-18 20:57:04 (GMT) | |
dc.date.issued | 2021-01-18 | |
dc.date.submitted | 2021-01-13 | |
dc.identifier.uri | http://hdl.handle.net/10012/16692 | |
dc.description.abstract | Deep Neural Networks (DNNs) demonstrate excellent performance in many Computer Vision
(CV) applications such as image classification. To meet storage/bandwidth requirements,
the input images to these CV applications are compressed using lossy image compression standards,
among which JPEG is the most common. Classical JPEG is designed to consider Human
Vision (HV) and pays a little attention to CV, resulting in classification accuracy drop of DNNs,
especially at high Compression Ratios (CRs). This work presents Deep Selector-JPEG, an adaptive
JPEG compression method that simultaneously targets both image classification and HV. For
each image, Deep Selector-JPEG selects a Quality Factor (QF) adaptively to compress the image
so that a good trade-off between the Compression Ratio (CR) and DNN classifier Accuracy
(Rate-Accuracy performance) can be achieved over a set of images for a variety of DNN classifiers
while the PSNR of such compressed image is greater than a threshold value predetermined
by HV with a high probability.
Towards this end, Deep Selector-JPEG first defines a set of feasible QFs such that an image
compressed at any QF within this set has PSNR greater than a predetermined threshold value
with a high probability. For some images, multiple QFs within this set are suitable (ON) for
compressing for a DNN classifier, which means compressing at these QFs at least maintains
the ground truth rank of the original input for the DNN classifier. For a given image, Deep
Selector-JPEG first determines the QFs that are ON among the set of feasible QFs. This problem
is represented as a Multi-label Classification (MLC) problem since each image has multiple
corresponding suitable QFs. We solve MLC using a binary relevance procedure, which involves
training an independent binary DNN classifier for each QF within the feasible set to predict the
ON/OFF labeling for each input image. Given a target CR, we empirically derive a subset of
feasible QFs for this target CR and select the least QF that is ON in this set.
Experimental results show that in comparison with the default JPEG, Deep Selector-JPEG
indeed achieves better Rate-Accuracy performance over the entire ImageNet validation set for
all tested DNN classifiers with gains in classification accuracy up to 1% at the same CRs, while
satisfying HV constraints and keeping complexity under control. | en |
dc.language.iso | en | en |
dc.publisher | University of Waterloo | en |
dc.subject | computer vision | en |
dc.subject | deep learning | en |
dc.subject | image compression | en |
dc.subject | image processing | en |
dc.title | Deep SELECTOR-JPEG: ADAPTIVE JPEG IMAGE COMPRESSION FOR COMPUTER VISION IN IMAGE CLASSIFICATION AND HUMAN VISION | en |
dc.type | Master Thesis | en |
dc.pending | false | |
uws-etd.degree.department | Electrical and Computer Engineering | en |
uws-etd.degree.discipline | Electrical and Computer Engineering | en |
uws-etd.degree.grantor | University of Waterloo | en |
uws-etd.degree | Master of Applied Science | en |
uws-etd.embargo.terms | 0 | en |
uws.contributor.advisor | Yang, En-hui | |
uws.contributor.affiliation1 | Faculty of Engineering | en |
uws.published.city | Waterloo | en |
uws.published.country | Canada | en |
uws.published.province | Ontario | en |
uws.typeOfResource | Text | en |
uws.peerReviewStatus | Unreviewed | en |
uws.scholarLevel | Graduate | en |