Rethinking Image Compression for the Object Detection Task

Barua, Souptik

dc.contributor.advisor	Veeraraghavan, Ashok	en_US
dc.contributor.committeeMember	Baraniuk, Richard	en_US
dc.contributor.committeeMember	Shrivastava, Anshumali	en_US
dc.creator	Barua, Souptik	en_US
dc.date.accessioned	2016-01-06T20:30:30Z	en_US
dc.date.available	2016-01-06T20:30:30Z	en_US
dc.date.created	2015-12	en_US
dc.date.issued	2015-12-03	en_US
dc.date.submitted	December 2015	en_US
dc.date.updated	2016-01-06T20:30:30Z	en_US
dc.description.abstract	Traditionally, image compression algorithms such as JPEG have been designed to satisfy human viewers. Increasingly, however, images are viewed by computers performing computer vision tasks such as object detection. Image compression and object detection have so far been largely independent areas of research. However, several applications, such as surveillance and medical imaging, impose severe bandwidth and power restrictions. These constraints make the quality and/or size of the compressed image a critical factor in object detection performance. My work presents three compressed image representations that enable fast and accurate object detection. The first is a saliency-guided wavelet representation, which modifies traditional wavelet compression using knowledge of saliency to improve both compression and detection performance compared to JPEG images. The second, called the event stream representation, comes directly from the new DVS sensor, which has ultra-low bandwidth and power requirements. We show, for the first time, high-speed video reconstruction and direct detection on the event data, and we achieve detection performance comparable to that on conventional JPEG images. Finally, we explore an abstract compressed representation called the patch-wise binary representation, which represents an image (patch-wise) as a collection of short binary strings. We demonstrate two ways of generating these binary strings, called hashing and feature binarization, which enable 10x faster detection. We show promising detection and reconstruction results for both approaches.	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.citation	Barua, Souptik. "Rethinking Image Compression for the Object Detection Task." (2015) Master’s Thesis, Rice University. https://hdl.handle.net/1911/87714.	en_US
dc.identifier.uri	https://hdl.handle.net/1911/87714	en_US
dc.language.iso	eng	en_US
dc.rights	Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.	en_US
dc.subject	Image compression	en_US
dc.subject	object detection	en_US
dc.subject	wavelet	en_US
dc.subject	DVS sensor	en_US
dc.subject	video reconstruction	en_US
dc.subject	hashing	en_US
dc.subject	feature binarization	en_US
dc.title	Rethinking Image Compression for the Object Detection Task	en_US
dc.type	Thesis	en_US
dc.type.material	Text	en_US
thesis.degree.department	Electrical and Computer Engineering	en_US
thesis.degree.discipline	Engineering	en_US
thesis.degree.grantor	Rice University	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science	en_US

Files

Original bundle

Name: BARUA-DOCUMENT-2015.pdf
Size: 4.03 MB
Format: Adobe Portable Document Format

License bundle

Name: PROQUEST_LICENSE.txt
Size: 5.84 KB
Format: Plain Text

Name: LICENSE.txt
Size: 2.61 KB
Format: Plain Text

Collections

Rice University Theses and Dissertations
ECE Theses and Dissertations