Rethinking Image Compression for the Object Detection Task

dc.contributor.advisorVeeraraghavan, Ashok
dc.contributor.committeeMemberBaraniuk, Richard
dc.contributor.committeeMemberShrivastava, Anshumali
dc.creatorBarua, Souptik
dc.date.accessioned2016-01-06T20:30:30Z
dc.date.available2016-01-06T20:30:30Z
dc.date.created2015-12
dc.date.issued2015-12-03
dc.date.submittedDecember 2015
dc.date.updated2016-01-06T20:30:30Z
dc.description.abstractTraditionally, image compression algorithms, such as JPEG, have been designed for human viewers' satisfaction. Increasingly however, more and more images are being viewed by computers, for performing computer vision tasks such as object detection. Image compression and object detection have largely been independent areas of research so far. However, several applications such as surveillance and medical imaging impose severe bandwidth and power restrictions. These constraints make the quality and/or size of the compressed image a critical factor in object detection performance. My works presents three compressed image representations that enable fast and accurate object detection. The first representation is a saliency guided wavelet representation which modifies traditional wavelet compression using the knowledge of saliency to improve both compression and detection performance compared to JPEG images. The second representation, called event stream representation, comes directly from the new DVS sensor which has ultra-low bandwidth and power requirements. We show, for the first time, high speed video reconstruction, and direct detection, on the event data. We achieve detection performance comparable to that on conventional JPEG images. Finally, we explore an abstract compressed representation called patch-wise binary representation, which represents an image (patch-wise) as a collection of short binary strings. We demonstrate two ways of generating these binary strings, called hashing and feature binarization, which enable 10x faster detection. We show promising detection and reconstruction results for both these approaches.
dc.format.mimetypeapplication/pdf
dc.identifier.citationBarua, Souptik. "Rethinking Image Compression for the Object Detection Task." (2015) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/87714">https://hdl.handle.net/1911/87714</a>.
dc.identifier.urihttps://hdl.handle.net/1911/87714
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectImage compression
dc.subjectobject detection
dc.subjectwavelet
dc.subjectDVS sensor
dc.subjectvideo reconstruction
dc.subjecthashing
dc.subjectfeature binarization
dc.titleRethinking Image Compression for the Object Detection Task
dc.typeThesis
dc.type.materialText
thesis.degree.departmentElectrical and Computer Engineering
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
BARUA-DOCUMENT-2015.pdf
Size:
4.03 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: