Data-parallel Digital Signal Processors: Algorithm Mapping, Architecture Scaling and Workload Adaptation

dc.citation.bibtexNamephdthesisen_US
dc.citation.journalTitlePh.D. Thesisen_US
dc.citation.locationHouston, TXen_US
dc.contributor.orgCenter for Multimedia Communications (http://cmc.rice.edu/)en_US
dc.creatorRajagopal, Sridhar
dc.date.accessioned2007-10-31T00:59:48Z
dc.date.available2007-10-31T00:59:48Z
dc.date.issued2004-05-01
dc.date.modified2004-08-30en_US
dc.date.submitted2004-03-21en_US
dc.descriptionPhD Thesisen_US
dc.description.abstractEmerging applications such as high definition television (HDTV), streaming video, image processing in embedded applications and signal processing in high-speed wireless communications are driving a need for high performance digital signal processors (DSPs) with real-time processing. This class of applications demonstrates significant data parallelism, finite precision, need for power-efficiency and the need for 100's of arithmetic units in the DSP to meet real-time requirements. Data-parallel DSPs meet these requirements by employing clusters of functional units, enabling 100's of computations every clock cycle. These DSPs exploit instruction level parallelism and subword parallelism within clusters, similar to a traditional VLIW (Very Long Instruction Word) DSP, and exploit data parallelism across clusters, similar to vector processors. Stream processors are data-parallel DSPs that use a bandwidth hierarchy to support dataflow to 100's of arithmetic units and are used for evaluating the contributions of this thesis. Different software realizations of the dataflow in the algorithms can affect the performance of stream processors by greater than an order-of-magnitude. The thesis first presents the design of signal processing algorithms that map efficiently on stream processors by parallelizing the algorithms and by re-ordering the flow of data. The design space for stream processors also exhibits trade-offs between arithmetic units per cluster, clusters and the clock frequency to meet the real-time requirements of a given application. This thesis provides a design space exploration tool for stream processors that meets real-time requirements while minimizing power consumption. The presented exploration methodology rapidly searches this design space at compile time to minimize power consumption and selects the number of adders, multipliers, clusters and the real-time clock frequency in the processor. Finally, the thesis improves the power efficiency in the designed stream processor by adapting the compute resources to run-time variations in the workload. The thesis presents an adaptive multiplexer network that allows the number of active clusters to be varied during run-time by turning off unused clusters. Thus, by efficient mapping of algorithms, exploring the architecture design space, and by compute resource adaptation, this thesis improves power efficiency in stream processors and enhances their suitability for high performance, power-aware, signal processing applications.en_US
dc.description.sponsorshipNokiaen_US
dc.description.sponsorshipNokia/Texas Instrumentsen_US
dc.identifier.citation "Data-parallel Digital Signal Processors: Algorithm Mapping, Architecture Scaling and Workload Adaptation," <i>Ph.D. Thesis,</i> 2004.
dc.identifier.urihttps://hdl.handle.net/1911/20239
dc.language.isoeng
dc.relation.projecthttp://www.ece.rice.edu/~sridharen_US
dc.subjectDSP architectures*
dc.subjectstream processors*
dc.subjectlow power*
dc.subjectwireless communications*
dc.subject.keywordDSP architecturesen_US
dc.subject.keywordstream processorsen_US
dc.subject.keywordlow poweren_US
dc.subject.keywordwireless communicationsen_US
dc.titleData-parallel Digital Signal Processors: Algorithm Mapping, Architecture Scaling and Workload Adaptationen_US
dc.typeThesis
dc.type.dcmiText
thesis.degree.levelDoctoral
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Raj2004Mar3Dataparall.PDF
Size:
994.45 KB
Format:
Adobe Portable Document Format