Typed Fusion with Applications to Parallel and Sequential Code Generation

dc.contributor.authorKennedy, Ken
dc.contributor.authorMcKinley, Kathryn S.
dc.date.accessioned2017-08-02T22:03:22Z
dc.date.available2017-08-02T22:03:22Z
dc.date.issued1994-01-01
dc.date.noteJanuary 1, 1994
dc.description.abstractLoop fusion is a program transformation that merges multiple loops into one and is an effective optimization both for increasing the granularity of parallel loops and for improving data locality. This paper introduces typed fusion, a formulation of loop fusion which captures the fusion and distribution problems encountered in sequential and parallel program optimization. Typed fusion is more general and applicable than previous work. We present a fast algorithm for typed fusion on a graph G = (N, E), where nodes represent loops, edges represent dependence constraints between loops, and each loop is assigned one of T distinct types. Only nodes of the same type may be fused. The asymptotic time bound for this algorithm is O((N + E)T). The fastest previous algorithm considered only one or two types, but was still O(NE) [KM93]. When T > 2 and there is no reason to prefer fusing one type over another, we prove the problem of finding a fusion with the fewest resultant loops to be NP-hard. Using typed fusion, we present fusion and distribution algorithms that improve data locality and a parallel code generation algorithm that incorporates compound transformations. We also give evidence of the effectiveness of this algorithm in practice.
dc.format.extent16 pp
dc.identifier.citationKennedy, Ken and McKinley, Kathryn S. "Typed Fusion with Applications to Parallel and Sequential Code Generation." (1994) https://hdl.handle.net/1911/96439.
dc.identifier.digitalTR94-221
dc.identifier.urihttps://hdl.handle.net/1911/96439
dc.language.isoeng
dc.rightsYou are granted permission for the noncommercial reproduction, distribution, display, and performance of this technical report in any format, but this permission is only for a period of forty-five (45) days from the most recent time that you verified that this technical report is still available from the Computer Science Department of Rice University under terms that include this permission. All other rights are reserved by the author(s).
dc.titleTyped Fusion with Applications to Parallel and Sequential Code Generation
dc.typeTechnical report
dc.type.dcmiText
Files
Original bundle
Name:
TR94-221.pdf
Size:
438.8 KB
Format:
Adobe Portable Document Format