Hierarchical Pipeline Optimization of Coarse Grained Reconfigurable Processor for Multimedia Applications

The major bottleneck of coarse-grained reconfigurable arrays (CGRAs) is excessive configuration overhead, which prevents their computing potential from being fully utilized. At run time, the function of a CGRA can be fully and dynamically reconfigured by changing contexts, so the frequency of context switching is very high, while the configuration time of a CGRA is very long. This paper proposes three configuration approaches to reduce the interval latency when switching configuration contexts: input data relocation (IDR), line-based context switching (LCS), and loop interval minimization (LIM). IDR relocates input data to the first stage of the pipeline, which reduces the delay time for the input data of the next data flow graph (DFG). LCS is a line-based context switching mechanism for adjacent independent DFGs that reduces the interval of context switching, thereby expanding the depth of the pipeline. LIM minimizes the interval of loops. Simulations on a coarse-grained reconfigurable processor called reconfigurable multimedia system (REMUS) show that 1080p@30 fps H.264 high profile video decoding can be achieved at a working frequency of 200 MHz. For the AVS and MPEG2 decoding algorithms, higher performance of 1080p@39 fps and 1080p@41 fps, respectively, can be achieved.
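To put the reported frame rates in perspective, the following minimal sketch converts a clock frequency and a frame rate into a cycle budget per frame and per macroblock. It is an illustration only, not part of the paper: the function name `cycle_budget` is hypothetical, the 16x16 macroblock size and the rounding of 1080 lines up to 68 macroblock rows are assumptions taken from common video-coding practice, and applying the 200 MHz clock to the AVS and MPEG2 figures is also an assumption, since the abstract states that frequency only for H.264.

```python
def cycle_budget(clock_hz, fps, width=1920, height=1080, mb_size=16):
    """Return (macroblocks per frame, cycles per frame, cycles per macroblock).

    Assumes the whole clock budget is spent decoding and that frames are
    padded up to a whole number of mb_size x mb_size macroblocks.
    """
    mbs_x = -(-width // mb_size)    # ceiling division
    mbs_y = -(-height // mb_size)   # 1080 lines round up to 68 rows
    mbs_per_frame = mbs_x * mbs_y
    cycles_per_frame = clock_hz / fps
    return mbs_per_frame, cycles_per_frame, cycles_per_frame / mbs_per_frame


if __name__ == "__main__":
    # Frame rates quoted in the abstract; the shared 200 MHz clock is assumed.
    for name, fps in [("H.264 high profile", 30), ("AVS", 39), ("MPEG2", 41)]:
        mbs, per_frame, per_mb = cycle_budget(200e6, fps)
        print(f"{name}: {mbs} MBs/frame, "
              f"{per_frame:,.0f} cycles/frame, {per_mb:.0f} cycles/MB")
```

Under these assumptions, a 1080p frame holds 8160 macroblocks, so 30 fps at 200 MHz leaves roughly 817 cycles per macroblock. A budget of that size suggests why the interval between configuration contexts matters.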


A Coarse-Grained Reconfigurable Processor for Sequencing and Phylogenetic Algorithms in Bioinformatics

2011 International Conference on Reconfigurable Computing and FPGAs, 10.1109/reconfig.2011.1, 2011

Cited By ~ 3

Author(s): Pei Liu, Fatemeh O. Ebrahim, Ahmed Hemani, Kolin Paul

Keyword(s): Coarse Grained, Reconfigurable Processor


Throughput/Resource-Efficient Reconfigurable Processor for Multimedia Applications

IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 10.1109/tvlsi.2012.2206063, 2013, Vol 21 (7), pp. 1346-1350

Cited By ~ 1

Author(s): Sohan Purohit, Sai Rahul Chalamalasetti, Martin Margala, Wim Vanderbauwhede

Keyword(s): Multimedia Applications, Reconfigurable Processor


An Efficient Implementation of Advanced-Encryption-Standard Based on a Coarse-Grained Reconfigurable Processor

10.22323/1.259.0066, 2015

Author(s): Leibo Liu, Le Chang

Keyword(s): Efficient Implementation, Coarse Grained, Advanced Encryption Standard, Reconfigurable Processor


CHiPReP—A Compiler for the HiPReP High-Performance Reconfigurable Processor

Electronics, 10.3390/electronics10212590, 2021, Vol 10 (21), pp. 2590

Author(s): Markus Weinhardt, Mohamed Messelka, Philipp Käsgen

Keyword(s): High Performance, Optimization Method, Coarse Grained, Splitting Algorithm, Reconfigurable Processor, Routing Optimization, Integrated Placement, Reconfigurable Array, Address Generator, Scheduling Heuristic

This article presents CHiPReP, a C compiler for the HiPReP processor, which is a high-performance Coarse-Grained Reconfigurable Array employing Floating-Point Units. CHiPReP is an extension of the LLVM and CCF compiler frameworks. Its main contributions are (i) a Splitting Algorithm for Data Dependence Graphs, which distributes the computations of a C loop to Address-Generator Units and Processing Elements; (ii) a novel instruction clustering and scheduling heuristic; and (iii) an integrated placement, pipeline balancing and routing optimization method based on Simulated Annealing. The compiler was verified and analyzed using a cycle-accurate HiPReP simulation model.
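As a rough illustration of what placement by Simulated Annealing can look like, the sketch below places the nodes of a small data-flow graph onto a grid of processing elements and tries to shorten the total Manhattan distance between connected nodes. It is a generic textbook-style annealer and not CHiPReP's method: the cost function, the move set, the cooling schedule, and all names (`wirelength`, `anneal_placement`, the example graph) are assumptions made for this example, and pipeline balancing and routing are left out entirely.

```python
import math
import random


def wirelength(placement, edges):
    """Sum of Manhattan distances between the grid cells of connected nodes."""
    return sum(abs(placement[a][0] - placement[b][0]) +
               abs(placement[a][1] - placement[b][1]) for a, b in edges)


def anneal_placement(nodes, edges, rows, cols,
                     steps=2000, t0=2.0, alpha=0.995, seed=0):
    """Return (best placement, best cost, initial cost) for a toy grid placement."""
    rng = random.Random(seed)
    cells = [(r, c) for r in range(rows) for c in range(cols)]
    if len(nodes) > len(cells):
        raise ValueError("more nodes than grid cells")
    rng.shuffle(cells)
    placement = dict(zip(nodes, cells))   # random initial placement
    free = cells[len(nodes):]             # unoccupied cells
    cost = initial_cost = wirelength(placement, edges)
    best, best_cost = dict(placement), cost
    temp = t0
    for _ in range(steps):
        candidate, new_free = dict(placement), list(free)
        a = rng.choice(nodes)
        if new_free and rng.random() < 0.5:
            # Move node a to a free cell and release its old cell.
            i = rng.randrange(len(new_free))
            candidate[a], new_free[i] = new_free[i], placement[a]
        else:
            # Swap the cells of two nodes.
            b = rng.choice(nodes)
            candidate[a], candidate[b] = placement[b], placement[a]
        delta = wirelength(candidate, edges) - cost
        # Always accept improvements; accept worse moves with Boltzmann probability.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            placement, free, cost = candidate, new_free, cost + delta
            if cost < best_cost:
                best, best_cost = dict(placement), cost
        temp *= alpha                     # geometric cooling
    return best, best_cost, initial_cost


if __name__ == "__main__":
    nodes = ["ld_a", "ld_b", "mul", "add", "st"]   # hypothetical loop body
    edges = [("ld_a", "mul"), ("ld_b", "mul"), ("mul", "add"), ("add", "st")]
    best, best_cost, initial_cost = anneal_placement(nodes, edges, rows=3, cols=3)
    print("initial cost:", initial_cost, "best cost:", best_cost)
    print(best)
```

The annealer keeps the best placement seen so far, so the returned cost is never worse than the random starting point. A real CGRA flow would add routing and timing terms to the cost.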


Efficient Evaluation of Power/Area/Latency Design Trade-Offs for Coarse-Grained Reconfigurable Processor Arrays

Journal of Low Power Electronics, 10.1166/jolpe.2011.1114, 2011, Vol 7 (1), pp. 29-40

Cited By ~ 1

Author(s): Dmitrij Kissler, Frank Hannig, Jürgen Teich

Keyword(s): Coarse Grained, Reconfigurable Processor, Processor Arrays, Trade Offs


A data-flow graph generation algorithm for a coarse-grained reconfigurable processor

An efficient implementation of Motion Compensation for AVS HD application based on a coarse-grained reconfigurable processor

Automatic contexts switch in loop pipeline for embedded coarse-grained reconfigurable processor

A mapping algorithm for embedded coarse-grained reconfigurable processor

Configuration Approaches to Enhance Computing Efficiency of Coarse-Grained Reconfigurable Array
