.: Core Research Activities in Reconfigurable Computing Laboratory

 


1. Application Specific Hybrid Grained Reconfigurable Computing

Processing Approaches
General Purpose Processor
Field
Programmable
Gate Array
(FPGA)
Coarse
Grained
Reconfigurable
Architecture
Application
Specific
Hybrid Coarse
Grained
Reconfigurable
Architecture
Application
Specific
Integrated
Circuit
(ASIC)

Students: Ruchika Verma, Yang Song, Deepak Sreedharan (graduated, May 2008 now at TI, Texas), David Montgomery (graduated, May 2007)

Physical limitations have constrained the evolution of reconfigurable computing through generic and symmetrical architectures. As logic density of field programmable gate arrays (FPGAs) is outpacing Moore's law, reconfigurable computing platforms will have the capability to become self-contained, high-end computers. Reconfigurable processors like FPGAs have resulted with significant performance improvements in many applications such as video processing, cryptography and scientific algorithms because of their inherently parallel architecture. However, there is a tradeoff between the complexity of logic blocks and area efficiency in a reconfigurable processor. For fine-grained architectures (e.g. generic FPGAs), more logic blocks will be required to implement the circuit and routing area becomes excessive. Coarse-grained architectures (e.g. RAW[2], Garp, ChESS, DReAM PipeRench) decrease the total number of logic blocks and hence interconnect complexity is reduced by localizing the connections. However, coarse-grained architectures may suffer from under-utilization of logic resources because it is difficult to fully utilize the functionality provided within each processing element. To overcome the drawbacks of generic reconfigurable architectures, there is a need for exploiting parallelism at multiple levels under a single architecture through the combined use of both coarse-grain and fine-grain functional blocks. Such architecture will have processing elements that are heterogeneous in nature and will have the reconfigurability to switch between applications within that domain.

H.264
Motion estimation is the most compute intensive routine of H.264 video compression standard. In this project we introduce an application-specific hybrid coarse grained reconfigurable architecture with an intelligent network on chip (NoC) mechanism. This is the first reconfigurable architecture with NoC in the academia which supports both variable block size and multiple motion vector search algorithms (full, diamond, hexagon, big hexagon and spiral ) requirement of H.264 video codec. Our model is a hierarchical hybrid processing element based 2D architecture which supports reuse of data between the processing elements through NoC routers in order to reduce memory transactions. Our approach is based on a simple design principle which utilizes a high-level of parallelism with intensive data reuse. Results show that our design reduces the gate count by 7x compared to its ASIC counterpart which only supports full search method. Moreover, the design operates at a frequency comparable to ASIC based implementation while sustaining 30fps and more (HDTV) which is not achievable with existing architecture.

Publications:

  • R. Verma, A. Akoglu, "A Coarse Grained and Hybrid Reconfigurable Architecture With Flexible NOC Router for Variable Block Size Motion Estimation", IEEE International Parallel & Distributed Processing Symposium , Miami, Florida, USA, April 14-15, 2008
  • R. Verma and A. Akoglu, "A coarse grained reconfigurable architecture for variable size block motion estimation", IEEE International Conference on Field-Programmable Technology 2007 (ICFPT'07), pp. 81-88, December 12 -14, 2007, Kitakyushu, Japan

Cryptography
The IP Security Protocol (IPSec) offers enoromous flexibility to the users to decide upon the choice of encryption and hashing algorithms. Software-only implementations act as a bottleneck in sustaining the throughput of a high-speed network due to high computation density of cryptographic operations. Hardware implementations could have different ASIC units (one for each algorithm) implemented and activated based on the needs of the protocol. However this would result in huge area overhead. Hence, there is a need for application-specific reconfigurable hybrid architecture for this algorithm domain.
The computation pattern profiler tool is developed to extract the recurring instruction sequences which then form the processing elements of the target architecture. We apply this toolset onto the three encryption algorithms (Blowfish, T-DES, and DES) and three hashing algorithms (MD5, SHA-1, RIPEMD-160). We then develop two new hybrid architectures specifically tailored to the computation characteristics of the target algorithms. This is a bottom-up approach to derive the architecture as opposed to modeling it at high level with System-C like environment.

Publications:

  • D. Sreedharan, A. Akoglu, "A Hybrid Processing element Based Reconfigurable Architecture for Hashing Algorithms", IEEE International Parallel & Distributed Processing Symposium Miami, Florida, USA, April 14-15, 2008
  • D. Montgomery, A. Akoglu, "Methodology and Toolset for ASIP Design and Development Targeting Cryptography-Based Applications", IEEE International Conference on Application -Specific Systems, Architectures and Processors, (ASAP 2007), 9-11 July 2007 Page(s):365 - 370, Montréal, Québec, Canada