Edinburgh Research Archive

Efficient runtime placement management for high performance and reliability in COTS FPGAs

dc.contributor.advisor
Arslan, Tughrul
en
dc.contributor.advisor
Hamilton, Alister
en
dc.contributor.author
Enemali, Godwin Ilemona
en
dc.date.accessioned
2019-07-10T13:10:03Z
dc.date.available
2019-07-10T13:10:03Z
dc.date.issued
2019-07-03
dc.description.abstract
Designing high-performance, fault-tolerant multisensory electronic systems for hostile environments such as nuclear plants and outer space within the constraints of cost, power and flexibility is challenging. Issues such as ionizing radiation, extreme temperature and ageing can lead to faults in the electronics of these systems. In addition, the remote nature of these environments demands a level of flexibility and autonomy in their operations. The standard practice of using specially hardened electronic devices for such systems is not only very expensive but also has limited flexibility. This thesis proposes novel techniques that promote the use of Commercial Off-The- Shelf (COTS) reconfigurable devices to meet the challenges of high-performance systems for hostile environments. Reconfigurable hardware such as Field Programmable Gate Arrays (FPGA) have a unique combination of flexibility and high performance. The flexibility offered through features such as dynamic partial reconfiguration (DPR) can be harnessed not only to achieve cost-effective designs as a smaller area can be used to execute multiple tasks, but also to improve the reliability of a system as a circuit on one portion of the device can be physically relocated to another portion in the case of fault occurrence. However, to harness these potentials for high performance and reliability in a cost-effective manner, novel runtime management tools are required. Most runtime support tools for reconfigurable devices are based on ideal models which do not adequately consider the limitations of realistic FPGAs, in particular modern FPGAs which are increasingly heterogeneous. Specifically, these tools lack efficient mechanisms for ensuring a high utilization of FPGA resources, including the FPGA area and the configuration port and clocking resources, in a reliable manner. To ensure high utilization of reconfigurable device area, placement management is a key aspect of these tools. This thesis presents novel techniques for the management of hardware task placement on COTS reconfigurable devices for high performance and reliability. To this end, it addresses design-time issues that affect efficient hardware task placement, with a focus on reliability. It also presents techniques to maximize the utilization of the FPGA area in runtime, including techniques to minimize fragmentation. Fragmentation leads to the creation of unusable areas due to dynamic placement of tasks and the heterogeneity of the resources on the chip. Moreover, this thesis also presents an efficient task reuse mechanism to improve the availability of the internal configuration infrastructure of the FPGA for critical responsibilities like error mitigation. The task reuse scheme, unlike previous approaches, also improves the utilization of the chip area by offering defragmentation. Task relocation, which involves changing the physical location of circuits is a technique for error mitigation and high performance. Hence, this thesis also provides a functionality-based relocation mechanism for improving the number of locations to which tasks can be relocated on heterogeneous FPGAs. As tasks are relocated, clock networks need to be routed to them. As such, a reliability-aware technique of clock network routing to tasks after placement is also proposed. Finally, this thesis offers a prototype implementation and characterization of a placement management system (PMS) which is an integration of the aforementioned techniques. The performance of most of the proposed techniques are tested using data processing tasks of a NASA JPL spectrometer application. The results show that the proposed techniques have potentials to improve the reliability and performance of applications in hostile environment compared to state-of-the-art techniques. The task optimization technique presented leads to better capacity to circumvent permanent faults on COTS FPGAs compared to state-of-the-art approaches (48.6% more errors were circumvented for the JPL spectrometer application). The proposed task reuse scheme leads to approximately 29% saving in the amount of configuration time. This frees up the internal configuration interface for more error mitigation operations. In addition, the proposed PMS has a worst-case latency of less than 50% of that of state-of- the-art runtime placement systems, while maintaining the same level of placement quality and resource overhead.
en
dc.identifier.uri
http://hdl.handle.net/1842/35720
dc.language.iso
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
G. Enemali, A. Adetomi, G. Seetharaman and T. Arslan, “A Functionality-Based Runtime Relocation System for Circuits on Heterogeneous FPGAs,” IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 65, no. 5, pp. 612–616, May 2018.q
en
dc.relation.hasversion
G. Enemali, A. Adetomi, and T. Arslan, "FAReP: Fragmentation-Aware Replacement Policy for Task Reuse on Reconfigurable FPGAs", in 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 2017, pp. 202 – 206, 10.1109/IPDPSW.2017.153.
en
dc.relation.hasversion
G. Enemali, A. Adetomi, and T. Arslan, "A Placement Management Circuit for Efficient Realtime Hardware Reuse on FPGAs Targeting Reliable Autonomous Systems", in 2017 IEEE International Symposium on Circuit and Systems (ISCAS 2017), 2017, pp. 2030 – 2033, 10.1109/ISCAS.2017.8050796
en
dc.relation.hasversion
G. Enemali, A. Adetomi, and T. Arslan, "Expanding the Un-usable Area Strategy for Improved Utilization of Reconfigurable FPGAs", in 2017 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), 2017, 10.1109/AHS.2017.8046370.
en
dc.relation.hasversion
G. Enemali, A. Adetomi, and T. Arslan, "Efficient Runtime Frame ECC Recomputation for Reliable Task Execution on Xilinx FPGAs ", in 2018 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), 2018, pp. 59 – 65. 10.1109/AHS.2018.8541471
en
dc.relation.hasversion
A. Adetomi, G. Enemali, and T. Arslan, "Relocation-Aware Communication Network for Circuits on Xilinx FPGAs”, in 2017 International Conference on Field Programmable Logic and Applications (FPL), 2017, pp. 1-7, 10.23919/FPL.2017.8056818.
en
dc.relation.hasversion
A. Adetomi, G. Enemali, and T. Arslan, "A Fault-Tolerant ICAP Controller with a Selective-Area Soft Error Mitigation Engine", in 2017 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), 2017, pp. 192-199, 10.1109/AHS.2017.8046378.
en
dc.relation.hasversion
A. Adetomi, G. Enemali, and T. Arslan, “R3TOS-Based Integrated Modular Space Avionics for On-Board Real-Time Data Processing,” in 2018 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), 2018. Pp. 1- 8. 10.1109/AHS.2018.8541369
en
dc.subject
FPGA
en
dc.subject
reliability
en
dc.subject
high performance
en
dc.title
Efficient runtime placement management for high performance and reliability in COTS FPGAs
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en

Files

Original bundle

Now showing 1 - 1 of 1
Name:
Enemali2019.pdf
Size:
3.61 MB
Format:
Adobe Portable Document Format

This item appears in the following Collection(s)