Efficient runtime placement management for high performance and reliability in COTS FPGAs
dc.contributor.advisor
Arslan, Tughrul
en
dc.contributor.advisor
Hamilton, Alister
en
dc.contributor.author
Enemali, Godwin Ilemona
en
dc.date.accessioned
2019-07-10T13:10:03Z
dc.date.available
2019-07-10T13:10:03Z
dc.date.issued
2019-07-03
dc.description.abstract
Designing high-performance, fault-tolerant multisensory electronic systems for
hostile environments such as nuclear plants and outer space within the constraints of
cost, power and flexibility is challenging. Issues such as ionizing radiation, extreme
temperature and ageing can lead to faults in the electronics of these systems. In
addition, the remote nature of these environments demands a level of flexibility and
autonomy in their operations. The standard practice of using specially hardened
electronic devices for such systems is not only very expensive but also has limited
flexibility.
This thesis proposes novel techniques that promote the use of Commercial Off-The-
Shelf (COTS) reconfigurable devices to meet the challenges of high-performance
systems for hostile environments. Reconfigurable hardware such as Field
Programmable Gate Arrays (FPGA) have a unique combination of flexibility and
high performance. The flexibility offered through features such as dynamic partial
reconfiguration (DPR) can be harnessed not only to achieve cost-effective designs as
a smaller area can be used to execute multiple tasks, but also to improve the
reliability of a system as a circuit on one portion of the device can be physically
relocated to another portion in the case of fault occurrence. However, to harness
these potentials for high performance and reliability in a cost-effective manner, novel
runtime management tools are required. Most runtime support tools for
reconfigurable devices are based on ideal models which do not adequately consider
the limitations of realistic FPGAs, in particular modern FPGAs which are
increasingly heterogeneous. Specifically, these tools lack efficient mechanisms for
ensuring a high utilization of FPGA resources, including the FPGA area and the
configuration port and clocking resources, in a reliable manner.
To ensure high utilization of reconfigurable device area, placement management is a
key aspect of these tools. This thesis presents novel techniques for the management
of hardware task placement on COTS reconfigurable devices for high performance
and reliability. To this end, it addresses design-time issues that affect efficient
hardware task placement, with a focus on reliability. It also presents techniques to
maximize the utilization of the FPGA area in runtime, including techniques to
minimize fragmentation. Fragmentation leads to the creation of unusable areas due to
dynamic placement of tasks and the heterogeneity of the resources on the chip.
Moreover, this thesis also presents an efficient task reuse mechanism to improve the
availability of the internal configuration infrastructure of the FPGA for critical
responsibilities like error mitigation. The task reuse scheme, unlike previous
approaches, also improves the utilization of the chip area by offering
defragmentation.
Task relocation, which involves changing the physical location of circuits is a
technique for error mitigation and high performance. Hence, this thesis also provides
a functionality-based relocation mechanism for improving the number of locations to
which tasks can be relocated on heterogeneous FPGAs. As tasks are relocated, clock
networks need to be routed to them. As such, a reliability-aware technique of clock
network routing to tasks after placement is also proposed.
Finally, this thesis offers a prototype implementation and characterization of a
placement management system (PMS) which is an integration of the aforementioned
techniques. The performance of most of the proposed techniques are tested using
data processing tasks of a NASA JPL spectrometer application. The results show that
the proposed techniques have potentials to improve the reliability and performance of
applications in hostile environment compared to state-of-the-art techniques. The task
optimization technique presented leads to better capacity to circumvent permanent
faults on COTS FPGAs compared to state-of-the-art approaches (48.6% more errors
were circumvented for the JPL spectrometer application). The proposed task reuse
scheme leads to approximately 29% saving in the amount of configuration time. This
frees up the internal configuration interface for more error mitigation operations. In
addition, the proposed PMS has a worst-case latency of less than 50% of that of state-of-
the-art runtime placement systems, while maintaining the same level of placement
quality and resource overhead.
en
dc.identifier.uri
http://hdl.handle.net/1842/35720
dc.language.iso
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
G. Enemali, A. Adetomi, G. Seetharaman and T. Arslan, “A Functionality-Based Runtime Relocation System for Circuits on Heterogeneous FPGAs,” IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 65, no. 5, pp. 612–616, May 2018.q
en
dc.relation.hasversion
G. Enemali, A. Adetomi, and T. Arslan, "FAReP: Fragmentation-Aware Replacement Policy for Task Reuse on Reconfigurable FPGAs", in 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 2017, pp. 202 – 206, 10.1109/IPDPSW.2017.153.
en
dc.relation.hasversion
G. Enemali, A. Adetomi, and T. Arslan, "A Placement Management Circuit for Efficient Realtime Hardware Reuse on FPGAs Targeting Reliable Autonomous Systems", in 2017 IEEE International Symposium on Circuit and Systems (ISCAS 2017), 2017, pp. 2030 – 2033, 10.1109/ISCAS.2017.8050796
en
dc.relation.hasversion
G. Enemali, A. Adetomi, and T. Arslan, "Expanding the Un-usable Area Strategy for Improved Utilization of Reconfigurable FPGAs", in 2017 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), 2017, 10.1109/AHS.2017.8046370.
en
dc.relation.hasversion
G. Enemali, A. Adetomi, and T. Arslan, "Efficient Runtime Frame ECC Recomputation for Reliable Task Execution on Xilinx FPGAs ", in 2018 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), 2018, pp. 59 – 65. 10.1109/AHS.2018.8541471
en
dc.relation.hasversion
A. Adetomi, G. Enemali, and T. Arslan, "Relocation-Aware Communication Network for Circuits on Xilinx FPGAs”, in 2017 International Conference on Field Programmable Logic and Applications (FPL), 2017, pp. 1-7, 10.23919/FPL.2017.8056818.
en
dc.relation.hasversion
A. Adetomi, G. Enemali, and T. Arslan, "A Fault-Tolerant ICAP Controller with a Selective-Area Soft Error Mitigation Engine", in 2017 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), 2017, pp. 192-199, 10.1109/AHS.2017.8046378.
en
dc.relation.hasversion
A. Adetomi, G. Enemali, and T. Arslan, “R3TOS-Based Integrated Modular Space Avionics for On-Board Real-Time Data Processing,” in 2018 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), 2018. Pp. 1- 8. 10.1109/AHS.2018.8541369
en
dc.subject
FPGA
en
dc.subject
reliability
en
dc.subject
high performance
en
dc.title
Efficient runtime placement management for high performance and reliability in COTS FPGAs
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en
Files
Original bundle
1 - 1 of 1
- Name:
- Enemali2019.pdf
- Size:
- 3.61 MB
- Format:
- Adobe Portable Document Format
This item appears in the following Collection(s)

