Increasing the efficacy of automated instruction set extension

Bennett, Richard Vincent

Increasing the efficacy of automated instruction set extension

Simple item page

dc.contributor.advisor

Topham, Nigel

en

dc.contributor.advisor

Franke, Bjorn

en

dc.contributor.author

Bennett, Richard Vincent

en

dc.contributor.sponsor

Engineering and Physical Sciences Research Council (EPSRC)

en

dc.date.accessioned

2012-01-25T15:08:40Z

dc.date.available

2012-01-25T15:08:40Z

dc.date.issued

2011-11-24

dc.description.abstract

The use of Instruction Set Extension (ISE) in customising embedded processors for a specific application has been studied extensively in recent years. The addition of a set of complex arithmetic instructions to a baseline core has proven to be a cost-effective means of meeting design performance requirements. This thesis proposes and evaluates a reconfigurable ISE implementation called “Configurable Flow Accelerators” (CFAs), a number of refinements to an existing Automated ISE (AISE) algorithm called “ISEGEN”, and the effects of source form on AISE. The CFA is demonstrated repeatedly to be a cost-effective design for ISE implementation. A temporal partitioning algorithm called “staggering” is proposed and demonstrated on average to reduce the area of CFA implementation by 37% for only an 8% reduction in acceleration. This thesis then turns to concerns within the ISEGEN AISE algorithm. A methodology for finding a good static heuristic weighting vector for ISEGEN is proposed and demonstrated. Up to 100% of merit is shown to be lost or gained through the choice of vector. ISEGEN early-termination is introduced and shown to improve the runtime of the algorithm by up to 7.26x, and 5.82x on average. An extension to the ISEGEN heuristic to account for pipelining is proposed and evaluated, increasing acceleration by up to an additional 1.5x. An energyaware heuristic is added to ISEGEN, which reduces the energy used by a CFA implementation of a set of ISEs by an average of 1.6x, up to 3.6x. This result directly contradicts the frequently espoused notion that “bigger is better” in ISE. The last stretch of work in this thesis is concerned with source-level transformation: the effect of changing the representation of the application on the quality of the combined hardwaresoftware solution. A methodology for combined exploration of source transformation and ISE is presented, and demonstrated to improve the acceleration of the result by an average of 35% versus ISE alone. Floating point is demonstrated to perform worse than fixed point, for all design concerns and applications studied here, regardless of ISEs employed.

en

dc.identifier.uri

http://hdl.handle.net/1842/5789

dc.language.iso

en

dc.publisher

The University of Edinburgh

en

dc.relation.hasversion

R.V. Bennett, A.C. Murray, B. Franke, and N. Topham “Combining source-to-source transformations and processor instruction set extension for the automated design-space exploration of embedded systems”. In: Proceedings of Languages Compilers and Technology for Embedded Systems (LCTES), 2007.

en

dc.relation.hasversion

O. Almer, R.V. Bennett, I. B¨ohm, A.C. Murray, X. Qu, M. Zuluaga, B. Franke and N.P. Topham “An End-to-End Design Flow for Automated Instruction Set Extension and Complex Instruction Selection based on GCC”. In: Proceedings of the 1st International Workshop on GCC Research Opportunities (GROW), 2009.

en

dc.subject

automated synthesis

en

dc.subject

computer architecture

en

dc.subject

instruction set extension

en

dc.title

Increasing the efficacy of automated instruction set extension

en

dc.type

Thesis or Dissertation

en

dc.type.qualificationlevel

Doctoral

en

dc.type.qualificationname

PhD Doctor of Philosophy

en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Bennett2011.pdf
Size:: 4.92 MB
Format:: Adobe Portable Document Format
Description:

Download

This item appears in the following Collection(s)

Informatics thesis and dissertation collection