

Author: Vijaykumar T.N. Sohi G.S.
Publisher: Academic Press
ISSN: 0743-7315
Source: Journal of Parallel and Distributed Computing, Vol.58, Iss.2, 1999-08, pp. : 132-158
Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.
Abstract
The multiscalar architecture advocates a distributed processor organization and task-level speculation to exploit high degrees of instruction level parallelism (ILP) in sequential programs without impeding improvements in clock speeds. The main goal of this paper is to understand the key implications of the architectural features of distributed processor organization and task-level speculation for compiler task selection from the point of view of performance. We identify the fundamental performance issues to be: control flow speculation, data communication, data dependence speculation, load imbalance, and task overhead. We show that these issues are intimately related to a few key characteristics of tasks: task size, intertask control flow, and intertask data dependence. We describe compiler heuristics to select tasks with favorable characteristics. We report experimental results to show that the heuristics are successful in boosting overall performance by establishing larger ILP windows. We also present a breakdown of execution times to show that register wait, load imbalance, control flow squash, and conventional pipeline losses are significant for almost all the SPEC95 benchmarks.
Related content










By Edwards PJ Barnard L Leonard VK Yi JS Moloney KP Kongnakorn T Jacko JA Sainfort F
Behaviour and Information Technology, Vol. 24, Iss. 3, 2005-05 ,pp. :