Conference paper. Year: 2016

Generic algorithmic scheme for 2D stencil applications on hybrid machines

Abstract

Hardware accelerators are classic scientific coprocessors in HPC machines. However, the number of CPU cores on the motherboard is increasing and now constitutes a non-negligible part of the machine's total computing power. Running an application on both an accelerator (such as a GPU or a Xeon-Phi device) and the CPU cores can therefore provide the highest performance. Moreover, it is now possible to include different accelerators in one machine in order to support and speed up a larger set of applications. Running each part of an application on its most suitable device makes it possible to reach high performance, and using all otherwise idle devices in the machine should improve the performance of that part even further. However, overlapping computations with inter-device data transfers is mandatory to limit the overhead of this approach, which leads to complex asynchronous algorithms and multi-paradigm optimized codes. This article introduces our research and experiments on cooperation between several CPUs, a GPU, and a Xeon-Phi accelerator, all included in the same machine.
Main file
article-arcs.pdf (318.16 KB)
Origin: Files produced by the author(s)
License: CC BY-ND (Attribution, No Derivatives)

Dates and versions

hal-01263242, version 1 (17-02-2024)

Identifiers

Cite

Stéphane Vialle, Sylvain Contassot-Vivier, Patrick Mercier. Generic algorithmic scheme for 2D stencil applications on hybrid machines. ARCS 2016 - Architecture of Computing Systems, Apr 2016, Nuremberg, Germany. ⟨10.1007/978-3-319-30695-7_9⟩. ⟨hal-01263242⟩