Multi-target C++ implementation of parallel skeletons - Archive ouverte HAL Access content directly
Conference Papers Year : 2009

Multi-target C++ implementation of parallel skeletons

Wilfried Kirschenmann
EDF
Laurent Plagne
  • Function : Author
EDF

Abstract

This paper presents the design of an efficient multi-target (CPU+GPU) implementation for the Parallel_for skeleton. Emerging massively parallel architectures promise very high performances for a low cost. However, these architectures change faster than ever. Thus, optimization of codes becomes a very complex and time consumming task. We have identified the data storage as the main difference between the CPU and the GPU implementation of a code. We introduce an abstract data layout in order to adapt the data storage. Based on this layout, the utilization of Parallel_for skeleton allows to compile and execute the same program both on CPU and on GPU. Once compiled, the program runs close to the hardware limits.

Dates and versions

hal-00437542 , version 1 (30-11-2009)

Identifiers

Cite

Wilfried Kirschenmann, Laurent Plagne, Stéphane Vialle. Multi-target C++ implementation of parallel skeletons. 8th workshop on Parallel/High-Performance Object-Oriented Scientific Computing - POOSC'09, Jul 2009, Genova, Italy. pp.1-10, ⟨10.1145/1595655.1595662⟩. ⟨hal-00437542⟩
150 View
0 Download

Altmetric

Share

Gmail Facebook X LinkedIn More