C. G. Baker, C. Edwards, H. Heroux, M. A. Williams, and A. B. , A Light-weight API for Portable Multicore Programming, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, 2010.
DOI : 10.1109/PDP.2010.49

K. Czarnecki, J. T. Odonnell, J. Striegnitz, and T. Walid, DSL Implementation in MetaOCaml, Template Haskell, and C++, DSL Implementation in MetaOCaml, Template Haskell, and C++, pp.51-72, 2004.
DOI : 10.1007/978-3-540-25935-0_4

D. V. Dyk, M. Geveler, S. Mallach, D. Ribbrock, D. Göddeke et al., HONEI: A collection of libraries for numerical computations targeting multiple processor architectures, Computer Physics Communications, vol.180, issue.12, pp.2534-2543, 2009.

J. Falcou, J. Sérot, T. Chateau, and J. T. Lapresté, Quaff: efficient C++ design for parallel skeletons, Parallel Computing, vol.32, issue.7-8, pp.7-8, 2006.
DOI : 10.1016/j.parco.2006.06.001

URL : https://hal.archives-ouvertes.fr/hal-00167412

W. Kirschenmann, L. Plagne, S. Ploix, A. Ponçot, and S. Vialle, Massively Parallel Solving of 3D Simplified PN Equations on Graphic Processing Units, Proceedings of Mathematics, Computational Methods & Reactor Physics, 2009.

W. Kirschenmann, L. Plagne, and S. Vialle, Multi-target C++ implementation of parallel skeletons, Proceedings of the 8th workshop on Parallel/High-Performance Object-Oriented Scientific Computing, POOSC '09, 2009.
DOI : 10.1145/1595655.1595662

URL : https://hal.archives-ouvertes.fr/hal-00437542

W. Kirschenmann, L. Plagne, and S. Vialle, on Multi-Core CPUS and Many-Core GPUS, Transport Theory and Statistical Physics, vol.23, issue.1, pp.255-281, 2010.
DOI : 10.1080/00411450.2010.533741

URL : https://hal.archives-ouvertes.fr/hal-00556812

J. D. Mccalpin, Memory Bandwidth and Machine Balance in Current High Performance Computers, IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter pp, pp.19-25, 1995.

L. Plagne and A. Ponçot, Generic Programming for Deterministic Neutron Transport Codes, Proceedings of Mathematics and Computation, Supercomputing, Reactor Physics and Nuclear and Biological Applications. Palais des Papes, 2005.

J. Reinders, Intel threading building blocks, 2007.

T. L. Veldhuizen, Arrays in Blitz++, ISCOPE '98: Proceedings of the Second International Symposium on Computing in Object-Oriented Parallel Environments, pp.223-230, 1998.
DOI : 10.1007/3-540-49372-7_24