Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Massive Data Parallel Computational Framework for Petascale/Exascale Hybrid Computer Systems (1201.2118v1)

Published 10 Jan 2012 in cs.DC

Abstract: Heterogeneous systems are becoming more common on High Performance Computing (HPC) systems. Even using tools like CUDA and OpenCL it is a non-trivial task to obtain optimal performance on the GPU. Approaches to simplifying this task include Merge (a library based framework for heterogeneous multi-core systems), Zippy (a framework for parallel execution of codes on multiple GPUs), BSGP (a new programming language for general purpose computation on the GPU) and CUDA-lite (an enhancement to CUDA that transforms code based on annotations). In addition, efforts are underway to improve compiler tools for automatic parallelization and optimization of affine loop nests for GPUs and for automatic translation of OpenMP parallelized codes to CUDA. In this paper we present an alternative approach: a new computational framework for the development of massively data parallel scientific codes applications suitable for use on such petascale/exascale hybrid systems built upon the highly scalable Cactus framework. As the first non-trivial demonstration of its usefulness, we successfully developed a new 3D CFD code that achieves improved performance.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Marek Blazewicz (4 papers)
  2. Steven R. Brandt (16 papers)
  3. Peter Diener (17 papers)
  4. David M. Koppelman (3 papers)
  5. Krzysztof Kurowski (8 papers)
  6. Frank Löffler (29 papers)
  7. Erik Schnetter (60 papers)
  8. Jian Tao (22 papers)
Citations (15)

Summary

We haven't generated a summary for this paper yet.