Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Efficient GPU Thread Mapping on Embedded 2D Fractals (2004.13475v1)

Published 25 Apr 2020 in cs.DC

Abstract: This work proposes a new approach for mapping GPU threads onto a family of discrete embedded 2D fractals. A block-space map $\lambda: \mathbb{Z}{\mathbb{E}}{2} \mapsto \mathbb{Z}{\mathbb{F}}{2}$ is proposed, from Euclidean parallel space $\mathbb{E}$ to embedded fractal space $\mathbb{F}$, that maps in $\mathcal{O}(\log_2 \log_2(n))$ time and uses no more than $\mathcal{O}(n\mathbb{H})$ threads with $\mathbb{H}$ being the Hausdorff dimension of the fractal, making it parallel space efficient. When compared to a bounding-box (BB) approach, $\lambda(\omega)$ offers a sub-exponential improvement in parallel space and a monotonically increasing speedup $n \ge n_0$. The Sierpinski gasket fractal is used as a particular case study and the experimental performance results show that $\lambda(\omega)$ reaches up to $9\times$ of speedup over the bounding-box approach. A tensor-core based implementation of $\lambda(\omega)$ is also proposed for modern GPUs, providing up to $\sim40\%$ of extra performance. The results obtained in this work show that doing efficient GPU thread mapping on fractal domains can significantly improve the performance of several applications that work with this type of geometry.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Cristóbal A. Navarro (21 papers)
  2. Felipe A. Quezada (8 papers)
  3. Nancy Hitschfeld (15 papers)
  4. Raimundo Vega (4 papers)
  5. Benjamin Bustos (10 papers)
Citations (8)

Summary

We haven't generated a summary for this paper yet.