2000 character limit reached
Adaptive fast multipole methods on the GPU (1205.4611v2)
Published 21 May 2012 in cs.DC and cs.DS
Abstract: We present a highly general implementation of fast multipole methods on graphics processing units (GPUs). Our two-dimensional double precision code features an asymmetric type of adaptive space discretization leading to a particularly elegant and flexible implementation. All steps of the multipole algorithm are efficiently performed on the GPU, including the initial phase which assembles the topological information of the input data. Through careful timing experiments we investigate the effects of the various peculiarities of the GPU architecture.