Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

New Thread Migration Strategies for NUMA Systems (1809.10937v1)

Published 28 Sep 2018 in cs.DC and cs.PF

Abstract: Multicore systems present on-board memory hierarchies and communication networks that influence performance when executing shared memory parallel codes. Characterising this influence is complex, and understanding the effect of particular hardware configurations on different codes is of paramount importance. In previous works, monitoring information extracted from hardware counters at runtime has been used to characterise the behaviour of each thread in the parallel code in terms of the number of floating point operations per second, operational intensity, and latency of memory access. We propose to use this information to guide thread migration strategies that improve execution efficiency by increasing locality and affinity. Different configurations of NAS Parallel OpenMP benchmarks on multicores were used to validate the benefits of the proposed thread migration strategies. Our proposed strategies produce up to 70% improvement in scenarios where locality and affinity are low, there being a small degradation in performance for codes with high locality and affinity.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. O. G. Lorenzo (1 paper)
  2. M. L. BecoƱa (1 paper)
  3. T. F. Pena (1 paper)
  4. J. C. Cabaleiro (1 paper)
  5. J. A. Lorenzo (1 paper)
  6. F. F. Rivera (1 paper)
Citations (1)

Summary

We haven't generated a summary for this paper yet.