Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Decoupled Strategy for Imbalanced Workloads in MapReduce Frameworks (1810.04146v1)

Published 9 Oct 2018 in cs.DC

Abstract: In this work, we consider the integration of MPI one-sided communication and non-blocking I/O in HPC-centric MapReduce frameworks. Using a decoupled strategy, we aim to overlap the Map and Reduce phases of the algorithm by allowing processes to communicate and synchronize using solely one-sided operations. Hence, we effectively increase the performance in situations where the workload per process is unexpectedly unbalanced. Using a Word-Count implementation and a large dataset from the Purdue MapReduce Benchmarks Suite (PUMA), we demonstrate that our approach can provide up to 23% performance improvement on average compared to a reference MapReduce implementation that uses state-of-the-art MPI collective communication and I/O.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Sergio Rivas-Gomez (6 papers)
  2. Sai Narasimhamurthy (6 papers)
  3. Keeran Brabazon (1 paper)
  4. Oliver Perks (1 paper)
  5. Erwin Laure (32 papers)
  6. Stefano Markidis (106 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.