- The paper proposes a novel method for generating random integers in an interval that replaces computationally expensive integer divisions with faster multiplication and bit-shifting operations.
- Numerical results show the proposed method is significantly faster, performing up to three times faster than traditional methods, especially for 64-bit operations.
- The optimized method has practical impact: it has been adopted in the Go language and benefits applications that rely on efficient random sampling, such as machine learning and simulations.
Analysis of "Fast Random Integer Generation in an Interval"
The paper, titled "Fast Random Integer Generation in an Interval" and authored by Daniel Lemire, explores the critical task of generating random integers over a specified range while avoiding expensive operations, notably integer division. It addresses inefficiencies in the conventional random integer generation methods embedded in programming languages such as Java, Python, and C++, and proposes a method that significantly improves performance by replacing costly integer divisions with cheaper multiplications and bit shifts.
Summary and Methodology
Random integer generation in a half-open interval [0, s) is fundamental in simulations, cryptographic applications, and statistical tests. Traditional approaches use pseudo-random generators such as linear congruential generators to produce random numbers of a fixed bit width (e.g., 32 or 64 bits). The prevalent method for reducing such a number to a smaller range is the modulo operation, which typically involves an integer division, a computationally expensive operation that hampers performance.
The author proposes a method that nearly eliminates integer division by employing a multiplication-and-bit-shift strategy: multiplying a random word by the range size s and keeping the high bits of the product efficiently maps the pseudo-random number into the specified range. Combined with a rare rejection step to remove the residual bias, the paper shows that this method is not only faster but also preserves the uniformity of the distribution, an essential requirement for unbiased random generation.
Numerical Results and Implementations
The paper highlights robust numerical results illustrating that the proposed method significantly enhances speed. Specifically, it emphasizes that:
- When using 32-bit indexes, the proposed method is nearly twice as fast as the Java-like method and can be 50% faster than the OpenBSD-like implementation under standard conditions.
- For 64-bit indexes, the performance gain is even more marked. The proposed method can outperform traditional methods by up to three times, attributed to the higher latency associated with 64-bit integer divisions in classic methods.
- The method has been experimentally validated on high-performance x64 processors, showing reduced latency and improved memory-access patterns, reflected in fewer cache misses.
Implications and Future Directions
The paper has immediate practical implications: its superior performance prompted adoption in the implementation of the Go language's shuffle function. The reduced overhead of random integer generation can greatly benefit applications where random sampling or randomized algorithms are central, such as machine learning and complex simulations, which often require extensive shuffling and sampling.
Theoretically, the paper’s methodology aids in exploring further avenues in algorithmic optimizations where integer division is prevalent. It also opens the door for further research into hardware-optimized random number generation strategies that could supplant current standard library offerings.
Conclusion
The methods advocated in this paper challenge traditional implementations and propose an efficient alternative that balances speed and uniform distribution. While the implementation favors general-purpose processors, the principled approach to reducing computational inefficiencies through arithmetic operations instead of division sets a precedent for future algorithmic advancements in random number generation.
This work reflects a meaningful step forward in optimizing low-level operations within high-level programmatic applications, providing a rigorous foundation for further exploration and application in various computational domains.