Emergence of Randomness in Temporally Aggregated Financial Tick Sequences (2511.17479v1)

Published 21 Nov 2025 in q-fin.ST and q-fin.TR

Abstract: Markets efficiency implies that the stock returns are intrinsically unpredictable, a property that makes markets comparable to random number generators. We present a novel methodology to investigate ultra-high frequency financial data and to evaluate the extent to which tick by tick returns resemble random sequences. We extend the analysis of ultra high-frequency stock market data by applying comprehensive sets of randomness tests, beyond the usual reliance on serial correlation or entropy measures. Our purpose is to extensively analyze the randomness of these data using statistical tests from standard batteries that evaluate different aspects of randomness. We illustrate the effect of time aggregation in transforming highly correlated high-frequency trade data to random streams. More specifically, we use many of the tests in the NIST Statistical Test Suite and in the TestU01 battery (in particular the Rabbit and Alphabit sub-batteries), to prove that the degree of randomness of financial tick data increases together with the increase of the aggregation level in transaction time. Additionally, the comprehensive nature of our tests also uncovers novel patterns, such as non-monotonic behaviors in predictability for certain assets. This study demonstrates a model-free approach for both assessing randomness in financial time series and generating pseudo-random sequences from them, with potential relevance in several applications.

Summary

The paper demonstrates that temporal aggregation of financial tick data significantly increases randomness, converting sequences into pseudo-random streams.
It employs model-free binary encoding and extensive randomness tests (NIST and TestU01) to reveal non-monotonic predictability in asset prices.
These findings suggest potential applications in cryptography and market efficiency evaluations, highlighting areas for further research.

Emergence of Randomness in Temporally Aggregated Financial Tick Sequences

Introduction

The paper "Emergence of Randomness in Temporally Aggregated Financial Tick Sequences" (2511.17479) investigates the degree to which ultra-high frequency financial data can resemble random sequences. With markets efficiency being a key concept under the Efficient Market Hypothesis (EMH), asset prices within such markets are theoretically unpredictable—akin to outputs from random number generators. This research extends traditional analyses by applying comprehensive randomness tests to high-frequency stock market data, beyond serial correlation and entropy measures. Through temporal aggregation, the authors transform highly correlated trade data into random streams, exploring the stochastic properties with statistical test batteries such as NIST Statistical Test Suite and TestU01. This model-free approach aids in assessing randomness in financial time series and may hold relevance for generating pseudo-random sequences, with applications in cryptography and other domains.

Methodology

The paper first sets out to encode financial price sequences into binary strings to measure randomness. For each sequence of transaction prices, binary strings are constructed by comparing consecutive price ratios. Detailed temporal aggregation is applied, creating numerous binary subsequences per day, which are analyzed across varying levels of aggregation. This innovative encoding approach captures frequency-based patterns within high-frequency trading data.

The paper employs diverse randomness tests from the NIST Statistical Test Suite and TestU01 batteries—spanning frequencies, patterns, entropy measures, spectral characteristics, and random walks. By testing these sequences, the paper yields empirical estimations of randomness, presenting a thorough characterization of stochastic financial data without relying on any prior model assumptions.

Figure 1: Results of smultin_MultinomialBitsOver (L=4) test from Alphabit applied to CCL, INTC, LLY, and SPY data.

Results

The research demonstrates that temporal aggregation significantly enhances the randomness of financial tick data. With increasing aggregation levels, the binarized transaction sequences exhibit intensified randomness, subsequently resembling outputs from random number generators in many cases. Such aggregation mitigates predictable microstructure patterns, transforming intra-day frequency data into random-like sequences over larger time intervals.

Notable results include the unveiling of non-monotonic behaviors in predictability for certain assets, observed through numerous tests. Furthermore, surprising instances of persistent predictability emerge even at high aggregation levels, particularly for actively traded stocks like AAPL and TSLA, indicating that random-like sequences might require even higher levels of aggregation.

Figure 2: Results of Runs test from NIST STS applied to AAPL, CCL, MSFT, and TSLA data.

Figure 3: Results of ApproximateEntropy test from NIST STS applied to AAPL, CCL, INTC, and SPY data.

Implications and Future Work

This research lays a robust foundation for deploying pseudo-random number sequences derived from financial data, enhancing the scope for applications in cryptographic and data security models where randomness is pivotal. Future research could further explore the applications of such sequences in randomness beacons and PRNGs, while expanding the dataset across broader asset baskets for verification.

The methodology provides valuable insight into detecting algorithmic trading influences at microstructural levels. Future work could explore understanding how algorithmic periodicity might impact randomness patterns, potentially refining randomness tests to capture deterministic market influences more accurately.

Conclusion

In summary, this paper successfully demonstrates how temporal aggregation can enhance the randomness of high-frequency financial tick sequences. By employing a detailed suite of randomness tests, the research reaffirms that financial data can generically transform into pseudo-random streams under aggregation, providing a model-free approach with profound implications for cryptography and market efficiency evaluations. Further development in randomness assessments might leverage improved visualization techniques to summarize stochastic properties across diverse markets and timeframes, optimizing randomness extraction from traded data.