Papers
Topics
Authors
Recent
Search
2000 character limit reached

Deciphering Bitcoin Blockchain Data by Cohort Analysis

Published 27 Feb 2021 in econ.GN, math.NA, q-fin.CP, and stat.CO | (2103.00173v3)

Abstract: Bitcoin is a peer-to-peer electronic payment system that has rapidly grown in popularity in recent years. Usually, the complete history of Bitcoin blockchain data must be queried to acquire variables with economic meaning. This task has recently become increasingly difficult, as there are over 1.6 billion historical transactions on the Bitcoin blockchain. It is thus important to query Bitcoin transaction data in a way that is more efficient and provides economic insights. We apply cohort analysis that interprets Bitcoin blockchain data using methods developed for population data in the social sciences. Specifically, we query and process the Bitcoin transaction input and output data within each daily cohort. This enables us to create datasets and visualizations for some key Bitcoin transaction indicators, including the daily lifespan distributions of spent transaction output (STXO) and the daily age distributions of the cumulative unspent transaction output (UTXO). We provide a computationally feasible approach for characterizing Bitcoin transactions that paves the way for future economic studies of Bitcoin.

Citations (17)

Summary

  • The paper introduces an innovative cohort analysis method to partition Bitcoin blockchain transactions based on temporal characteristics, improving data management.
  • The paper demonstrates that a significant proportion of UTXOs serve as a long-term store of value while newer UTXOs act as a medium of exchange.
  • The analysis validates its approach by matching Bitcoin supply data with external sources and lays groundwork for similar methods in other UTXO-based cryptocurrencies.

Summary of "Deciphering Bitcoin Blockchain Data by Cohort Analysis"

The paper "Deciphering Bitcoin Blockchain Data by Cohort Analysis" presents an innovative methodology for analyzing Bitcoin blockchain transactions by applying cohort analysis, a technique traditionally used in social sciences for population data. This approach addresses the challenges posed by the vast and complex Bitcoin transaction history exceeding 1.6 billion transactions.

Methodology and Technique

The study leverages cohort analysis to categorize Bitcoin transactions into cohorts based on their temporal characteristics. Specifically, unspent transaction outputs (UTXOs) and spent transaction outputs (STXOs) are organized into daily birth cohorts and death cohorts, respectively. This method allows for an efficient and economically insightful examination of the blockchain by segmenting the data into manageable partitions based on time of creation and spending.

By employing Google Colaboratory and Google BigQuery, the authors extract relevant data, significantly reducing the computational burden. They construct partitioned tables that facilitate cohort-specific queries and visualizations, enabling analysis of temporal patterns in BTC usage across various time spans.

Key Findings

UTXO and STXO Analysis

The use of cohort analysis reveals distinct trends in Bitcoin as a currency:

  • Store of Value: A significant proportion of UTXOs have not been spent for over a year, indicating their role as a store of value. By February 2021, approximately 2 million BTCs had remained inactive for over 10 years. Figure 1

    Figure 1: Number of BTC UTXOs by age. The data highlight the accumulated unspent coins, demonstrating varying usages within Bitcoin's ecosystem.

  • Medium of Exchange: Frequent transaction activity is observed among UTXOs less than one month old, suggesting their use as a medium of exchange.

Economic Indicators and Validation

The study further validates its methodology by comparing calculated Bitcoin supply data against external sources, showing an exact match with CoinMetrics data. Figure 2

Figure 2: Block reward and circulating BTC supply comparison, verifying data accuracy.

  • The cohort analysis method yields datasets capturing STXO and UTXO characteristics from 2009 to 2021, providing a valuable resource for economic modeling and future research expansions into other UTXO-based cryptocurrencies.

Implications and Future Directions

The application of cohort analysis to blockchain data grants new insights into Bitcoin's economic functionalities and offers significant computational efficiencies. This approach can extend to other cryptocurrencies (e.g., Litecoin, Dash) that utilize the UTXO model, as outlined in the study. Figure 3

Figure 3: Lifespan distribution of Bitcoin STXOs, illustrating various lifespan categories.

The paper suggests potential for broad applications in finance, security evaluations, and macroeconomic studies. Nonetheless, the authors acknowledge challenges in differentiating UTXOs used as stores of value from those lost or held below transaction-cost thresholds ("dust"). Future work may adapt these techniques to account-based blockchains such as Ethereum, opening avenues for comparative analysis across differing blockchain architectures.

Conclusion

This research exemplifies the application of social science methodologies to technical blockchain data, achieving substantial insights into Bitcoin's transaction dynamics. The integration of cohort analysis with blockchain data facilitates efficient processing and meaningful economic analysis, advancing the understanding of cryptocurrencies in financial ecosystems and informing subsequent studies in related domains.

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Authors (3)

Collections

Sign up for free to add this paper to one or more collections.