Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
153 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Modified CTGAN-Plus-Features Based Method for Optimal Asset Allocation (2302.02269v3)

Published 5 Feb 2023 in q-fin.PM and cs.CE

Abstract: We propose a new approach to portfolio optimization that utilizes a unique combination of synthetic data generation and a CVaR-constraint. We formulate the portfolio optimization problem as an asset allocation problem in which each asset class is accessed through a passive (index) fund. The asset-class weights are determined by solving an optimization problem which includes a CVaR-constraint. The optimization is carried out by means of a Modified CTGAN algorithm which incorporates features (contextual information) and is used to generate synthetic return scenarios, which, in turn, are fed into the optimization engine. For contextual information we rely on several points along the U.S. Treasury yield curve. The merits of this approach are demonstrated with an example based on ten asset classes (covering stocks, bonds, and commodities) over a fourteen-and-half year period (January 2008-June 2022). We also show that the synthetic generation process is able to capture well the key characteristics of the original data, and the optimization scheme results in portfolios that exhibit satisfactory out-of-sample performance. We also show that this approach outperforms the conventional equal-weights (1/N) asset allocation strategy and other optimization formulations based on historical data only.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (43)
  1. \APACrefYearMonthDay2001. \BBOQ\APACrefatitleIt’s time for asset allocation It’s time for asset allocation.\BBCQ \APACjournalVolNumPagesJournal of Financial Transformation377–88. \PrintBackRefs\CurrentBib
  2. \APACrefYearMonthDay1999. \BBOQ\APACrefatitleCoherent measures of risk Coherent measures of risk.\BBCQ \APACjournalVolNumPagesMathematical finance93203–228. \PrintBackRefs\CurrentBib
  3. \APACrefYearMonthDay2019. \BBOQ\APACrefatitleThe big data newsvendor: Practical insights from machine learning The big data newsvendor: Practical insights from machine learning.\BBCQ \APACjournalVolNumPagesOperations Research67190–108. \PrintBackRefs\CurrentBib
  4. \APACrefYearMonthDay2018. \BBOQ\APACrefatitleInformation in the yield curve about future recessions Information in the yield curve about future recessions.\BBCQ \APACjournalVolNumPagesFRBSF Economic Letter201–5. \PrintBackRefs\CurrentBib
  5. \APACrefYearMonthDay2020. \BBOQ\APACrefatitleFrom predictive to prescriptive analytics From predictive to prescriptive analytics.\BBCQ \APACjournalVolNumPagesManagement Science6631025–1044. \PrintBackRefs\CurrentBib
  6. \APACinsertmetastarbogle2018stay{APACrefauthors}Bogle, J\BPBIC.  \APACrefYear2018. \APACrefbtitleStay the course: the story of Vanguard and the index revolution Stay the course: the story of Vanguard and the index revolution. \APACaddressPublisherJohn Wiley & Sons. \PrintBackRefs\CurrentBib
  7. \APACrefYearMonthDay2013. \BBOQ\APACrefatitleDensity-based clustering based on hierarchical density estimates Density-based clustering based on hierarchical density estimates.\BBCQ \BIn \APACrefbtitlePacific-Asia conference on knowledge discovery and data mining Pacific-Asia conference on knowledge discovery and data mining (\BPGS 160–172). \PrintBackRefs\CurrentBib
  8. \APACrefYearMonthDay2022. \BBOQ\APACrefatitleA statistical learning approach to personalization in revenue management A statistical learning approach to personalization in revenue management.\BBCQ \APACjournalVolNumPagesManagement Science6831923–1937. \PrintBackRefs\CurrentBib
  9. \APACrefYearMonthDay2009. \BBOQ\APACrefatitleOptimal versus naive diversification: How inefficient is the 1/N portfolio strategy? Optimal versus naive diversification: How inefficient is the 1/n portfolio strategy?\BBCQ \APACjournalVolNumPagesThe review of Financial studies2251915–1953. \PrintBackRefs\CurrentBib
  10. \APACrefYearMonthDay2021. \BBOQ\APACrefatitleGenerative Adversarial Networks in finance: an overview Generative adversarial networks in finance: an overview.\BBCQ \APACjournalVolNumPagesarXiv preprint arXiv:2106.06364. \PrintBackRefs\CurrentBib
  11. \APACrefYearMonthDay2019. \BBOQ\APACrefatitleAre passive funds really superior investments? An investor perspective Are passive funds really superior investments? an investor perspective.\BBCQ \APACjournalVolNumPagesFinancial Analysts Journal7537–19. \PrintBackRefs\CurrentBib
  12. \APACrefYearMonthDay2006. \BBOQ\APACrefatitleThe yield curve as a leading indicator: Some practical issues The yield curve as a leading indicator: Some practical issues.\BBCQ \APACjournalVolNumPagesCurrent issues in Economics and Finance125. \PrintBackRefs\CurrentBib
  13. \APACrefYearMonthDay2020. \BBOQ\APACrefatitleThe yield spread’s ability to forecast economic activity: What have we learned after 30 years of studies? The yield spread’s ability to forecast economic activity: What have we learned after 30 years of studies?\BBCQ \APACjournalVolNumPagesJournal of Business Research106221–232. \PrintBackRefs\CurrentBib
  14. \APACrefYear2021. \APACrefbtitleAsset Management: Tools and Issues Asset management: Tools and issues. \APACaddressPublisherWorld Scientific. \PrintBackRefs\CurrentBib
  15. \APACrefYearMonthDay2019. \BBOQ\APACrefatitleActive vs. Passive Funds—An Empirical Analysis of the German Equity Market Active vs. passive funds—an empirical analysis of the german equity market.\BBCQ \APACjournalVolNumPagesJournal of Financial Risk Management8273. \PrintBackRefs\CurrentBib
  16. \APACrefYear2014. \APACrefbtitleRisky curves: On the empirical failure of expected utility Risky curves: On the empirical failure of expected utility. \APACaddressPublisherRoutledge. \PrintBackRefs\CurrentBib
  17. \APACrefYearMonthDay2014. \APACrefbtitleGenerative Adversarial Networks. Generative adversarial networks. \APACaddressPublisherarXiv. {APACrefURL} \urlhttps://arxiv.org/abs/1406.2661 {APACrefDOI} 10.48550/ARXIV.1406.2661 \PrintBackRefs\CurrentBib
  18. \APACrefYearMonthDay2019. \BBOQ\APACrefatitleCan asset allocation limits determine portfolio risk–return profiles in DC pension schemes? Can asset allocation limits determine portfolio risk–return profiles in DC pension schemes?\BBCQ \APACjournalVolNumPagesInsurance: Mathematics and Economics86134-144. {APACrefURL} \urlhttps://www.sciencedirect.com/science/article/pii/S0167668718301331 {APACrefDOI} https://doi.org/10.1016/j.insmatheco.2019.02.009 \PrintBackRefs\CurrentBib
  19. \APACinsertmetastarhamilton1988{APACrefauthors}Hamilton, J\BPBID.  \APACrefYearMonthDay1988. \BBOQ\APACrefatitleRational-expectations econometric analysis of changes in regime: An investigation of the term structure of interest rates Rational-expectations econometric analysis of changes in regime: An investigation of the term structure of interest rates.\BBCQ \APACjournalVolNumPagesJournal of Economic Dynamics and Control122385-423. {APACrefURL} \urlhttps://www.sciencedirect.com/science/article/pii/0165188988900474 {APACrefDOI} https://doi.org/10.1016/0165-1889(88)90047-4 \PrintBackRefs\CurrentBib
  20. \APACinsertmetastarhamilton1989{APACrefauthors}Hamilton, J\BPBID.  \APACrefYearMonthDay1989. \BBOQ\APACrefatitleA New Approach to the Economic Analysis of Nonstationary Time Series and the Business Cycle A new approach to the economic analysis of nonstationary time series and the business cycle.\BBCQ \APACjournalVolNumPagesEconometrica572357–384. {APACrefURL} [2022-11-15]\urlhttp://www.jstor.org/stable/1912559 \PrintBackRefs\CurrentBib
  21. \APACrefYearMonthDay2022. \BBOQ\APACrefatitleFast rates for contextual linear optimization Fast rates for contextual linear optimization.\BBCQ \APACjournalVolNumPagesManagement Science. \PrintBackRefs\CurrentBib
  22. \APACinsertmetastaribbotson2010{APACrefauthors}Ibbotson, R\BPBIG.  \APACrefYearMonthDay2010. \BBOQ\APACrefatitleThe Importance of Asset Allocation The importance of asset allocation.\BBCQ \APACjournalVolNumPagesFinancial Analysts Journal66218-20. {APACrefURL} \urlhttps://doi.org/10.2469/faj.v66.n2.4 {APACrefDOI} 10.2469/faj.v66.n2.4 \PrintBackRefs\CurrentBib
  23. \APACrefYearMonthDay2014. \BBOQ\APACrefatitle60 Years of portfolio optimization: Practical challenges and current trends 60 years of portfolio optimization: Practical challenges and current trends.\BBCQ \APACjournalVolNumPagesEuropean Journal of Operational Research2342356-371. {APACrefURL} \urlhttps://www.sciencedirect.com/science/article/pii/S0377221713008898 \APACrefnote60 years following Harry Markowitz’s contribution to portfolio theory and operations research {APACrefDOI} https://doi.org/10.1016/j.ejor.2013.10.060 \PrintBackRefs\CurrentBib
  24. \APACrefYear2002. \APACrefbtitlePortfolio optimization with conditional value-at-risk objective and constraints Portfolio optimization with conditional value-at-risk objective and constraints (\BVOL 4) (\BNUM 2). \APACaddressPublisherInfopro Digital Risk (IP) Limited. \PrintBackRefs\CurrentBib
  25. \APACrefYearMonthDay2021. \BBOQ\APACrefatitleThe Relationship between Yield Curve and Economic Activity: An Analysis of G7 Countries The relationship between yield curve and economic activity: An analysis of G7 countries.\BBCQ \APACjournalVolNumPagesJournal of Risk and Financial Management14262. \PrintBackRefs\CurrentBib
  26. \APACrefYearMonthDay2021. \BBOQ\APACrefatitleConfronting Machine Learning with Financial Research Confronting machine learning with financial research.\BBCQ \APACjournalVolNumPagesThe Journal of Financial Data Science3367–96. \PrintBackRefs\CurrentBib
  27. \APACrefYearMonthDay2022. \BBOQ\APACrefatitleAutoencoding Conditional GAN for Portfolio Allocation Diversification Autoencoding conditional GAN for portfolio allocation diversification.\BBCQ \APACjournalVolNumPagesarXiv preprint arXiv:2207.05701. \PrintBackRefs\CurrentBib
  28. \APACrefYearMonthDay2019. \APACrefbtitlePAGAN: Portfolio Analysis with Generative Adversarial Networks. Pagan: Portfolio analysis with generative adversarial networks. \APACaddressPublisherarXiv. {APACrefURL} \urlhttps://arxiv.org/abs/1909.10578 {APACrefDOI} 10.48550/ARXIV.1909.10578 \PrintBackRefs\CurrentBib
  29. \APACinsertmetastarmarkowitz1952{APACrefauthors}Markowitz, H.  \APACrefYearMonthDay1952. \BBOQ\APACrefatitlePortfolio Selection Portfolio selection.\BBCQ \APACjournalVolNumPagesThe Journal of Finance7177–91. {APACrefURL} [2022-10-20]\urlhttp://www.jstor.org/stable/2975974 \PrintBackRefs\CurrentBib
  30. \APACinsertmetastarkolmogorov{APACrefauthors}Massey, F\BPBIJ.  \APACrefYearMonthDay1951. \BBOQ\APACrefatitleThe Kolmogorov-Smirnov Test for Goodness of Fit The Kolmogorov-Smirnov test for goodness of fit.\BBCQ \APACjournalVolNumPagesJournal of the American Statistical Association4625368–78. {APACrefURL} [2022-11-25]\urlhttp://www.jstor.org/stable/2280095 \PrintBackRefs\CurrentBib
  31. \APACrefYearMonthDay2022. \BBOQ\APACrefatitleA Synthetic Data-Plus-Features Driven Approach for Portfolio Optimization A synthetic data-plus-features driven approach for portfolio optimization.\BBCQ \APACjournalVolNumPagesComputational Economics. {APACrefURL} \urlhttps://doi.org/10.1007/s10614-022-10274-2 {APACrefDOI} 10.1007/s10614-022-10274-2 \PrintBackRefs\CurrentBib
  32. \APACinsertmetastarpflug2000some{APACrefauthors}Pflug, G\BPBIC.  \APACrefYearMonthDay2000. \BBOQ\APACrefatitleSome remarks on the Value-at-Risk and the Conditional Value-at-Risk Some remarks on the Value-at-Risk and the Conditional Value-at-Risk.\BBCQ \BIn \APACrefbtitleProbabilistic constrained optimization Probabilistic constrained optimization (\BPGS 272–281). \APACaddressPublisherSpringer. \PrintBackRefs\CurrentBib
  33. \APACrefYearMonthDay2020. \BBOQ\APACrefatitleFinancial thought experiment: A GAN-based approach to vast robust portfolio selection Financial thought experiment: A GAN-based approach to vast robust portfolio selection.\BBCQ \BIn \APACrefbtitleProceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI’20). Proceedings of the 29th international joint conference on artificial intelligence (ijcai’20). \PrintBackRefs\CurrentBib
  34. \APACrefYearMonthDay2000. \BBOQ\APACrefatitleOptimization of conditional value-at-risk Optimization of conditional value-at-risk.\BBCQ \APACjournalVolNumPagesJournal of Risk2321–41. {APACrefDOI} 10.21314/JOR.2000.038 \PrintBackRefs\CurrentBib
  35. \APACrefYearMonthDay2002. \BBOQ\APACrefatitleConditional Value-at-Risk for general loss distributions Conditional Value-at-Risk for general loss distributions.\BBCQ \APACjournalVolNumPagesJournal of banking & finance2671443–1471. \PrintBackRefs\CurrentBib
  36. \APACrefYearMonthDay1997. \BBOQ\APACrefatitleRegime switching in stock market returns Regime switching in stock market returns.\BBCQ \APACjournalVolNumPagesApplied Financial Economics72177-191. {APACrefURL} \urlhttps://doi.org/10.1080/096031097333745 {APACrefDOI} 10.1080/096031097333745 \PrintBackRefs\CurrentBib
  37. \APACrefYearMonthDay2010. \BBOQ\APACrefatitleRobust approximation to multiperiod inventory management Robust approximation to multiperiod inventory management.\BBCQ \APACjournalVolNumPagesOperations research583583–594. \PrintBackRefs\CurrentBib
  38. \APACinsertmetastarsharpe1991arithmetic{APACrefauthors}Sharpe, W\BPBIF.  \APACrefYearMonthDay1991. \BBOQ\APACrefatitleThe arithmetic of active management The arithmetic of active management.\BBCQ \APACjournalVolNumPagesFinancial Analysts Journal4717–9. \PrintBackRefs\CurrentBib
  39. \APACrefYearMonthDay2019. \BBOQ\APACrefatitleModeling financial time-series with generative adversarial networks Modeling financial time-series with generative adversarial networks.\BBCQ \APACjournalVolNumPagesPhysica A: Statistical Mechanics and its Applications527121261. \PrintBackRefs\CurrentBib
  40. \APACinsertmetastarthune_2022{APACrefauthors}Thune, K.  \APACrefYearMonthDay2022Jan. \APACrefbtitleHow and Why John Bogle Started Vanguard. How and why John Bogle started Vanguard. {APACrefURL} \urlwww.thebalancemoney.com/how-and-why-john-bogle-started-vanguard-2466413 \PrintBackRefs\CurrentBib
  41. \APACrefYearMonthDay2004. \BBOQ\APACrefatitleData-generating process uncertainty: What difference does it make in portfolio decisions? Data-generating process uncertainty: What difference does it make in portfolio decisions?\BBCQ \APACjournalVolNumPagesJournal of Financial Economics722385-421. {APACrefURL} \urlhttps://www.sciencedirect.com/science/article/pii/S0304405X03002472 {APACrefDOI} https://doi.org/10.1016/j.jfineco.2003.05.003 \PrintBackRefs\CurrentBib
  42. \APACinsertmetastarwalden2015active{APACrefauthors}Walden, M\BPBIL.  \APACrefYearMonthDay2015. \BBOQ\APACrefatitleActive versus passive investment management of state pension plans: implications for personal finance Active versus passive investment management of state pension plans: implications for personal finance.\BBCQ \APACjournalVolNumPagesJournal of Financial Counseling and Planning262160–171. \PrintBackRefs\CurrentBib
  43. \APACrefYearMonthDay2019. \BBOQ\APACrefatitleModeling Tabular data using Conditional GAN Modeling tabular data using conditional GAN.\BBCQ \APACjournalVolNumPagesCoRRabs/1907.00503. {APACrefURL} \urlhttp://arxiv.org/abs/1907.00503 \PrintBackRefs\CurrentBib
Citations (4)

Summary

  • The paper demonstrates that integrating synthetic data generation with contextual features significantly improves portfolio optimization under CVaR constraints.
  • It leverages a modified CTGAN model to generate realistic market scenarios that overcome the limitations of historical correlation estimates.
  • Empirical analysis shows enhanced returns, controlled risk levels, and efficient diversification compared to traditional mean-variance approaches.

A Modified CTGAN-Plus-Features Based Method for Optimal Asset Allocation

The paper "A Modified CTGAN-Plus-Features Based Method for Optimal Asset Allocation" by Peña et al. offers a novel approach to portfolio optimization by synergizing synthetic data generation and a CVaR (Conditional Value-at-Risk) constraint. This work aims to address the limitations associated with traditional Markowitz-based mean-variance optimization frameworks, particularly in terms of accurately estimating the correlation matrix coefficients and effectively capturing risk metrics.

Motivation and Context

The issue of portfolio optimization, which involves the strategic allocation of assets to maximize returns for a given level of risk, has remained a central problem in financial engineering since the seminal work of Harry Markowitz. However, the practical application of Markowitz’s mean-variance optimization has seen limited success due to problems such as the unreliability of historical correlation matrices and the inadequacy of standard deviation as a comprehensive risk metric.

In response to these limitations, this paper builds upon advancements in risk metrics—specifically CVaR, which better captures tail risk by focusing on extreme loss scenarios. Furthermore, it leverages recent progress in synthetic data generation, notably Generative Adversarial Networks (GANs), to overcome the drawbacks of the traditional reliance on historical data alone. The authors argue that synthetic data generation adds value by simulating a broader array of realistic future scenarios from the input historical data, which helps in dealing with data sparsity and enhancing the robustness of the optimization process.

The Proposed Method

Peña et al. introduce a Modified CTGAN (Conditional Tabular GAN) method augmented with contextual features to generate synthetic returns data. Their method incorporates several innovative steps:

  1. Synthetic Data Generation: The CTGAN method is used to generate realistic synthetic data scenarios. This method is superior to simple random sampling as it learns the complex underlying distributions from historical data, capturing both marginal and joint distributions across multiple asset classes.
  2. Contextual Feature Integration: Features, specifically points along the U.S. Treasury yield curve, are integrated into the CTGAN model. This incorporation of features improves the relevance and realism of the generated scenarios by conditioning on current economic contexts, thereby making the generated data more aligned with current market regimes.
  3. CVaR-Based Optimization: The portfolio optimization problem is then formulated as a linear programming problem with a CVaR constraint, ensuring that the allocated portfolio's risk stays within a predefined tolerance level.

Performance Evaluation

The empirical analysis conducted in the paper is robust, covering a diverse set of asset classes over a period of fourteen and a half years. This period includes significant market events such as the 2008 financial crisis and the COVID-19 pandemic. The evaluation metrics include annualized returns, CVaR (ex post), portfolio rotation (as a proxy for transaction costs), and diversification (as measured by the Herfindahl–Hirschman Index).

Key Findings:

  • Returns: The modified CTGAN method with features (GwF) consistently outperformed other strategies, including the equal weights (1/N) strategy and historical data-based methods with and without features. This underscores the value of synthetic data and contextual feature incorporation.
  • Risk: Portfolios optimized using synthetic data with features maintained risk levels well within the predefined CVaR limits, often outperforming those based solely on historical data.
  • Transaction Costs: The portfolio rotation results demonstrate the efficiency of the proposed method in keeping transaction costs low.
  • Diversification: The Modified CTGAN approach led to well-diversified portfolios, avoiding the common pitfall of overly concentrated portfolios often seen in traditional mean-variance optimization.

Implications and Future Directions

This paper has implications for both theoretical advancements and practical applications in the field of portfolio management. The integration of machine learning techniques such as GANs into financial optimization represents a significant shift towards more adaptable and robust asset allocation strategies. The methodological approach of combining synthetic data generation with CVaR-constrained optimization and contextual features can be extended to other financial variables and settings beyond asset returns.

Future research could explore:

  1. Alternative Features: Incorporating additional or alternative features such as market volatility indices or liquidity measures to further enhance the robustness of the generated data.
  2. Application to Other Financial Metrics: Extending the model to other financial variables like bond default rates or exchange rates to assess broader applicability.
  3. Combination with Other Data Sources: Integrating macroeconomic indicators or sentiment analysis data from news and social media for a more comprehensive approach to data-driven portfolio optimization.

By addressing the limitations of historical data reliance and incorporating forward-looking elements into the optimization process, this paper sets a new benchmark for future research and practical implementation in asset management.

Conclusion

In sum, the Modified CTGAN-Plus-Features method proposed by Peña et al. provides a significant step forward in portfolio optimization techniques. The empirical results substantiate the claim that leveraging synthetic data generation coupled with CVaR constraints and contextual features yields more reliable and effective asset allocation strategies. This paper serves as an important reference point for both academic researchers and industry practitioners aiming to improve portfolio performance through advanced data generation and risk management techniques.

Youtube Logo Streamline Icon: https://streamlinehq.com