
2-Cats: 2D Copula Approximating Transforms (2309.16391v5)

Published 28 Sep 2023 in cs.LG and cs.AI

Abstract: Copulas are powerful statistical tools for capturing dependencies across data dimensions. Applying copulas involves estimating independent marginals, a straightforward task, followed by the much more challenging task of determining a single copulating function, $C$, that links these marginals. For bivariate data, a copula takes the form of a two-increasing function $C: (u,v)\in \mathbb{I}^2 \rightarrow \mathbb{I}$, where $\mathbb{I} = [0, 1]$. This paper proposes 2-Cats, a Neural Network (NN) model that learns two-dimensional copulas without relying on specific copula families (e.g., Archimedean). Furthermore, via both theoretical properties of the model and a Lagrangian training approach, we show that 2-Cats meets the desiderata of copula properties. Moreover, inspired by the literature on Physics-Informed Neural Networks and Sobolev Training, we further extend our training strategy to learn not only the output of a copula but also its derivatives. Our proposed method exhibits superior performance compared to the state-of-the-art across various datasets while respecting the properties of $C$ (provably for most, and approximately for the single remaining one).
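To make the "two-increasing" requirement concrete, the sketch below numerically checks the standard textbook properties of a bivariate copula on a grid: groundedness, uniform margins, and non-negative rectangle volume. It is an illustration only and not the 2-Cats model or its training procedure; the function names (`independence_copula`, `rectangle_volume`, `check_copula_properties`) and the grid resolution are assumptions introduced here.

```python
from itertools import product


def independence_copula(u, v):
    """C(u, v) = u * v, the copula of two independent variables."""
    return u * v


def rectangle_volume(C, u1, u2, v1, v2):
    """C-volume of the rectangle [u1, u2] x [v1, v2]."""
    return C(u2, v2) - C(u2, v1) - C(u1, v2) + C(u1, v1)


def check_copula_properties(C, steps=20, tol=1e-12):
    """Check copula properties of C on a regular grid over [0, 1]^2.

    A grid check can only refute a property, never prove it.
    """
    grid = [i / steps for i in range(steps + 1)]

    # Grounded: C(u, 0) = C(0, v) = 0.
    grounded = all(
        abs(C(t, 0.0)) <= tol and abs(C(0.0, t)) <= tol for t in grid
    )

    # Uniform margins: C(u, 1) = u and C(1, v) = v.
    uniform_margins = all(
        abs(C(t, 1.0) - t) <= tol and abs(C(1.0, t) - t) <= tol for t in grid
    )

    # Two-increasing: every grid cell has non-negative C-volume.
    two_increasing = all(
        rectangle_volume(C, grid[i], grid[i + 1], grid[j], grid[j + 1]) >= -tol
        for i, j in product(range(steps), repeat=2)
    )

    return {
        "grounded": grounded,
        "uniform_margins": uniform_margins,
        "two_increasing": two_increasing,
    }


if __name__ == "__main__":
    print(check_copula_properties(independence_copula))
    # A function that is not a copula: its second margin is v**2, not v.
    print(check_copula_properties(lambda u, v: u * v * v))
```

The independence copula passes all three checks, while the second function fails the uniform-margins check because $C(1, v) = v^2 \neq v$.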

