Estimating causal vs non-causal deviations and optimal source-domain selection
Determine practical procedures to estimate the Causal Deviation Δ_c and the Non-Causal Deviation Δ_{nc} in the constrained optimization-based causal discovery framework for stock prediction, where Δ_c and Δ_{nc} are defined respectively as the minimal average squared deviation across source domains of the domain-wise least-squares regression coefficients for features φ(X) that correspond to causal features and for features φ(X) that correspond to non-causal features; and identify the optimal selection and partitioning of source domains (i.e., training horizon and domain construction) that yields favorable values of Δ_c and Δ_{nc} to enable reliable discovery of causal features.
Sponsor
References
However, it is non-trivial to estimate their values or find the best collection of domains. We leave this as an open question, and in this work, we assume that Δ_{nc} is low considering the large number of domains in our settings.