Identification of the structural functional ω(G) governing power-law scaling
Identify the functional ω(G) of the hidden bipartite factorization graph G (with input factors X_i, output factors Y_j, and parent sets I_j) that governs power-law scaling of the loss with respect to a resource variable ξ (such as compute, model capacity, or dataset size), i.e., determine ω(G) such that the loss satisfies L ∝ ξ^{−ω(G)}.
Sponsor
References
As a side note, a close look at some experiments (e.g., Figure~\ref{fig:filtration} or~\ref{fig:generalization}) suggests the existence of a power-law regime, which posits a functional \omega(G) such that
\mathcal{L} \propto \xi{-\omega(G)}, where \omega(G) captures how {\em simple} the underlying factorization is, and is thus expected to be a decreasing function of the complexity parameters of $G$. We leave the precise identification of such a functional for future work.