Integrate a statistics-aware distributed optimizer into TQP
Develop and integrate a statistics-aware distributed query optimizer for the Tensor Query Processor (TQP) that can automatically select data exchange operations (e.g., shuffle versus broadcast) and join orderings based on estimated cardinalities for distributed execution, replacing the current manual optimization of compiled tensor programs.
References
We leave the integration with a statistic-aware distributed query optimizer as part of our future work.
— Terabyte-Scale Analytics in the Blink of an Eye
(2506.09226 - Wu et al., 10 Jun 2025) in Section 4.4 (TQP Setup)