Optimal learning strategy for non-differentiable transformation choices in computation graph synthesis
Ascertain whether gradient descent is an optimal learning strategy for selecting sequences of non-differentiable mathematical transformations when synthesizing computation graphs for math word problem solvers.
References
The choices of these mathematical transformations are not differential, and hence it is unclear if its the optimal strategy.
— Towards Tractable Mathematical Reasoning: Challenges, Strategies, and Opportunities for Solving Math Word Problems
(2111.05364 - Faldu et al., 2021) in Section: Reinforcement Learning