Practical viability of input-dependent SSMs for large-scale language modeling
Determine whether input-dependent state-space model architectures that increase expressivity for state tracking—such as variants that make the SSM transition matrix depend on the input (e.g., Input-Dependent S4 or Liquid S4)—are practically viable for large-scale language modeling.
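To make "transition matrix depends on the input" concrete, here is a minimal toy sketch of a linear recurrence h_t = A(x_t) h_{t-1} in which each token selects its own transition matrix. It is not the Input-Dependent S4 or Liquid S4 parameterization: the token names, the lookup table TRANSITIONS, and the helper functions are hypothetical, the matrices are hand-picked permutations rather than learned, and the input term B x_t is omitted. It only illustrates how per-token matrices let a linear state follow a sequence of swaps, and says nothing about viability at scale.

```python
# Toy sketch only: all names here are illustrative assumptions,
# not taken from the paper or from any SSM library.

def matvec(A, h):
    """Multiply matrix A (list of rows) by vector h."""
    return [sum(a * x for a, x in zip(row, h)) for row in A]

def swap_matrix(i, j, n=3):
    """Permutation matrix that exchanges coordinates i and j."""
    A = [[1 if r == c else 0 for c in range(n)] for r in range(n)]
    A[i], A[j] = A[j], A[i]
    return A

# Input-dependent transition: each token picks a different matrix A(x_t).
# In a trained model A(x_t) would be a learned function of the input.
TRANSITIONS = {
    "swap01": swap_matrix(0, 1),
    "swap12": swap_matrix(1, 2),
    "swap02": swap_matrix(0, 2),
}

def run_input_dependent_ssm(tokens, h0):
    """Apply h_t = A(x_t) h_{t-1} over the token sequence."""
    h = list(h0)
    for tok in tokens:
        h = matvec(TRANSITIONS[tok], h)
    return h

def ball_position(tokens):
    """Track which of three cups hides a ball that starts under cup 0."""
    h = run_input_dependent_ssm(tokens, [1, 0, 0])
    return h.index(1)

if __name__ == "__main__":
    print(ball_position(["swap01", "swap12"]))
```

With a single fixed matrix A shared by every token, the same recurrence could not distinguish "swap01" from "swap12"; the per-token choice of A(x_t) is what carries the state tracking in this toy.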
References
It is an open question whether such SSM architectures with greater expressivity for state tracking are practically viable for large-scale language modeling.
— The Illusion of State in State-Space Models
(2404.08819 - Merrill et al., 12 Apr 2024) in Introduction, final paragraph