Emma

Summary:

  • Sophia is a new second-order clipped stochastic optimization algorithm that improves language model training efficiency.
  • The optimizer achieves a 2x speed-up compared to Adam and adapts to the curvature in different components of the parameters.

Tags:

Research