How can chain-of-thought supervision be applied to unstructured tasks like story-writing?
Determine how to provide effective chain-of-thought supervision for unstructured tasks such as story-writing in order to circumvent teacher-forcing-related failures identified for lookahead tasks.
Sponsor
References
However, it is unclear how that is possible in more unstructured tasks like story-writing.
— The pitfalls of next-token prediction
(2403.06963 - Bachmann et al., 2024) in Section: Conclusion