Emma

Summary:

  • Large Language Models (LLMs) fail to generate correct Python code when default function names are changed.
  • As model size increases, some LLMs become more confident in incorrect predictions, a phenomenon called Inverse Scaling.

Tags:

Research