Identify the cause of GPT-4’s macro-level attribution performance discrepancy across funds
Ascertain the reasons for the observed discrepancy in GPT-4’s accuracy when executing macro-level performance attribution calculations at the “GICS Type” level using the generate_prompt_macro approach within a LangChain OpenAI Functions Agent and pandas workflow, specifically why the Portfolio Defensive achieved perfect accuracy on the first run while Portfolio Growth and Portfolio Value required several attempts. Evaluate whether prompt complexity and the absence of numerical examples in the prompt contribute to this inconsistency.
References
We are not sure of the reasons for this performance discrepancy. It could be related to the fact that the prompt is complicated with several formulas and instructions, and it may need numerical examples in the prompt.