Existence of LLMs or prompts that can effectively select covariates for confounder identification

Determine whether any large language model or prompt design can effectively select covariates for confounder identification in real-world causal analysis contexts such as the Coronary Drug Project.

Background

Although current models show inconsistent and often inadequate performance for confounder selection, the authors acknowledge that their results do not rule out the possibility that future LLMs or improved prompting strategies could succeed.

They propose that the CDP setting and code can serve as a repeatable benchmark to test future models for improvements in covariate selection capabilities.

References

These results of course cannot prove that LLMs can never complete this task. LLM technology is constantly improving, and none of the results in this paper prove that there will never be an LLM or a prompt design capable of effectively selecting covariates.

— Do LLMs Act as Repositories of Causal Knowledge? (2412.10635 - Huntington-Klein et al., 14 Dec 2024) in Conclusion

Existence of LLMs or prompts that can effectively select covariates for confounder identification

Sponsor

Background

References

Related Problems