Market-Cap Bias in LLM-Derived Textual Signals

Test the conjecture that textual signals extracted by large language models from financial news are of higher quality for large-cap companies than for small-cap companies due to news coverage bias.

Background

In comparing portfolio weighting schemes, the survey observes that market-capitalization-weighted portfolios showed slightly higher returns than equally weighted portfolios in multiple studies. The authors hypothesize a potential explanation: LLM-extracted textual signals might be more reliable for larger firms due to greater and higher-quality news coverage.

Validating or refuting this conjecture requires empirical analysis linking market capitalization, news coverage characteristics, and the accuracy or predictive value of LLM-derived signals.

References

We conjecture that the quality of textual signals from large-cap companies is better than that from smaller companies due to the bias in news coverage.

— Large Language Model Agent in Financial Trading: A Survey (2408.06361 - Ding et al., 26 Jul 2024) in Section 4.1: Evaluation — Trading Strategy

Market-Cap Bias in LLM-Derived Textual Signals

Background

References

Related Problems