Embedding-based similarity as a domain-agnostic pairwise feature
Investigate the effectiveness of using cosine similarity between the base model’s class embeddings as the domain-agnostic label-similarity component in the REPAIR reranker’s pairwise feature vector when structured external knowledge (e.g., taxonomic distance, WordNet path similarity, or HPO phenotype similarity) is unavailable, and determine its impact on reranking performance across datasets.
References
A natural alternative in the absence of external knowledge is cosine similarity between the base model's class embeddings, which we leave for future work.
— Beyond Logit Adjustment: A Residual Decomposition Framework for Long-Tailed Reranking
(2604.01506 - Wang et al., 2 Apr 2026) in Appendix, Section Limitations