AnnoDPO: Protein Functional Annotation Learning with Direct Preference Optimization (2506.07035v1)

Published 8 Jun 2025 in q-bio.BM and cs.AI

Abstract: Deciphering protein function remains a fundamental challenge in protein representation learning. The task presents significant difficulties for protein language models (PLMs) due to the sheer volume of functional annotation categories and the highly imbalanced distribution of annotated instances across biological ontologies. Inspired by the remarkable success of reinforcement learning from human feedback (RLHF) in large language model (LLM) alignment, we propose AnnoDPO, a novel multi-modal framework for protein function prediction that leverages Direct Preference Optimization (DPO) to enhance annotation learning. Our methodology addresses the dual challenges of annotation scarcity and category imbalance through preference-aligned training objectives, establishing a new paradigm for biological knowledge integration in protein representation learning.
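
For readers unfamiliar with DPO, the following is a minimal sketch of the generic per-example DPO loss, computed from log-probabilities of a preferred ("chosen") and a dispreferred ("rejected") output under a trainable policy and a frozen reference model. The abstract does not specify how AnnoDPO builds preference pairs or adapts the objective, so the function name `dpo_loss`, its parameter names, and the default `beta=0.1` are illustrative assumptions rather than the authors' implementation.

```python
import math


def softplus(z: float) -> float:
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # assumed value; controls how far the policy may drift from the reference
) -> float:
    """Generic DPO loss for one preference pair (hypothetical helper).

    loss = -log sigmoid(beta * [(log pi(y_w|x) - log pi_ref(y_w|x))
                                - (log pi(y_l|x) - log pi_ref(y_l|x))])
    """
    # How much more the policy favors each output than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    z = beta * (chosen_margin - rejected_margin)
    # -log sigmoid(z) equals softplus(-z).
    return softplus(-z)


if __name__ == "__main__":
    # Policy identical to the reference: the loss is log(2).
    print(dpo_loss(-5.0, -7.0, -5.0, -7.0))
    # Policy shifts probability toward the chosen output: the loss decreases.
    print(dpo_loss(-4.0, -8.0, -5.0, -7.0))
```

In a protein annotation setting, one plausible reading is that the "chosen" output is a correct functional annotation and the "rejected" output is an incorrect one for the same protein, but that pairing scheme is an assumption here, not something the abstract states.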

Authors (2)
