Dice Question Streamline Icon: https://streamlinehq.com

Unknown causative features in drug activity prediction

Identify currently unknown causative molecular features that determine small-molecule activity against their cognate targets and incorporate these features into the representations used by AI models to improve predictive performance.

Information Square Streamline Icon: https://streamlinehq.com

Background

In the discussion on feature relevance, the authors note that predictive features can correlate with but not cause the property of interest, and express concern that some causative features may be missing from models either because they are unknown or cannot be effectively computed. They illustrate related known but hard-to-quantify factors (e.g., ligand and pocket desolvation, induced-fit effects) that limit the accuracy of scoring functions, underscoring the need to discover and include missing causative determinants.

References

A more concerning possibility is that some of the causative features are not modelled. This could happen because we do not know about them.

Data-centric challenges with the application and adoption of artificial intelligence for drug discovery (2407.05150 - Ghislat et al., 6 Jul 2024) in Section 2.4 Irrelevant data