Target-Free Compound Activity Prediction via Few-Shot Learning (2311.16328v1)
Abstract: Predicting the activities of compounds against protein-based or phenotypic assays using only a few known compounds and their activities is a common task in target-free drug discovery. Existing few-shot learning approaches are limited to predicting binary labels (active/inactive). However, in real-world drug discovery, degrees of compound activity are highly relevant. We study Few-Shot Compound Activity Prediction (FS-CAP) and design a novel neural architecture to meta-learn continuous compound activities across large bioactivity datasets. Our model aggregates encodings generated from the known compounds and their activities to capture assay information. We also introduce a separate encoder for the unknown compound. We show that FS-CAP surpasses traditional similarity-based techniques as well as other state of the art few-shot learning methods on a variety of target-free drug discovery settings and datasets.
- Recent advances in ligand-based drug design: relevance and utility of the conformationally sampled pharmacophore approach. Current computer-aided drug design, 7(1):10–22, 2011.
- Low data drug discovery with one-shot learning. ACS central science, 3(4):283–293, 2017.
- learn2learn: A library for Meta-Learning research. arXiv, August 2020.
- Why is tanimoto index an appropriate choice for fingerprint-based similarity calculations? Journal of cheminformatics, 7(1):1–13, 2015.
- The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature, 483(7391):603–607, 2012.
- Image-based profiling for drug discovery: due for a machine-learning upgrade? Nature Reviews Drug Discovery, 20(2):145–159, 2021.
- Meta-learning adaptive deep kernel gaussian processes for molecular property prediction. In NeurIPS 2022 AI for Science: Progress and Promises, 2022.
- Graph prototypical networks for few-shot learning on attributed networks. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 295–304, 2020.
- Model-agnostic meta-learning for fast adaptation of deep networks. In International conference on machine learning, pp. 1126–1135. PMLR, 2017.
- Conditional neural processes. In International Conference on Machine Learning, pp. 1704–1713. PMLR, 2018a.
- Neural processes. arXiv preprint arXiv:1807.01622, 2018b.
- Bindingdb in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic acids research, 44(D1):D1045–D1053, 2016.
- How phenotypic screening influenced drug discovery: lessons from five years of practice. Assay and drug development technologies, 15(6):239–246, 2017.
- Moltrans: Molecular interaction transformer for drug–target interaction prediction. Bioinformatics, 37(6):830–836, 2021.
- Principles of early drug discovery. British journal of pharmacology, 162(6):1239–1249, 2011.
- Mol2vec: unsupervised machine learning approach with chemical intuition. Journal of chemical information and modeling, 58(1):27–35, 2018.
- Concepts and applications of molecular similarity. Wiley, 1990.
- Improved protein–ligand binding affinity prediction with structure-based deep fusion inference. Journal of chemical information and modeling, 61(4):1583–1592, 2021.
- A deep learning model for cell growth inhibition ic50 prediction and its application for gastric cancer patients. International journal of molecular sciences, 20(24):6276, 2019.
- Rocs-derived features for virtual screening. Journal of computer-aided molecular design, 30(8):609–617, 2016.
- Khan, A. U. et al. Descriptors and their selection methods in qsar analysis: paradigm for drug design. Drug discovery today, 21(8):1291–1302, 2016.
- Attentive neural processes. arXiv preprint arXiv:1901.05761, 2019.
- Qphar: quantitative pharmacophore activity relationship: method and validation. Journal of cheminformatics, 13(1):1–14, 2021.
- Quantifying sources of uncertainty in drug discovery predictions with probabilistic models. Artificial Intelligence in the Life Sciences, 1:100004, 2021.
- Metadta: Meta-learning-based drug-target binding affinity prediction. In ICLR Machine Learning for Drug Discovery Workshop, 2022.
- Simultaneous regression and classification for drug sensitivity prediction using an advanced random forest method. Scientific Reports, 12(1):1–13, 2022.
- Mol-bert: An effective molecular representation with bert for molecular property prediction. Wireless Communications and Mobile Computing, 2021, 2021.
- Quantitative structure–activity relationship for prediction of the toxicity of phenols on photobacterium phosphoreum. Bulletin of environmental contamination and toxicology, 89(1):27–31, 2012.
- Strategies for indirect computer-aided drug design. Pharmaceutical research, 10(4):475–486, 1993.
- The power metric: a new statistically robust enrichment-type metric for virtual screening applications with early recovery capability. Journal of Cheminformatics, 9(1):1–11, 2017.
- Predicting binding from screening assays with transformer network embeddings. Journal of Chemical Information and Modeling, 60(9):4191–4199, 2020.
- Meta networks. In International Conference on Machine Learning, pp. 2554–2563. PMLR, 2017.
- Meta-learning initializations for low-resource drug discovery. ChemRxiv, 2020.
- Deepdta: deep drug–target binding affinity prediction. Bioinformatics, 34(17):i821–i829, 2018.
- Artificial intelligence in drug discovery and development. Drug discovery today, 26(1):80, 2021.
- Protein–ligand scoring with convolutional neural networks. Journal of chemical information and modeling, 57(4):942–957, 2017.
- Optimization as a model for few-shot learning. International Conference on Learning Representations, 2016.
- Extended-connectivity fingerprints. Journal of chemical information and modeling, 50(5):742–754, 2010.
- Pacoh: Bayes-optimal meta-learning with pac-guarantees. In International Conference on Machine Learning, pp. 9116–9126. PMLR, 2021.
- A generalized framework for embedding-based few-shot learning methods in drug discovery. ELLIS Machine Learning for Molecules workshop, 2021.
- Non-gaussian gaussian processes for few-shot regression. Advances in Neural Information Processing Systems, 34:10285–10298, 2021.
- Prototypical networks for few-shot learning. Advances in neural information processing systems, 30, 2017.
- Multi-scale representation learning on proteins. Advances in Neural Information Processing Systems, 34:25244–25255, 2021.
- Fs-mol: A few-shot learning dataset of molecules. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2), 2021.
- Development and evaluation of a deep learning model for protein–ligand binding affinity prediction. Bioinformatics, 34(21):3666–3674, 2018.
- Recent advances in phenotypic drug discovery. F1000Research, 9, 2020.
- Adaptive deep kernel learning. arXiv preprint arXiv:1905.12131, 2019.
- Virtual screening workflow development guided by the “receiver operating characteristic” curve approach. application to high-throughput docking on metabotropic glutamate receptor subtype 4. Journal of medicinal chemistry, 48(7):2534–2547, 2005.
- Applications of machine learning in drug discovery and development. Nature reviews Drug discovery, 18(6):463–477, 2019.
- Visualizing data using t-sne. Journal of machine learning research, 9(11), 2008.
- Few-shot learning for low-data drug discovery. Journal of Chemical Information and Modeling, 2022.
- Matching networks for one shot learning. Advances in neural information processing systems, 29, 2016.
- Pubchem’s bioassay database. Nucleic acids research, 40(D1):D400–D412, 2012.
- Generalizing from a few examples: A survey on few-shot learning. ACM computing surveys (csur), 53(3):1–34, 2020.
- Resatom system: protein and ligand affinity prediction model based on deep learning. arXiv preprint arXiv:2105.05125, 2021.
- Hit identification and optimization in virtual screening: Practical recommendations based on a critical literature analysis: Miniperspective. Journal of medicinal chemistry, 56(17):6560–6572, 2013.
- Peter Eckmann (6 papers)
- Jake Anderson (1 paper)
- Rose Yu (84 papers)
- Michael K. Gilson (6 papers)