Pessimistic Backward Policy for GFlowNets (2405.16012v3)
Abstract: This paper studies Generative Flow Networks (GFlowNets), which learn to sample objects proportionally to a given reward function through the trajectory of state transitions. In this work, we observe that GFlowNets tend to under-exploit the high-reward objects due to training on insufficient number of trajectories, which may lead to a large gap between the estimated flow and the (known) reward value. In response to this challenge, we propose a pessimistic backward policy for GFlowNets (PBP-GFN), which maximizes the observed flow to align closely with the true reward for the object. We extensively evaluate PBP-GFN across eight benchmarks, including hyper-grid environment, bag generation, structured set generation, molecular generation, and four RNA sequence generation tasks. In particular, PBP-GFN enhances the discovery of high-reward objects, maintains the diversity of the objects, and consistently outperforms existing methods.
- Flow network based generative models for non-iterative diverse candidate generation. Advances in Neural Information Processing Systems, 34:27381–27394, 2021.
- Multi-objective gflownets. In International Conference on Machine Learning, pages 14631–14653. PMLR, 2023.
- Synflownet: Towards molecule design with guaranteed synthesis pathways. In ICLR 2024 Generative and Experimental Perspectives for Biomolecular Design (GEM) Workshop, 2024.
- Biological sequence design with gflownets. In International Conference on Machine Learning, pages 9786–9801. PMLR, 2022.
- Let the flows tell: Solving graph combinatorial problems with gflownets. In A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, and S. Levine, editors, Advances in Neural Information Processing Systems, volume 36, pages 11952–11969. Curran Associates, Inc., 2023.
- Amortizing intractable inference in large language models. In International Conference on Learning Representations, 2024.
- Gflownet foundations. Journal of Machine Learning Research, 24(210):1–55, 2023.
- Trajectory balance: Improved credit assignment in gflownets. Advances in Neural Information Processing Systems, 35:5955–5967, 2022.
- Learning gflownets from partial episodes for improved convergence and stability. In International Conference on Machine Learning, pages 23467–23483. PMLR, 2023.
- Better training of gflownets with local credit and incomplete trajectories. In International Conference on Machine Learning, pages 26878–26890. PMLR, 2023.
- Learning energy decompositions for partial inference of gflownets. In International Conference on Learning Representations, 2024.
- Generative augmented flow networks. In International Conference on Learning Representations, 2023.
- Towards understanding and improving gflownet training. In Proceedings of the 40th International Conference on Machine Learning, Proceedings of Machine Learning Research. PMLR, 2023.
- Local search gflownets. In International Conference on Learning Representations, 2024.
- Maximum entropy gflownets with soft q-learning. In International Conference on Artificial Intelligence and Statistics, pages 2593–2601. PMLR, 2024.
- Discrete probabilistic inference as control in multi-path environments. arXiv preprint arXiv:2402.10309, 2024.
- Gflownets and variational inference. In International Conference on Learning Representations, 2023.
- Thompson sampling for improved exploration in gflownets. In ICML 2023 Structured Probabilistic Inference & Generative Modeling (SPIGM) Workshop, 2023.
- Qgfn: Controllable greediness with action values. arXiv preprint arXiv:2402.05234, 2024.
- Learning to scale logits for temperature-conditional gflownets. arXiv preprint arXiv:2310.02823, 2023.
- Sample-efficient multi-objective molecular optimization with gflownets. Advances in Neural Information Processing Systems, 36, 2023.
- Generative flow networks assisted biological sequence editing. In NeurIPS 2023 Generative AI and Biology (GenBio) Workshop, 2023.
- Generative flow networks for discrete probabilistic modeling. In International Conference on Machine Learning, pages 26412–26428. PMLR, 2022.
- Bayesian structure learning with generative flow networks. In Uncertainty in Artificial Intelligence, pages 518–528. PMLR, 2022.
- Robust scheduling with gflownets. In International Conference on Learning Representations, 2023.
- Ant colony sampling with gflownets for combinatorial optimization. arXiv preprint arXiv:2403.07041, 2024.
- A theory of continuous generative flow networks. In International Conference on Machine Learning, pages 18269–18300. PMLR, 2023.
- Diffusion generative flow samplers: Improving learning signals through partial trajectory optimization. In International Conference on Learning Representations, 2024.
- On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling. arXiv preprint arXiv:2402.05098, 2024.
- Why is tanimoto index an appropriate choice for fingerprint-based similarity calculations? Journal of cheminformatics, 7(1):1–13, 2015.
- Asynchronous methods for deep reinforcement learning. In Maria Florina Balcan and Kilian Q. Weinberger, editors, Proceedings of The 33rd International Conference on Machine Learning, volume 48 of Proceedings of Machine Learning Research, pages 1928–1937, New York, New York, USA, 20–22 Jun 2016. PMLR.
- Survey of variation in human transcription factors reveals prevalent dna binding changes. Science, 351(6280):1450–1454, 2016.
- Design-bench: Benchmarks for data-driven offline model-based optimization. In International Conference on Machine Learning, pages 21658–21676. PMLR, 2022.
- Adalead: A simple and robust adaptive greedy search algorithm for sequence design. arXiv preprint arXiv:2010.02141, 2021.
- How powerful are graph neural networks? In International Conference on Learning Representations, 2019.