
Consistency Flow Matching: Defining Straight Flows with Velocity Consistency (2407.02398v1)

Published 2 Jul 2024 in cs.CV

Abstract: Flow matching (FM) is a general framework for defining probability paths via Ordinary Differential Equations (ODEs) to transform between noise and data samples. Recent approaches attempt to straighten these flow trajectories to generate high-quality samples with fewer function evaluations, typically through iterative rectification methods or optimal transport solutions. In this paper, we introduce Consistency Flow Matching (Consistency-FM), a novel FM method that explicitly enforces self-consistency in the velocity field. Consistency-FM directly defines straight flows starting from different times to the same endpoint, imposing constraints on their velocity values. Additionally, we propose a multi-segment training approach for Consistency-FM to enhance expressiveness, achieving a better trade-off between sampling quality and speed. Preliminary experiments demonstrate that our Consistency-FM significantly improves training efficiency by converging 4.4x faster than consistency models and 1.7x faster than rectified flow models while achieving better generation quality. Our code is available at: https://github.com/YangLing0818/consistency_flow_matching

Authors (9)
  1. Ling Yang (88 papers)
  2. Zixiang Zhang (3 papers)
  3. Zhilong Zhang (20 papers)
  4. Xingchao Liu (28 papers)
  5. Minkai Xu (40 papers)
  6. Wentao Zhang (261 papers)
  7. Chenlin Meng (39 papers)
  8. Stefano Ermon (279 papers)
  9. Bin Cui (165 papers)
Citations (5)

Summary

Consistency Flow Matching: A Novel Approach to Enhance Generative Model Efficiency

The paper introduces Consistency Flow Matching (Consistency-FM), a method for improving the efficiency of flow-based generative models. Its goal is to generate high-quality samples with fewer function evaluations, which it achieves by enforcing a self-consistency property on the velocity field of the Ordinary Differential Equation (ODE) that transports noise samples to data samples.

Core Concepts and Methodology

Flow matching (FM) is central to this paper. FM learns a vector field that defines an ODE whose trajectories transport noise samples to the data distribution. Prior approaches struggle to balance computational cost against sampling quality: existing methods such as Consistency Models (CMs) and Rectified Flow either require computationally expensive optimal transport plans or suffer from error accumulation in their iterative training procedures.
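For reference, a minimal sketch of a standard flow matching training step with a linear interpolation path is shown below. It assumes a PyTorch-style velocity network velocity_net(x, t); the names and setup are illustrative and not taken from the paper's code.

```python
import torch

def flow_matching_loss(velocity_net, x1):
    """Standard flow matching loss on a data batch x1, using the linear path
    x_t = (1 - t) * x0 + t * x1 with noise x0 ~ N(0, I), whose target
    velocity is the constant x1 - x0."""
    x0 = torch.randn_like(x1)                      # noise endpoint of the path
    t = torch.rand(x1.shape[0], device=x1.device)  # one random time per sample
    t_ = t.view(-1, *([1] * (x1.dim() - 1)))       # reshape for broadcasting
    xt = (1.0 - t_) * x0 + t_ * x1                 # point on the straight path
    target_v = x1 - x0                             # velocity of the linear path
    pred_v = velocity_net(xt, t)                   # model's velocity estimate
    return ((pred_v - target_v) ** 2).mean()
```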

Consistency-FM addresses these issues by directly defining straight flows with consistent velocities. It extends prior work in two ways (a sketch of the resulting training objective follows this list):

  • Self-Consistency in Velocity Fields: Consistency-FM defines straight flows by requiring the velocity to stay constant along the trajectory from any starting time to the shared endpoint, avoiding full trajectory reconstructions and explicit optimal transport estimates.
  • Multi-Segment Optimization: The time interval is divided into multiple segments, each trained to be self-consistent. The resulting piecewise-linear paths are more expressive and better suited to complex data distributions.
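A rough sketch of how such a velocity-consistency objective can be formed is given below. It matches the straight-flow endpoint prediction f(x_t, t) = x_t + (1 - t) * v(x_t, t), and the velocity itself, at two nearby times, using an EMA copy of the network as the target. The single-segment form, variable names, and weighting alpha are illustrative assumptions, not the paper's exact implementation.

```python
import torch

def consistency_fm_loss(velocity_net, ema_net, x1, delta=1e-2, alpha=1.0):
    """Illustrative single-segment velocity-consistency loss.

    Matches the straight-flow endpoint prediction
        f(x_t, t) = x_t + (1 - t) * v(x_t, t)
    and the velocity itself at two nearby times t and t + delta, using an
    exponential-moving-average (EMA) copy of the network as the target."""
    x0 = torch.randn_like(x1)                                # shared noise endpoint
    t = torch.rand(x1.shape[0], device=x1.device) * (1.0 - delta)
    t_ = t.view(-1, *([1] * (x1.dim() - 1)))                 # broadcastable time
    xt = (1.0 - t_) * x0 + t_ * x1                           # point at time t
    xt_next = (1.0 - (t_ + delta)) * x0 + (t_ + delta) * x1  # point at t + delta

    v = velocity_net(xt, t)
    f = xt + (1.0 - t_) * v                                  # predicted endpoint from t
    with torch.no_grad():                                    # target branch: no gradients
        v_tgt = ema_net(xt_next, t + delta)
        f_tgt = xt_next + (1.0 - (t_ + delta)) * v_tgt       # predicted endpoint from t + delta
    endpoint_term = ((f - f_tgt) ** 2).mean()                # consistency of endpoints
    velocity_term = ((v - v_tgt) ** 2).mean()                # consistency of velocities
    return endpoint_term + alpha * velocity_term
```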

These constructions are grounded theoretically: the paper analyzes the consistency constraints alongside the approximation errors they introduce, providing a rigorous framework for training such models efficiently.

Experimental Insights

Empirical validation demonstrates significant gains. On standard image generation datasets (CIFAR-10, CelebA-HQ, and AFHQ-Cat), Consistency-FM converges 4.4 times faster than consistency models and 1.7 times faster than rectified flow models while achieving better generation quality. For instance, it reaches a Fréchet Inception Distance (FID) of 5.34 on CIFAR-10, lower than the FIDs reported for Consistency Models and Rectified Flow.

Practical and Theoretical Implications

The work has both immediate and longer-term implications. Practically, Consistency-FM improves on existing benchmarks by delivering high-quality samples at substantially lower computational cost. This efficiency is crucial for scaling generative models to higher-resolution tasks and to broader applications such as text-to-image generation.
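Because the learned flows are straight (or piecewise linear), only a handful of ODE integration steps are needed at sampling time. A minimal sketch of few-step Euler sampling, assuming a trained velocity network velocity_net (the step count and interface are illustrative assumptions):

```python
import torch

@torch.no_grad()
def sample(velocity_net, shape, n_steps=2, device="cpu"):
    """Few-step Euler integration of dx/dt = v(x, t) from noise (t = 0) to data (t = 1).
    Straight or piecewise-linear flows keep this accurate even for tiny n_steps."""
    x = torch.randn(shape, device=device)                          # start from pure noise
    ts = torch.linspace(0.0, 1.0, n_steps + 1, device=device)
    for i in range(n_steps):
        t = torch.full((shape[0],), float(ts[i]), device=device)   # per-sample time
        dt = float(ts[i + 1] - ts[i])
        x = x + dt * velocity_net(x, t)                            # one Euler step along the flow
    return x
```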

Theoretically, Consistency-FM reframes flow models by embedding consistency directly into the velocity field rather than into trajectory pathways, which could open new directions in efficient generative modeling. The results also suggest that velocity consistency can be combined with pretrained models, opening new avenues for distillation across model families.

Future Directions

Several research avenues remain for future exploration. Extending Consistency-FM to more complex generative tasks, such as text-to-image synthesis, is a natural next step. Moreover, distilling existing diffusion models (DMs) with the principles of Consistency-FM could change how such models are applied to large-scale datasets.

In conclusion, Consistency-FM marks a substantial methodological advance in generative modeling. By making velocity consistency a core design principle, it delivers strong sample quality with high computational efficiency and points toward further innovations in efficient generative AI.
