
Effective control of two-dimensional Rayleigh–Bénard convection: invariant multi-agent reinforcement learning is all you need (2304.02370v2)

Published 5 Apr 2023 in physics.flu-dyn and cs.LG

Abstract: Rayleigh-Bénard convection (RBC) is a recurrent phenomenon in several industrial and geoscience flows and a well-studied system from a fundamental fluid-mechanics viewpoint. However, controlling RBC, for example by modulating the spatial distribution of the bottom-plate heating in the canonical RBC configuration, remains a challenging topic for classical control-theory methods. In the present work, we apply deep reinforcement learning (DRL) for controlling RBC. We show that effective RBC control can be obtained by leveraging invariant multi-agent reinforcement learning (MARL), which takes advantage of the locality and translational invariance inherent to RBC flows inside wide channels. The MARL framework applied to RBC allows for an increase in the number of control segments without encountering the curse of dimensionality that would result from a naive increase in the DRL action-size dimension. This is made possible by the MARL ability for re-using the knowledge generated in different parts of the RBC domain. We show in a case study that MARL DRL is able to discover an advanced control strategy that destabilizes the spontaneous RBC double-cell pattern, changes the topology of RBC by coalescing adjacent convection cells, and actively controls the resulting coalesced cell to bring it to a new stable configuration. This modified flow configuration results in reduced convective heat transfer, which is beneficial in several industrial processes. Therefore, our work both shows the potential of MARL DRL for controlling large RBC systems, as well as demonstrates the possibility for DRL to discover strategies that move the RBC configuration between different topological configurations, yielding desirable heat-transfer characteristics. These results are useful for both gaining further understanding of the intrinsic properties of RBC, as well as for developing industrial applications.


Summary

  • The paper introduces invariant MARL to overcome the curse of dimensionality in controlling two-dimensional Rayleigh–Bénard convection.
  • It segments the convection domain into pseudo-environments to localize control actions, effectively reducing the Nusselt number.
  • Comparative results show MARL outperforms single-agent methods, offering scalable strategies for complex fluid dynamics.

Effective Control of Two-Dimensional Rayleigh–Bénard Convection with Invariant Multi-Agent Reinforcement Learning

The paper applies deep reinforcement learning (DRL) to the control of two-dimensional Rayleigh–Bénard convection (RBC), a canonical problem in fluid dynamics. RBC, characterized by complex convective heat-transfer processes, is central to several industrial and geoscientific settings. The authors leverage invariant multi-agent reinforcement learning (MARL) to discover efficient control strategies for RBC, in particular in wide channels containing multiple convection cells, thereby addressing challenges such as the curse of dimensionality in the control space.
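For context, two standard nondimensional quantities frame the problem (these are textbook definitions and may differ in detail from the paper's specific nondimensionalization): the Rayleigh number measures the strength of the thermal forcing, and the Nusselt number quantifies the heat transfer that the controller seeks to reduce.

```latex
\mathrm{Ra} = \frac{g\,\beta\,\Delta T\,H^{3}}{\nu\,\kappa},
\qquad
\mathrm{Nu} = \frac{\langle q_{z}\rangle\,H}{k\,\Delta T}
```

Here g is the gravitational acceleration, β the thermal expansion coefficient, ΔT the imposed temperature difference between the plates, H the layer height, ν the kinematic viscosity, κ the thermal diffusivity, ⟨q_z⟩ the mean vertical heat flux, and k the thermal conductivity. Nu = 1 corresponds to purely conductive transport, so driving Nu toward 1 amounts to suppressing convective heat transfer.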

MARL Approach and Main Findings

The research adopts the MARL paradigm to manage RBC in domains with large aspect ratios and periodic boundary conditions. This contrasts with conventional single-agent reinforcement learning (SARL), whose joint action space grows with the number of simultaneously controlled actuators and therefore suffers from the curse of dimensionality.
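The contrast can be sketched in a few lines of Python (a minimal illustration with assumed names and shapes, not the paper's implementation): a single agent must emit one joint action covering every heating segment, whereas invariant MARL evaluates one shared policy once per segment on a local observation, so the per-agent action size stays fixed as segments are added.

```python
import numpy as np

N_SEGMENTS = 10  # number of independently heated bottom-plate segments (illustrative)

def sarl_action(policy, global_obs):
    # Single-agent RL: one network maps the full observation to all segment
    # heating values at once, so the action dimension scales with N_SEGMENTS.
    return policy(global_obs)  # expected shape: (N_SEGMENTS,)

def marl_actions(shared_policy, local_observations):
    # Invariant MARL: the same policy is applied to each local observation,
    # so each agent outputs a single value regardless of how many segments exist.
    return np.array([shared_policy(obs) for obs in local_observations])
```

Because the shared policy only ever sees a local window, experience gathered anywhere along the translationally invariant domain improves the behaviour of every agent.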

  • MARL efficacy: By exploiting the inherent translational invariance of RBC, the MARL framework achieves effective control through segmentation of the domain into pseudo-environments. This segmentation enables localized control actions that collectively steer the flow without burdening the learning process with unnecessary complexity (see the sketch after this list).
  • Control-strategy discovery: The MARL approach discovers an intricate control strategy that destabilizes the spontaneous RBC double-cell pattern, prompts coalescence of adjacent convection cells, and then actively drives the merged cell to a new stable configuration with reduced convective heat transfer. The resulting reduction in the Nusselt number demonstrates the effectiveness of the control and is directly relevant to industrial processes in which minimizing heat transfer is advantageous.
  • Comparison with SARL: Within the same training budget, SARL fails to learn a comparably effective control policy because it cannot handle the enlarged joint action space efficiently. This underscores MARL's potential in complex fluid-control scenarios, where multi-agent frameworks substantially accelerate learning and strategy optimization.
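The following Python sketch illustrates how such a pseudo-environment decomposition and a Nusselt-based reward could be organized. All names, shapes, and the Nusselt expression are illustrative assumptions; the paper itself couples proximal-policy-optimization agents to a spectral RBC solver rather than to these toy routines.

```python
import numpy as np

def split_into_pseudo_envs(field, n_agents):
    """Split a 2-D field of shape (nz, nx) into n_agents local windows along the
    periodic x-direction. Translational invariance makes the windows statistically
    equivalent, which is what allows one shared policy to be reused by every agent."""
    width = field.shape[1] // n_agents
    return [field[:, i * width:(i + 1) * width] for i in range(n_agents)]

def nusselt_number(u_z, temperature, kappa):
    """Illustrative volume-averaged Nusselt number for a unit-height,
    unit-temperature-drop nondimensionalization: Nu = 1 + <u_z T> / kappa.
    The exact expression depends on the solver's scaling."""
    return 1.0 + np.mean(u_z * temperature) / kappa

def marl_control_step(shared_policy, u_z, temperature, n_agents, kappa, nu_baseline):
    """One control step: each agent observes only its own window, all agents share the
    same policy weights, and the common reward grows as the global Nusselt number drops."""
    local_obs = split_into_pseudo_envs(temperature, n_agents)
    actions = [shared_policy(obs) for obs in local_obs]  # per-segment bottom-plate heating
    reward = nu_baseline - nusselt_number(u_z, temperature, kappa)
    return actions, reward
```

Sharing both the policy weights and the reward across agents keeps the learning problem roughly the size of a single-segment problem even as the number of actuated segments grows, which is the mechanism behind the scalability reported in the paper.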

Practical and Theoretical Implications

The paper's findings carry implications on several fronts:

  • Industrial Applications: The ability to control and optimize heat-transfer properties of RBC has direct applications in improving thermal management systems. Industries reliant on thermal regulation can leverage the paper's outcomes to enhance energy efficiency.
  • Future Research Directions: The success of MARL in this simplified RBC model sets the stage for future exploration in more complex and three-dimensional configurations. This could eventually lead to more effective control mechanisms in turbulent flows, not just for RBC but across various fluid dynamic systems.
  • Theoretical Contributions: On a theoretical level, the paper validates MARL's potential to overcome the curse of dimensionality in fluid control problems. It exemplifies how localized control through quasi-independent pseudo-environments can be synthesized into a holistic control mechanism.

Conclusion

The paper demonstrates a substantive advancement in fluid-control methodologies based on neural-network frameworks, tailored to complex, high-dimensional systems such as RBC. It highlights how leveraging system symmetries and invariances with multi-agent approaches such as MARL can yield efficient and scalable control solutions. This examination of RBC control via DRL and MARL not only enriches the fluid-mechanics discipline but also sets a template for future studies aiming to harness artificial intelligence for dynamic-system optimization. The work further represents a step toward bridging machine-learning techniques with traditional fluid dynamics, reinforcing interdisciplinary synergies aimed at solving both classical and contemporary engineering challenges.
