This research gauges the ability of deep reinforcement learning (DRL) techniques to assist control conjugate heat transfer systems governed by coupled Navier–Stokes and equations. It uses a novel, “degenerate” version proximal policy optimization (PPO) algorithm, intended for situations where optimal be learnt neural network does not depend on state, as is notably case in open-loop problems. Th...