We approach the task of network congestion control in datacenters using Reinforcement Learning (RL). Successful algorithms can dramatically improve latency and overall throughput. Until today, no such learning-based have shown practical potential this domain. Evidently, most popular recent deployments rely on rule-based heuristics that are tested a predetermined set benchmarks. Consequently, th...