Optimal stopping is the problem of deciding when to stop a stochastic system obtain greatest reward, arising in numerous application areas such as finance, healthcare, and marketing. State-of-the-art methods for high-dimensional optimal involve approximating value function or continuation then using that approximation within greedy policy. Although policies can perform very well, they are gener...