We train neural models to represent both the optimal policy (i.e. the thrust direction) and the value function (i.e. the time of flight) for a time-optimal, constant-acceleration low-thrust rendezvous. In both cases we develop and make use of a data augmentation technique we call backward generation of examples. We are thus able to produce and work with large datasets and to fully exploit the benefits of employing a deep learning framework. We achieve, in all cases, ...