The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of model-based, iterative learning strategies we propose an alternative definition and construction temporal difference error arising in Policy Iteration strategies. such architectures error computed via evolution Hamiltonian function (or, possibly, its integral) along trajectories closed-loop s...