We consider an extension to the restless multi-armed bandit (RMAB) problem with unknown arm dynamics, where exogenous global Markov process governs rewards distribution of each arm. Under state, evolves according Markovian rule, which is non-identical among different arms. At time, a player chooses out $N$ arms play, and receives random reward from finite set states. The are restless, that is, ...