Reinforcement learning for high performance computing in heterogeneous networks