Fard, N. E. and Selmic, R. (2022) “Consensus of Multi-agent Reinforcement Learning Systems: The Effect of Immediate Rewards”, Journal of Robotics and Control (JRC), 3(2), pp. 115–127. doi: 10.18196/jrc.v3i2.13082.