Fard, N. E., and R. Selmic. “Consensus of Multi-Agent Reinforcement Learning Systems: The Effect of Immediate Rewards”. Journal of Robotics and Control (JRC), vol. 3, no. 2, Feb. 2022, pp. 115-27, doi:10.18196/jrc.v3i2.13082.