Fard, N. E., & Selmic, R. (2022). Consensus of Multi-agent Reinforcement Learning Systems: The Effect of Immediate Rewards. Journal of Robotics and Control (JRC), 3(2), 115–127. https://doi.org/10.18196/jrc.v3i2.13082