Multi-Agent Reinforcement Learning with Multi-Step Generative Models