Multi-Agent Proximal Policy Optimization for Cooperative Cooking
Implemented and trained a Multi-Agent Proximal Policy Optimization (MAPPO) approach for cooperative cooking tasks in the Overcooked simulation environment, using centralized training with decentralized execution: during training, a centralized critic evaluates the joint information from all agents, while at execution time each agent acts only on its own local observation of the shared environment.
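The split between centralized training and decentralized execution can be illustrated with a minimal sketch. It is not the project's actual code. The linear models, the dimensions (`OBS_DIM`, `N_ACTIONS`, `N_AGENTS`), the use of one separate actor per agent, and all class and function names are assumptions chosen for illustration. It shows each actor reading only its own observation, the critic reading the concatenated observations, and the standard PPO clipped surrogate term for a single sample.

```python
import math
import random


def linear(weights, bias, x):
    """Apply a dense layer: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


class Actor:
    """Decentralized policy: sees only its own agent's local observation."""

    def __init__(self, obs_dim, n_actions, rng):
        self.w = [[rng.uniform(-0.1, 0.1) for _ in range(obs_dim)]
                  for _ in range(n_actions)]
        self.b = [0.0] * n_actions

    def action_probs(self, local_obs):
        return softmax(linear(self.w, self.b, local_obs))


class CentralCritic:
    """Centralized value function: sees all agents' observations at once."""

    def __init__(self, joint_dim, rng):
        self.w = [[rng.uniform(-0.1, 0.1) for _ in range(joint_dim)]]
        self.b = [0.0]

    def value(self, joint_obs):
        return linear(self.w, self.b, joint_obs)[0]


def ppo_clip_objective(new_prob, old_prob, advantage, clip_eps=0.2):
    """PPO clipped surrogate for one sample (to be maximized)."""
    ratio = new_prob / old_prob
    clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
    return min(ratio * advantage, clipped * advantage)


# Hypothetical sizes, not taken from the project.
OBS_DIM, N_ACTIONS, N_AGENTS = 4, 6, 2

rng = random.Random(0)
actors = [Actor(OBS_DIM, N_ACTIONS, rng) for _ in range(N_AGENTS)]
critic = CentralCritic(OBS_DIM * N_AGENTS, rng)

# Execution: each agent picks actions from its own observation only.
local_obs = [[rng.random() for _ in range(OBS_DIM)] for _ in range(N_AGENTS)]
probs = [actor.action_probs(obs) for actor, obs in zip(actors, local_obs)]

# Training: the critic scores the joint observation of all agents.
joint_obs = [x for obs in local_obs for x in obs]
joint_value = critic.value(joint_obs)
```

In a full training loop, the advantage passed to `ppo_clip_objective` would be estimated from the critic's joint values, so the critic is only needed during training and can be dropped when the agents are deployed.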