Multi-Agent Proximal Policy Optimization for Cooperative Cooking

Implemented and trained a Multi-Agent Proximal Policy Optimization (MAPPO) approach for cooperative cooking tasks in the Overcooked simulation environment, using centralized training with decentralized execution — agents learn independently while sharing the same environment.

← back to terminal