https://github.com/koulanurag/ma-gym

A collection of multi agent environments based on OpenAI gym.
https://github.com/koulanurag/ma-gym

collaborative environment gym multi-agent openai-gym reinforcement-learning

Last synced: 3 months ago
JSON representation

A collection of multi agent environments based on OpenAI gym.

Host: GitHub
URL: https://github.com/koulanurag/ma-gym
Owner: koulanurag
License: apache-2.0
Created: 2019-06-10T15:28:41.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2024-07-07T15:52:18.000Z (12 months ago)
Last Synced: 2025-03-02T01:55:36.510Z (4 months ago)
Topics: collaborative, environment, gym, multi-agent, openai-gym, reinforcement-learning
Language: Python
Homepage: https://github.com/koulanurag/ma-gym/wiki
Size: 3.87 MB
Stars: 589
Watchers: 7
Forks: 106
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-deep-rl - ma-gym - A collection of multi agent environments based on OpenAI gym. (Environments)
awesome-marl-engineering - ma-gym - marl](https://github.com/koulanurag/minimal-marl), set of baselines for vanilla MARL problems by Anurag Koul (Engineering / Building Multi-agent Environments)

README

        # ma-gym

It's a collection of multi agent environments based on OpenAI gym. Also, you can use [**minimal-marl**](https://github.com/koulanurag/minimal-marl) to warm-start training of agents.

![Python package](https://github.com/koulanurag/ma-gym/workflows/Python%20package/badge.svg)

![Upload Python Package](https://github.com/koulanurag/ma-gym/workflows/Upload%20Python%20Package/badge.svg)

![Python Version](https://img.shields.io/pypi/pyversions/ma-gym)

[![Downloads](https://static.pepy.tech/badge/ma-gym)](https://pepy.tech/project/ma-gym)

[![Wiki Docs](https://img.shields.io/badge/-Wiki%20Docs-informational?style=flat)](https://github.com/koulanurag/ma-gym/wiki)

[![Papers using ma-gym](https://img.shields.io/badge/-Papers%20using%20ma--gym-9cf?style=flat&logo=googlescholar)](https://scholar.google.com/scholar?oi=bibs&hl=en&cites=14123576959169220642,12284637994392993807)

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/koulanurag/ma-gym/blob/master/examples/colab_example.ipynb)

## Installation

- Setup (important):

   ```bash

      pip install 'pip<24.1'

      pip install 'setuptools<=66'

      pip install 'wheel<=0.38.4'

   ```

- Install package:

   - Using PyPI:

      ```bash

      pip install ma-gym

      ```

   - Directly from source (recommended):

      ```bash

      git clone https://github.com/koulanurag/ma-gym.git

      cd ma-gym

      pip install -e .

      ```

## Reference:

Please use this bibtex if you would like to cite it:

```

@misc{magym,

      author = {Koul, Anurag},

      title = {ma-gym: Collection of multi-agent environments based on OpenAI gym.},

      year = {2019},

      publisher = {GitHub},

      journal = {GitHub repository},

      howpublished = {\url{https://github.com/koulanurag/ma-gym}},

    }

```

## Usage:

```python

import gym

env = gym.make('ma_gym:Switch2-v0')

done_n = [False for _ in range(env.n_agents)]

ep_reward = 0

obs_n = env.reset()

while not all(done_n):

    env.render()

    obs_n, reward_n, done_n, info = env.step(env.action_space.sample())

    ep_reward += sum(reward_n)

env.close()

```

Please refer to [**Wiki**](https://github.com/koulanurag/ma-gym/wiki/Usage) for complete usage details

## Environments:

- [x] Checkers

- [x] Combat

- [x] PredatorPrey

- [x] Pong Duel  ```(two player pong game)```

- [x] Switch

- [x] Lumberjacks

- [x] TrafficJunction

```

Note : openai's environment can be accessed in multi agent form by prefix "ma_".Eg: ma_CartPole-v0

This returns an instance of CartPole-v0 in "multi agent wrapper" having a single agent. 

These environments are helpful during debugging.

```

Please refer to [Wiki](https://github.com/koulanurag/ma-gym/wiki/Environments) for more details.

## Zoo!

| __Checkers-v0__ | __Combat-v0__ | __Lumberjacks-v0__ |

|:---:|:---:|:---:|

|![Checkers-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/Checkers-v0.gif)|![Combat-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/Combat-v0.gif)|![Lumberjacks-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/Lumberjacks-v0.gif)|

| __PongDuel-v0__ | __PredatorPrey5x5-v0__ | __PredatorPrey7x7-v0__ |

| ![PongDuel-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/PongDuel-v0.gif) | ![PredatorPrey5x5-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/PredatorPrey5x5-v0.gif) | ![PredatorPrey7x7-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/PredatorPrey7x7-v0.gif) |

|                 __Switch2-v0__                 |                        __Switch4-v0__                         | __TrafficJunction4-v0__ |                                                             |

|  ![Switch2-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/Switch2-v0.gif)  |         ![Switch4-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/Switch4-v0.gif)|![TrafficJunction4-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/TrafficJunction4-v0.gif)|

| __TrafficJunction10-v0__ |

|![TrafficJunction10-v0.gif](https://raw.githubusercontent.com/koulanurag/ma-gym/master/static/gif/TrafficJunction10-v0.gif)|         |                                                              |

## Testing:

- Install: ```pip install -e ".[test]" ```

- Run: ```pytest```

## Acknowledgement:

- This project was initially developed to complement my research internship @ [SAS](https://www.sas.com/en_us/home.html) (Summer - 2019).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/koulanurag/ma-gym

Awesome Lists containing this project

README