Playground

Find complete thesis here.

Playground

First time? check out our website for more information, our Discord to join the community, or read the documentation to get started.

Playground hosts Pommerman, a clone of Bomberman built for AI research. People from around the world submit agents that they've trained to play. We run regular competitions on our servers and report the results and replays.

There are three variants for which you can enter your agents to compete:

FFA: Free For All where four agents enter and one leaves. It tests planning, tactics, and cunning. The board is fully observable.
Team (The NIPS '18 Competition environment): 2v2 where two teams of agents enter and one team wins. It tests planning, and tactics, and cooperation. The board is partially observable.
Team Radio: Like team in that a it's a 2v2 game. Differences are that the agents each have a radio that they can use to convey 2 words from a dictionary of size 8 each step.

Why should I participate?

You are a machine learning researcher and similarly recognize the lack of approachable benchmarks for this subfield. Help us rectify this and prove that your algorithm is better than others.
You want to contribute to multi agent or communication research. This is first and foremost a platform for doing research and everything that we do here will eventually get published with generous (or primary) support from us.
You really like(d) Bomberman and are fascinated by AI. This is a great opportunity to learn how to build intelligent agents.
You want the glory of winning an AI competition. We are going to publicize the results widely.
You think AI is dumb and can make a deterministic system that beats any learned agent.

How do I train agents?

Most open-source research tools in this domain have been designed with single agents in mind. We will be developing resources towards standardizing multi-agent learning. In the meantime, we have provided an example training script in train_with_tensorforce.py. It demonstrates how to wrap the Pommerman environments such that they can be trained with popular libraries like TensorForce.

How do I submit agents that I have trained?

The setup for submitting agents will be live shortly. It involves making a Docker container that runs your agent. We then read and upload your docker file via Github Deploy Keys. You retain the ownership and license of the agents. We will only look at your code to ensure that it is safe to run, doesn't execute anything malicious, and does not cheat. We are just going to run your agent in competitions on our servers. We have an example agent that already works and further instructions are in the games/a/docker directory.

Who is running this?

Cinjon Resnick, Denny Britz, David Ha, Jakob Foerster, and Wes Eldridge are the folks behind this. We are generously supported by a host of other people, including Kyunghyun Cho, Joan Bruna, Julian Togelius and Jason Weston. You can find us in the Discord.

Pommerman is immensely appreciate of the generous assistance it has received from Jane Street Capital, NVidia, Facebook AI Research, and Google Cloud.

How can I help?

To see the ways you can get invovled with the project head over to our Contributing Guide and checkout our current issues.

Contributing

We welcome contributions through pull request. See CONTRIBUTING for more details.

Code of Conduct

We strive for an open community. Please read over our CODE OF CONDUCT

Citation

If you use the Pommerman environment in your research, please cite us using the bibtex file in docs.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
battle_csv		battle_csv
csv		csv
docs		docs
evaluation		evaluation
examples		examples
json_games/Comm-Agent_Random-Agent		json_games/Comm-Agent_Random-Agent
manager		manager
models		models
notebooks		notebooks
outs		outs
pommerman		pommerman
scripts		scripts
thesis_text		thesis_text
.gitignore		.gitignore
.travis.yml		.travis.yml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
VERSION		VERSION
clean_csv_files.py		clean_csv_files.py
conf.py		conf.py
env.yml		env.yml
exec_training.sh		exec_training.sh
exec_training_james.sh		exec_training_james.sh
grep.sh		grep.sh
grep_per_parameter.sh		grep_per_parameter.sh
mkdocs.yml		mkdocs.yml
pylintrc		pylintrc
requirements.txt		requirements.txt
requirements_extra.txt		requirements_extra.txt
run_remote_training_random.sh		run_remote_training_random.sh
run_remote_training_random_without_bomb-25.sh		run_remote_training_random_without_bomb-25.sh
run_remote_training_random_without_bomb-50.sh		run_remote_training_random_without_bomb-50.sh
run_remote_training_random_without_bomb-75.sh		run_remote_training_random_without_bomb-75.sh
run_remote_training_selfplay.sh		run_remote_training_selfplay.sh
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Playground

Why should I participate?

How do I train agents?

How do I submit agents that I have trained?

Who is running this?

How can I help?

Contributing

Code of Conduct

Citation

About

Releases

Packages

Contributors 2

Languages

License

patriciamdr/ba-code

Folders and files

Latest commit

History

Repository files navigation

Playground

Why should I participate?

How do I train agents?

How do I submit agents that I have trained?

Who is running this?

How can I help?

Contributing

Code of Conduct

Citation

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages