MCTS.py
README.md
config.yml
jax_mcts.py
jax_mcts.yml
jax_mcts_separate_env.py
mcts_config.yml
train_sb3.py

README.md

/train

First of all, this is NOT a submodule. It is a separate area for Python scripts related to training. We want to keep a record of some important training scripts, but they are not polished enough to be merged into the main library. This folder exists for documentation purposes only. Please proceed with caution.

.jax_mcts.py

The best method so far is to implement MCTS plus a neural network in JAX, in the style of AlphaZero, because we have the luxury of encoding the game into our policy. This is the general training script; in practice, you can tweak it however you like.
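For readers unfamiliar with AlphaZero-style search, the core of the tree policy is the PUCT selection rule, which combines a child's mean value with an exploration bonus weighted by the network's prior. The following is a minimal plain-Python sketch of that rule, not the code in jax_mcts.py: the function names, the `c_puct` default, and the `+ 1` inside the square root (a common variant that lets priors break ties at an unvisited node) are all assumptions made for illustration.

```python
import math


def puct_scores(priors, visit_counts, value_sums, c_puct=1.5):
    """Return the PUCT score of every child action of a single node.

    All names and the default c_puct are hypothetical, chosen for this sketch.
    """
    total_visits = sum(visit_counts)
    scores = []
    for prior, visits, value_sum in zip(priors, visit_counts, value_sums):
        # Mean value of the child; unvisited children default to 0.
        q = value_sum / visits if visits > 0 else 0.0
        # Exploration bonus: large for high-prior, rarely visited children.
        u = c_puct * prior * math.sqrt(total_visits + 1) / (1 + visits)
        scores.append(q + u)
    return scores


def select_action(priors, visit_counts, value_sums, c_puct=1.5):
    """Pick the index of the child with the highest PUCT score."""
    scores = puct_scores(priors, visit_counts, value_sums, c_puct)
    return max(range(len(scores)), key=scores.__getitem__)
```

At a fresh node the prior alone decides which child is tried first; as visit counts grow, the bonus shrinks and the mean value dominates. A JAX implementation would typically express the same arithmetic over arrays so it can be batched and compiled.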

.train_sb3.py

The most straightforward, and perhaps laziest, way to solve an RL problem is to plug the environment into a general-purpose RL trainer and let it find a policy for the players. This means wrapping the environment in the OpenAI Gym interface, which several excellent implementations (stable-baselines3, etc.) can connect to.

This script uses our gym wrapper gym_env to train using stable_baselines3.
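To illustrate what wrapping an environment in the Gym interface involves, here is a small self-contained sketch of the reset/step contract, following the Gymnasium convention of a five-element return from `step`. It is not our gym_env wrapper: `CountdownEnv` and `run_episode` are hypothetical names for a toy game, and a real wrapper would additionally subclass `gymnasium.Env` and declare `observation_space` and `action_space`, which this sketch skips to stay dependency-free.

```python
class CountdownEnv:
    """Hypothetical toy game: subtract 1 or 2 until the counter reaches 0."""

    def __init__(self, start=5):
        self.start = start
        self.remaining = start

    def reset(self, seed=None):
        # Gymnasium-style reset returns (observation, info).
        self.remaining = self.start
        return self.remaining, {}

    def step(self, action):
        if action not in (1, 2):
            raise ValueError("action must be 1 or 2")
        self.remaining = max(0, self.remaining - action)
        terminated = self.remaining == 0
        # Reward is given only on the move that ends the game.
        reward = 1.0 if terminated else 0.0
        # Gymnasium-style step returns
        # (observation, reward, terminated, truncated, info).
        return self.remaining, reward, terminated, False, {}


def run_episode(env, policy):
    """Play one episode with `policy(obs) -> action`; return (return, steps)."""
    obs, _ = env.reset()
    total_reward, steps, done = 0.0, 0, False
    while not done:
        obs, reward, terminated, truncated, _ = env.step(policy(obs))
        total_reward += reward
        steps += 1
        done = terminated or truncated
    return total_reward, steps
```

Once an environment exposes this contract, a general-purpose trainer only needs to call `reset` and `step`; it does not need to know anything about the game itself.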

.jax_mcts.py

The JAX training script. It runs, and it is our best effort so far (given the right hyperparameters, of course).
