League evaluation for partial observable environments by xluox · Pull Request #63 · Farama-Foundation/MicroRTS-Py

xluox · 2022-02-24T02:43:04Z

This PR adds support for multiple maps in league evaluation and for running evaluation during training. Some of the code is specific to partially observable training.
Some changes may no longer fit the current codebase because of the recent PRs.
It needs review to extract the useful portions to merge into master.

Co-authored-by: Costa Huang <costa.huang@outlook.com>

vwxyzjn

Hey @xluox thanks for preparing the PR. I have left some comments on which components should be kept. To make it easier, I also suggest making a new branch off of the latest master and including these components.

vwxyzjn · 2022-02-25T14:29:09Z

experiments/new_league.py

        help='if toggled, the database will be updated')
    parser.add_argument('--cuda', type=lambda x: bool(strtobool(x)), default=True, nargs='?', const=True,
        help='if toggled, cuda will not be enabled by default')
+    parser.add_argument('--maps', nargs='+', default=["maps/16x16/basesWorkers16x16B.xml","maps/16x16/basesWorkers16x16C.xml","maps/16x16/basesWorkers16x16D.xml", "maps/16x16/basesWorkers16x16E.xml", "maps/16x16/basesWorkers16x16F.xml"], # [],


The map-related changes should be incorporated into the master.
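
For readers unfamiliar with the flag, here is a minimal sketch of how an `nargs='+'` argument like `--maps` collects several map paths into a list. The `build_parser` helper and the shortened default list are assumptions made for illustration; they are not the PR's actual code.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical helper: only the --maps flag from the diff is reproduced here.
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--maps",
        nargs="+",  # one or more values are gathered into a list
        default=[
            "maps/16x16/basesWorkers16x16B.xml",
            "maps/16x16/basesWorkers16x16C.xml",
        ],
        help="one or more map files to evaluate on",
    )
    return parser


# Passing two paths after --maps yields a two-element list.
args = build_parser().parse_args(
    ["--maps", "maps/16x16/basesWorkers16x16D.xml", "maps/16x16/basesWorkers16x16E.xml"]
)
for map_path in args.maps:
    print(map_path)
```

When `--maps` is omitted, `args.maps` falls back to the default list, so existing invocations keep working.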

vwxyzjn · 2022-02-25T14:29:18Z

experiments/new_league.py


 class Match:
-    def __init__(self, partial_obs: bool, match_up=None):
+    def __init__(self, partial_obs: bool, match_up=None, map_path="maps/16x16/basesWorkers16x16A.xml"):


The map-related changes should be incorporated into the master.

vwxyzjn · 2022-02-25T14:29:23Z

experiments/new_league.py

        built_in_ais2=None
        rl_ai=None
        rl_ai2=None
+        self.map_path = map_path


The map-related changes should be incorporated into the master.

vwxyzjn · 2022-02-25T14:29:28Z

experiments/new_league.py

                max_steps=max_steps,
                render_theme=2,
                ai2s=built_in_ais,
-                map_paths=["maps/16x16/basesWorkers16x16A.xml"],


The map-related changes should be incorporated into the master.
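
Taken together, the three `Match` hunks above replace a hardcoded map with a per-match `map_path`. The sketch below shows one way this could fit together; it is a simplified stand-in, not the PR's code. The `env_kwargs` and `schedule` helpers are hypothetical, and the assumption that the environment takes `partial_obs` and `map_paths` keyword arguments is inferred from the removed line.

```python
DEFAULT_MAP = "maps/16x16/basesWorkers16x16A.xml"


class Match:
    """Simplified stand-in for the league's Match class (environment creation omitted)."""

    def __init__(self, partial_obs: bool, match_up=None, map_path: str = DEFAULT_MAP):
        self.partial_obs = partial_obs
        self.match_up = match_up
        self.map_path = map_path

    def env_kwargs(self) -> dict:
        # Hypothetical helper. Assumption: the environment accepts a list of
        # map paths through `map_paths`, as the removed hardcoded line suggests.
        return {"partial_obs": self.partial_obs, "map_paths": [self.map_path]}


def schedule(maps, partial_obs: bool, match_up=None):
    # Hypothetical helper: create one Match per map for the same match-up.
    return [Match(partial_obs, match_up, map_path=m) for m in maps]


matches = schedule(
    ["maps/16x16/basesWorkers16x16B.xml", "maps/16x16/basesWorkers16x16C.xml"],
    partial_obs=True,
)
for match in matches:
    print(match.env_kwargs())
```

Keeping the old map as the default value means callers that do not pass `map_path` behave as before.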

vwxyzjn · 2022-02-25T14:30:06Z

experiments/ppo_gridnet.py

                torch.save(agent.state_dict(), f"models/{experiment_name}/{global_step}.pt")
                wandb.save(f"models/{experiment_name}/agent.pt", base_path=f"models/{experiment_name}", policy="now")
-                subprocess.Popen(["python", "new_league.py", "--evals", f"models/{experiment_name}/{global_step}.pt", "--update-db", "false"])
+                subprocess.Popen(["python", "new_league.py", "--evals", f"models/{experiment_name}/{global_step}.pt", "--update-db", "false", "--partial-obs", str(args.partial_obs)])


This change is needed and should be kept.
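
As a rough illustration of why `str(args.partial_obs)` works here, the sketch below builds the evaluation command and parses it back with a boolean converter. The `league_command` and `parse_league_args` helpers are hypothetical, the flag definitions on the receiving side are assumptions modeled on the `--cuda` argument shown earlier, and `strtobool` is reimplemented locally because `distutils` was removed in Python 3.12.

```python
import argparse
import sys


def strtobool(value: str) -> bool:
    """Local stand-in for distutils.util.strtobool, returning a bool."""
    v = value.strip().lower()
    if v in ("y", "yes", "t", "true", "on", "1"):
        return True
    if v in ("n", "no", "f", "false", "off", "0"):
        return False
    raise ValueError(f"invalid truth value: {value!r}")


def league_command(model_path: str, partial_obs: bool) -> list:
    # Hypothetical helper. str(True) == "True", which strtobool accepts.
    # sys.executable is used instead of "python" so the child process
    # runs under the same interpreter as the training script.
    return [
        sys.executable, "new_league.py",
        "--evals", model_path,
        "--update-db", "false",
        "--partial-obs", str(partial_obs),
    ]


def parse_league_args(argv):
    # Assumed flag definitions for the receiving script, for illustration only.
    parser = argparse.ArgumentParser()
    parser.add_argument("--evals", nargs="+", default=[])
    parser.add_argument("--update-db", type=strtobool, default=True, nargs="?", const=True)
    parser.add_argument("--partial-obs", type=strtobool, default=False, nargs="?", const=True)
    return parser.parse_args(argv)


cmd = league_command("models/exp/1000.pt", partial_obs=True)
args = parse_league_args(cmd[2:])  # skip the interpreter and script name
print(args.partial_obs, args.update_db)
```

In the training script, the resulting list would be handed to `subprocess.Popen(cmd)` as in the diff above.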

vwxyzjn · 2022-02-25T14:32:18Z

experiments/ppo_gridnet.py

-    trueskill_df = pd.read_csv("league.csv")
-    trueskill_step_df = pd.read_csv("league.csv")
+    trueskill_df = pd.read_csv("po_league.csv")
+    trueskill_step_df = pd.read_csv("po_league.csv")


We shouldn't have to worry about this anymore because the new script contains an output path for the CSVs: https://github.com/vwxyzjn/gym-microrts/blob/3d7a42f46efbd39a0b806388b8a445fbee48d00f/experiments/ppo_gridnet.py#L240.

xluox and others added 2 commits December 16, 2021 20:52

Support multiple maps in league evaluation

f3549d1

Co-authored-by: Costa Huang <costa.huang@outlook.com>

Support evaluation in PO training

4332070

xluox requested a review from vwxyzjn February 24, 2022 02:43

vwxyzjn reviewed Feb 25, 2022

vwxyzjn changed the title ~~Po league evaluation~~ League evaluation for partial observable environments Feb 25, 2022

xluox wants to merge 2 commits into master from PO_League_Evaluation
