Biased MCTS/Expert Iteration questions

fuerchter · 09-08-2022, 07:21 AM

(09-07-2022, 09:27 AM)DennisSoemers Wrote: It should actually resume training from where it left off if you use the same command line arguments (or similar ones) as before, specifically if the --out-dir is the same. It should see that files it needs to resume (like the experience buffer, and all the checkpoints of features and weights) are already there and use them.

I tried this yesterday and it does work. Checkpoints restart from no. 0, though, so it's not surprising that I missed it without looking at the code.

(09-07-2022, 09:27 AM)DennisSoemers Wrote: I normally evaluate the playing strength of my agents using the command line as well

I'm currently still using "Compare Agents" (so far I have found a checkpoint with a ~47% win rate vs MC-GRAVE, at --thinking-time 6), but I intend to switch to the CLI eventually.

Thanks again for the input. I'm hoping I can get a winning agent on my own from here on out (and perhaps this thread can be useful for anyone else looking to train an agent).