Changed everything to use q-network and refactored #5

fshcat · 2022-03-28T22:43:21Z

For the refactoring:

td_update calls were consolidated to train.py
run_training_game and run_game were split into different functions
added option to create a Board object from an array containing the desired initial position

Also fixed a bug with reward assignment.

td_update calls were consolidated to train.py, run_training_game and run_game were split off, created option to make Board object with board array

PedroContipelli

Looks great!

Changed everything to use q-network and refactored

a47c42c

td_update calls were consolidated to train.py, run_training_game and run_game were split off, created option to make Board object with board array

fshcat requested review from VicenteVivan, PedroContipelli and ethanpartidas March 28, 2022 22:43

PedroContipelli approved these changes Mar 30, 2022

View reviewed changes

PedroContipelli merged commit a0d1e9e into main Mar 30, 2022

fshcat deleted the q-network branch April 22, 2022 00:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changed everything to use q-network and refactored #5

Changed everything to use q-network and refactored #5

fshcat commented Mar 28, 2022

PedroContipelli left a comment

Changed everything to use q-network and refactored #5

Changed everything to use q-network and refactored #5

Conversation

fshcat commented Mar 28, 2022

PedroContipelli left a comment

Choose a reason for hiding this comment