Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Changed everything to use q-network and refactored #5

Merged
merged 1 commit into from
Mar 30, 2022

Conversation

fshcat
Copy link
Collaborator

@fshcat fshcat commented Mar 28, 2022

For the refactoring:

  • td_update calls were consolidated to train.py
  • run_training_game and run_game were split into different functions
  • added option to create a Board object from an array containing the desired initial position

Also fixed a bug with reward assignment.

td_update calls were consolidated to train.py, run_training_game and
run_game were split off, created option to make Board object with board
array
Copy link
Member

@PedroContipelli PedroContipelli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks great!

@PedroContipelli PedroContipelli merged commit a0d1e9e into main Mar 30, 2022
@fshcat fshcat deleted the q-network branch April 22, 2022 00:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants