DQN Boltzmann mountain car benchmark #219

Merged
merged 1 commit into from
Oct 25, 2018

Conversation

kengz
Owner

@kengz kengz commented Oct 25, 2018

Experiment Result

See PR #180 for an example

The data files to upload below are all created automatically in the data/ folder.

GitHub does not support .json or .csv uploads, but it does accept .txt. Please rename those files to .txt before uploading.

Abstract

DQN Boltzmann Mountain Car Benchmark

Methods

State normalization is not used, since positions in this environment are not invariant to translation.

To Reproduce

  1. JSON spec:
    dqn_boltzmann_mountain_car_spec.txt

  2. git SHA (contained in the file above): 67f44f06a9be5ca420578aefdfb1eb45a291450d

Results

All the results contributed will be added to the benchmark, and made publicly available on Dropbox.

Discussion (optional)

Describe some useful observations from the experiment.

It turns out that state normalization is harmful in this environment. This makes sense: normalizing discards information about the start and end points, which are absolute positions. The problem is not invariant to translation and scaling.
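A rough illustration of the point above, not the code used in this experiment. It assumes a plain z-score normalization over a batch of positions (the normalization scheme actually tried is not stated here), and the two trajectories are made-up values. The goal position of 0.5 is the one used by Gym's MountainCar-v0.

```python
import numpy as np

GOAL_POSITION = 0.5  # goal position in Gym's MountainCar-v0


def zscore(x):
    """Standardize a batch of observations to zero mean and unit variance.

    Assumed normalization scheme, for illustration only.
    """
    x = np.asarray(x, dtype=np.float64)
    return (x - x.mean()) / (x.std() + 1e-8)


# Two hypothetical position trajectories with the same shape,
# one in the valley and one shifted up the hill to the goal.
far_from_goal = np.array([-0.6, -0.5, -0.4, -0.3])
near_goal = np.array([0.2, 0.3, 0.4, 0.5])

# After normalization the two trajectories are indistinguishable,
# even though only one of them reaches the goal.
same_after_norm = np.allclose(zscore(far_from_goal), zscore(near_goal))
print("identical after normalization:", same_after_norm)
print("reaches goal (near):", near_goal.max() >= GOAL_POSITION)
print("reaches goal (far):", far_from_goal.max() >= GOAL_POSITION)
```

Under this assumed scheme the network would see the same inputs in both cases, so it could not tell how close the car is to the goal.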

@kengz kengz added the result experiment result upload label Oct 25, 2018
@kengz kengz merged commit 4500649 into master Oct 25, 2018
@kengz kengz deleted the dqn-car branch October 25, 2018 04:57