[RLlib] DreamerV3 on PyTorch. #45463
Open
sven1977 wants to merge 58 commits into ray-project:master from sven1977:dreamer_v3_torch
+3,914
−106
Signed-off-by: sven1977 <[email protected]>
…mer_v3_torch Signed-off-by: sven1977 <[email protected]>
# Conflicts:
#   rllib/BUILD
#   rllib/algorithms/algorithm.py
#   rllib/algorithms/dreamerv3/dreamerv3_rl_module.py
#   rllib/algorithms/dreamerv3/tests/test_dreamerv3.py
#   rllib/algorithms/dreamerv3/tf/dreamerv3_tf_rl_module.py
#   rllib/algorithms/dreamerv3/utils/env_runner.py
#   rllib/core/rl_module/torch/torch_rl_module.py
#   rllib/utils/tests/test_torch_utils.py
sven1977 requested review from avnishn, ArturNiederfahrenhorst, maxpumperla, kouroshHakha and simonsays1980 as code owners May 21, 2024 12:16
…was responsible for breaking a test Signed-off-by: sven1977 <[email protected]>
… and Atari (Pong) on a V100 machine. 👍 Signed-off-by: sven1977 <[email protected]>
…model grads Signed-off-by: sven1977 <[email protected]>
…layer's (32x32 z state) distribution. Signed-off-by: sven1977 <[email protected]>
…or states and w/ videos) to the sampled ones. Signed-off-by: sven1977 <[email protected]>
…'s almost-glorot-normal initializer. Signed-off-by: sven1977 <[email protected]>
…dB; could be that WandB doesn't like lists of lists). Signed-off-by: sven1977 <[email protected]>
…d between calls to sample() Signed-off-by: sven1977 <[email protected]>
…mer_v3_torch Signed-off-by: sven1977 <[email protected]>
# Conflicts:
#   rllib/utils/metrics/metrics_logger.py
…the Stats.values list is the problem ... to be checked Signed-off-by: sven1977 <[email protected]>
…mer_v3_torch Signed-off-by: sven1977 <[email protected]>
# Conflicts:
#   rllib/utils/metrics/stats.py
…_on_new_api_stack_w_env_runner_and_connectorv2 Signed-off-by: sven1977 <[email protected]>
# Conflicts:
#   rllib/env/env_runner_group.py
Hi all, what is the current status of this PR?
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.
stale bot added the stale label Jan 31, 2025
Labels
rllib: RLlib related issues
rllib-newstack
stale: The issue is stale. It will be closed within 7 days unless there are further conversation
DreamerV3 on PyTorch.
Why are these changes needed?
Related issue number
Checks
… git commit -s) in this PR.
… scripts/format.sh to lint the changes in this PR.
… method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.