state-reps-planning

This repo is a playground for me to explore ideas about knowledge representation in the context of developing an AI. Most of these ideas come from, or are inspired by, Rich Sutton's reinforcement learning class and text.

Incomplete list of ideas explored:

- General Value Functions (GVFs)
  - Using GVFs as features
  - Learning a GVF that estimates the time until the episode ends in cart pole
- Generating features that are tested and thrown away according to how useful they are in achieving reward
- Adaptive per-feature step sizes that are updated through a gradient descent rule
  - Take one step forward and run gradient descent on the estimated TD error for the next step (does not really work)
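
As a rough illustration of the "time till episode end" GVF idea, here is a minimal sketch using linear TD(0). It is not taken from this repo's code: the deterministic chain environment stands in for cart pole so the example stays self-contained, and the names `one_hot` and `learn_time_to_termination` are hypothetical. The GVF is defined by a cumulant of 1 on every step, a discount of 1, and a value of 0 at termination, so its prediction is the number of steps remaining.

```python
import numpy as np


def one_hot(state, n_states):
    """Tabular feature vector: a single 1 at the index of the current state."""
    x = np.zeros(n_states)
    x[state] = 1.0
    return x


def learn_time_to_termination(n_states=5, episodes=200, alpha=0.5, gamma=1.0):
    """Learn a GVF predicting steps until termination with linear TD(0).

    Assumed toy environment (a stand-in for cart pole): the agent starts in
    state 0 and moves one state to the right each step; the episode ends
    when it steps past the last state.
    """
    w = np.zeros(n_states)  # weights of the linear GVF
    for _ in range(episodes):
        state = 0
        done = False
        while not done:
            next_state = state + 1
            done = next_state == n_states
            x = one_hot(state, n_states)
            cumulant = 1.0  # one unit of "time" per step
            # The prediction at termination is defined to be zero.
            next_value = 0.0 if done else w @ one_hot(next_state, n_states)
            td_error = cumulant + gamma * next_value - w @ x
            w += alpha * td_error * x  # TD(0) update
            state = next_state
    return w


weights = learn_time_to_termination()
print(np.round(weights, 3))
```

In this toy setting the learned weights should approach the true number of remaining steps from each state (5, 4, 3, 2, 1 for the default five-state chain). For cart pole itself, the one-hot features would need to be replaced by some encoding of the continuous state, and the episode length would no longer be deterministic, so the GVF would estimate an expectation under the behaviour policy.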


SwordShieldMouse/state-reps-planning
