Reinforcement_learning_with_pytorch

Implement some algorithms of RL

Pytorch version: 1.8.1+cudnn10.1

Requierment

gym
numpy
pytorch: 1.8.1+cudnn10.1
tensorboard

Implemented algorithms

Bandit algorithms

UCB
LinUCB

Model-free algorithms

Model-based algorithms

Dyna-Q
MBPO
PETS

Causal RL algorithms

to be continue...

How to run

Discrete action environment

We use Cartpole-v1 as our test environment.

python train_cartpole.py -a A2C

Continuous action environment

We use Pendulum-v1 as our test environment.

not yet completed...

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Bandit		Bandit
model_based		model_based
model_free		model_free
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
train_cartpole.py		train_cartpole.py
train_pendulum.py		train_pendulum.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement_learning_with_pytorch

Requierment

Implemented algorithms

Bandit algorithms

Model-free algorithms

Model-based algorithms

Causal RL algorithms

How to run

Discrete action environment

Continuous action environment

Reference

About

Releases

Packages

Contributors 2

Languages

License

sherlockHSY/Reinforcement_learning_with_pytorch

Folders and files

Latest commit

History

Repository files navigation

Reinforcement_learning_with_pytorch

Requierment

Implemented algorithms

Bandit algorithms

Model-free algorithms

Model-based algorithms

Causal RL algorithms

How to run

Discrete action environment

Continuous action environment

Reference

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages