Implement some algorithms of RL
Pytorch version: 1.8.1+cudnn10.1
- gym
- numpy
- pytorch: 1.8.1+cudnn10.1
- tensorboard
- UCB
- LinUCB
- REINFORCE
- A2C(Advantage Actor-Critic)
- A3C
- DQN
- DoubleDQN
- DuelingDQN
- D3QN(DuelingDoubleDQN)
- DDPG
- PPO
- SAC
- SAC_Discrete
- Dyna-Q
- MBPO
- PETS
to be continue...
We use Cartpole-v1 as our test environment.
python train_cartpole.py -a A2C
We use Pendulum-v1 as our test environment.
not yet completed...