Skip to content

k-rishabh/donkey-car-RL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

63 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Comparative Analysis of Deep Reinforcement Learning Algorithms for Autonomous Driving in Simulation

Screenshot

This project aims to implement and compare the performance of three deep reinforcement learning (DRL) algorithms—Proximal Policy Optimization (PPO), Deep Deterministic Policy Gradient (DDPG), and Twin Delayed Deep Deterministic Policy Gradient (TD3)—for training an autonomous agent to complete laps in the Donkey Car simulator. The goal is to assess which algorithm is most effective for continuous control in autonomous driving environments.

  • Donkey Car simulator for realistic self-driving experiments
  • PPO, DDPG, and TD3 algorithms for autonomous driving tasks
  • Actor-critic architecture, experience replay, and target networks for stable training

Demo Video

Watch the video

Reinforcement Learning on Donkey Car Simulator

Setup

  1. Clone our repository into a folder.
  2. Follow the setup instructions provided here (https://github.com/tawnkramer/gym-donkeycar/releases) and extract the simulator into our repository.
  3. Clone the gym-environment (https://github.com/tawnkramer/gym-donkeycar/) into our repository.
  4. Install the dependencies by
pip install -r .\requirements.txt
  1. Follow the setup instruction from (https://docs.donkeycar.com/guide/install_software) to integrate code for Donkey Car simulator and environment setup.

How to Run the Code

Navigate to the RL_algorithms directory and to run any of the alogithms inside the directory using

python "filename" --sim "SIM_PATH" --test --gui

Default configuration is training mode, to run the file on test mode add argument --test and visualize it using GUI add argument --gui. This will launch the exe file and start training and testing (does not work if exe has not been opened before).

Algorithm Implementation

The implementation includes three DRL algorithms:

  • Proximal Policy Optimization (PPO) for on-policy training with CNNs for visual input.
  • Deep Deterministic Policy Gradient (DDPG) using an actor-critic architecture for continuous action spaces.
  • Twin Delayed DDPG (TD3) for addressing overestimation biases and improving training stability.

Ensure you have the necessary configurations and environment variables set for running the simulation.

Modules

Directory Description
Results Results of our trained model
rl_algorithms/ppo PPO implementation for autonomous driving in Donkey Car simulator
rl_algorithms/ddpg DDPG implementation for autonomous driving in Donkey Car simulator
rl_algorithms/td3 TD3 implementation for autonomous driving in Donkey Car simulator
archive Old implementations of the algorithm

Contributors

  • Rishabh Kumar
  • Rohit Kothawale
  • Rucha Pendharkar
  • Santrupti Patil
  • Yash Phalle

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages