
DDPG: Deep Deterministic Policy Gradient (TensorFlow 2.x)
July 2025
Completed
A robust, modular, and extensible reimplementation of the DDPG reinforcement learning algorithm (Lillicrap et al., 2015) using TensorFlow 2.x. Built for reproducibility, research comparison, and interactive experimentation with Gym environments.
- Modular architecture: separate agent, networks, noise, buffer, wrappers
- Hyperparameter sweeps, YAML config system, and TensorBoard logging
- Includes full unit test suite and reproducibility controls (seed, version logging)
- Supports Gym ≥ 0.26, Apple Silicon acceleration (tensorflow-macos, -metal)