Offline RL benchmark project featuring a custom Gym environment, dual observation modes, reward shaping, and real-time PyGame rendering
docker reinforcement-learning inference pytorch neural-networks pruning cql quantization gymnasium model-compression custom-environment offline-rl observation-space td3-bc rl-benchmark
-
Updated
Jul 4, 2025 - Python