Deep Q-Learning (DQN)

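Deep Q-Learning uses Experience Replay: the agent stores each transition (state, action, reward, next_state) in a memory and trains the neural network on small batches sampled at random from it. Sampling this way avoids skewing the distribution of states, actions, rewards, and next_states that the network sees, because consecutive steps from one episode are strongly correlated. Importantly, the agent doesn't need to train after each step; it can keep collecting experience and train only every few steps.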
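
The replay idea described above can be sketched in a few lines of plain Python. This is only an illustration, not the code in sample_keras.py or sample_pytorch.py: the names `ReplayBuffer`, `capacity`, `BATCH_SIZE`, and `TRAIN_EVERY` are assumptions made for this example, and the network update is replaced by a counter.

```python
import random
from collections import deque


class ReplayBuffer:
    """Hypothetical fixed-size store of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity=10_000):
        # A deque with maxlen silently drops the oldest transition when full.
        self.memory = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.memory.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement mixes old and new experience,
        # so a batch is not made of consecutive, correlated steps.
        batch = random.sample(list(self.memory), batch_size)
        states, actions, rewards, next_states, dones = zip(*batch)
        return states, actions, rewards, next_states, dones

    def __len__(self):
        return len(self.memory)


# Toy loop with made-up transitions: collect every step, train only every few steps.
BATCH_SIZE = 32   # assumed value for illustration
TRAIN_EVERY = 4   # assumed value for illustration

buffer = ReplayBuffer(capacity=1000)
train_calls = 0
for step in range(100):
    buffer.push(step, step % 2, 1.0, step + 1, False)
    if len(buffer) >= BATCH_SIZE and step % TRAIN_EVERY == 0:
        states, actions, rewards, next_states, dones = buffer.sample(BATCH_SIZE)
        train_calls += 1  # a real agent would run one gradient update here
```

Training starts only once the buffer holds at least one full batch, and the `step % TRAIN_EVERY` check is one simple way to decouple acting from learning.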

Code

python sample_keras.py
python sample_pytorch.py

Useful Resources: