Discretized RL DQN ... Description Action Space Observation Space Reward Structure Our Implementation & Results