Work Plan

Running reinforcement learning experiments on medium-sized environments such as the Arcade Learning Environment (ALE) is known to be time-consuming because of high sample complexity, high experimental variance, and the difficulty of parallelization. The plan therefore sets aside time for developing and testing efficient, distributed code for experimentation.
First semester:
Second semester: