Flat A3C agent

Hi,

I really like your work, and want to ask for some clarifications on your new observation on training a flat A3C agent without the meta-controller. In this case are the sub-goals randomly generated every 'c' timesteps? (instead of the meta-controller outputting the sub-goal)

Thanks,
Dai