A further improvement on Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies.

This work still focuses on the ‘high’ level of the multi-task decision-making problem, that is, deciding which subtask should be executed at each time step.

They use a classification and regression tree (CART) to infer the precondition function, which can then be used to build the whole subtask tree. Once we have this subtask tree, we can use the policy training method from the previous work to train a subtask decision policy.
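
To make the precondition inference step concrete, here is a minimal sketch of a CART-style learner in plain Python. It is not the authors' implementation. It assumes that each observation is a binary completion vector (which subtasks are already done) paired with a 0/1 label saying whether the target subtask was eligible, and it uses Gini impurity as the split criterion. The subtask names `A`, `B`, `D` and the helper names `build_tree`, `to_dnf`, and `predict` are hypothetical.

```python
from itertools import product


def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def build_tree(rows, features):
    """Grow a CART-style tree over binary features.

    rows: list of (completion_vector, eligible) pairs.
    Returns a leaf label (0 or 1) or a tuple (feature, subtree_if_0, subtree_if_1).
    """
    labels = [y for _, y in rows]
    if len(set(labels)) <= 1 or not features:
        # Pure (or empty) node, or nothing left to split on: majority label.
        return int(2 * sum(labels) >= len(labels)) if labels else 0

    def split_cost(f):
        # Weighted Gini impurity of the two children after splitting on f.
        left = [y for x, y in rows if x[f] == 0]
        right = [y for x, y in rows if x[f] == 1]
        return (len(left) * gini(left) + len(right) * gini(right)) / len(rows)

    best = min(features, key=split_cost)
    if split_cost(best) >= gini(labels):
        # No split reduces impurity: stop and predict the majority label.
        return int(2 * sum(labels) >= len(labels))
    rest = [f for f in features if f != best]
    left = [(x, y) for x, y in rows if x[best] == 0]
    right = [(x, y) for x, y in rows if x[best] == 1]
    return (best, build_tree(left, rest), build_tree(right, rest))


def to_dnf(tree, names, path=()):
    """List every root-to-leaf path that ends in 'eligible' as an AND clause."""
    if not isinstance(tree, tuple):
        return [list(path)] if tree == 1 else []
    f, if_0, if_1 = tree
    return (to_dnf(if_0, names, path + ("not " + names[f],))
            + to_dnf(if_1, names, path + (names[f],)))


def predict(tree, x):
    """Follow the tree for one completion vector and return 0 or 1."""
    while isinstance(tree, tuple):
        f, if_0, if_1 = tree
        tree = if_1 if x[f] else if_0
    return tree


# Hypothetical data: the target subtask is eligible only when A and B are done;
# D is irrelevant. Each row is ((A, B, D), eligible).
names = ["A", "B", "D"]
rows = [(x, int(x[0] and x[1])) for x in product((0, 1), repeat=3)]

tree = build_tree(rows, features=[0, 1, 2])
clauses = to_dnf(tree, names)
print(clauses)  # [['A', 'B']]
```

Each returned clause is a conjunction of completed (or not completed) subtasks, and the list as a whole is a disjunction. Running one such learner per subtask would give the edges needed to assemble the subtask tree. Note that this greedy stopping rule cannot recover preconditions where no single split reduces impurity (for example XOR-like conditions).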