FF's Roam Notes

❯

Learning from Trajectories via Subgoal Discovery

Learning from Trajectories via Subgoal Discovery

Jun 05, 20251 min read

imitation

Train a subgoal policy $π (s_{g} ∣ s_{t})$ by using imitation learning. That is, by collecting a bunch of expert data.

Graph View

Created with Quartz v4.5.1 © 2025

Portfolio