FF's Notes

Graph Home

← Home

#off_policy

3 notes

Transitive RL: Value Learning via Divide and Conquer
off_policy rl
Notes on Deep rl at scale: sorting waste in office building with a fleet of mobile manipulators
off_policy rl
Off Policy Actor Critic
rl off_policy