Dhruv Madeka
Dhruv Madeka
Home
Publications
Posts
Talks
Contact
Philip Amortila
Latest
A few expert queries suffices for sample-efficient rl with resets and linear value approximation
Cite
×