Dhruv Madeka
Randy Jia
Latest
Contextual Bandits for Evaluating and Improving Inventory Control Policies
Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Linear Reinforcement Learning with Ball Structure Action Space