Dhruv Madeka
Latest
Contextual Bandits for Evaluating and Improving Inventory Control Policies
Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Scaling Laws for Imitation Learning in NetHack
Linear Reinforcement Learning with Ball Structure Action Space
A few expert queries suffices for sample-efficient RL with resets and linear value approximation
Deep Inventory Management
MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation
A Framework for the Meta-Analysis of Randomized Experiments with Applications to Heavy-Tailed Response Data
MQTransformer: Multi-horizon forecasts with context dependent and feedback-aware attention
All roads lead to quantitative finance
Sample path generation for probabilistic demand forecasting
A multi-horizon quantile recurrent forecaster
Accurate prediction of electoral outcomes
Scatteract: Automated extraction of data from scatter plots
Estimating Covariance Matrices for Investments Whose Histories Differ in Length