Learning an Inventory Control Policy with General Inventory Arrival Dynamics

In this paper we address the problem of learning and backtesting inventory control policies in the presence of general arrival dynamics, which we term a quantity-over-time arrivals model (QOT). We also allow for order quantities to be …

Scaling Laws for Imitation Learning in NetHack

Imitation Learning (IL) is one of the most widely used methods in machine learning. Yet, while powerful, many works find it is often not able to fully recover the underlying expert behavior. However, none of these works deeply investigate the role of …

A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation

Deep Inventory Management

This work provides a Deep Reinforcement Learning approach to solving a periodic review inventory control system with stochastic vendor lead times, lost sales, correlated demand, and price matching. While this dynamic program has historically been …

MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation

Multi-horizon probabilistic time series forecasting has wide applicability to real-world tasks such as demand forecasting. Recent work in neural time-series forecasting mainly focuses on the use of Seq2Seq architectures. For example, MQTransformer - an …

A Framework for the Meta-Analysis of Randomized Experiments with Applications to Heavy-Tailed Response Data

A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance. In this paper, we propose a novel cross-validation-like …

MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention

All Roads Lead to Quantitative Finance

Sample Path Generation for Probabilistic Demand Forecasting

A Multi-Horizon Quantile Recurrent Forecaster