A deep reinforcement learning approach for the meal delivery problem

作者:

Highlights:

• Our MDP model for the courier assignment task characterizes on-demand meal delivery service.

• We tailor deep reinforcement learning algorithms to address the problem in a dynamic environment.

• We incorporate the notion of order rejection to reduce the number of late orders.

• We investigate the importance of intelligent repositioning of the couriers during their idle times.

摘要

•Our MDP model for the courier assignment task characterizes on-demand meal delivery service.•We tailor deep reinforcement learning algorithms to address the problem in a dynamic environment.•We incorporate the notion of order rejection to reduce the number of late orders.•We investigate the importance of intelligent repositioning of the couriers during their idle times.

论文关键词:Meal delivery,Courier assignment,Reinforcement learning,DQN,DDQN

论文评审过程:Received 1 April 2021, Revised 17 February 2022, Accepted 19 February 2022, Available online 25 February 2022, Version of Record 9 March 2022.

论文官网地址:https://doi.org/10.1016/j.knosys.2022.108489