Skip to main content
Image alt text
Image alt text
Image alt text
Image alt text
Image alt text
Image alt text
Image alt text

Deep RL and Optimization applied to Operations Research problem - 2/2 Reinforcement Learning approach

This article is part of a series of articles which will introduce several optimization techniques, from traditional (yet advanced) Mathematical Optimization solvers and associated packages to Deep Reinforcement Learning algorithms, while tackling a very famous Operations Research problem: the multi-knapsack problem. Here, the focus is on an approach based on two famous reinforcement learning algorithms: Q-Learning and Policy Gradient.