Final colloquium Emma Boelen

30 augustus 2024 08:30 t/m 09:30 - Locatie: IDE-studio 23/24, 32.A-0-950 - Door: DCSC | Zet in mijn agenda

Optimizing a Robot Fleet Scheduling Model and Floorplan using Max-Plus Linear Algebra and Deep Q-Learning

Supervisor: Dr. Ton van den Boom

Abstract: Automation of machines is becoming increasingly widespread and advanced, of which an

example is the use robots for Prime Vision, which sorts parcels for postal services. The

coordination of scheduling a fleet of robots picking up and dropping off many parcels while

avoiding collisions, within a limited space, following predefined routes in a floorplan, is a

complex problem.

This logistical challenge can be effectively modelled using max-plus linear algebra to allow

an optimization for the route scheduling as was previously done by L. Smeets. The goal of this

research is to improve the existing scheduling model and use this to develop a reinforcement

learning-based algorithm that determines the optimal floorplan for the parcel delivery robots.

Two methods are applied to improve the existing scheduling model. Firstly, nodes where no

decisions are made are identified and removed. Secondly, certain constraints are also removed

to simplify the model.

The results of the scheduler are used to determine a key performance indicator to allow

a reinforcement learning based algorithm to identify the optimal floorplan for the robots.

The reinforcement learning algorithm employed a deep Q-learning approach, with the neural

network trained using various action space approaches, tuned rewards and hyper-parameters.

The greedy-epsilon method was applied to address the exploration vs. exploitation problem.

While the scheduler improvements significantly enhanced its computational costs, the neural

network did not converge, and the potential causes are thoroughly discussed.