In this paper, the researchers study the operations of an imaginary coffee shop with a focus on the barista’s actions. They also show how the sequence of actions affects the overall performance of the coffee shop by using reinforcement learning and simulation as its policy training environment. This model acts as a guiding example that shows the ease of applying RL in AnyLogic models using the Pathmind Library.