This lecture explores the use of spatial and pattern memory in reinforcement learning agents through maze navigation tasks. The approach involves designing mazes of increasing complexity to test spatial orientation and path choosing abilities. The RL agent is based on a Dueling Network architecture with Double Q-learning for state updates. Results show that increasing visual landmarks improve agent performance in spatial orientation, while path choosing remains inconsistent. Limitations include the lack of environment map building and the need for better reward functions. Possible extensions involve refining starting points and orientations.
This video is available exclusively on Mediaspace for a restricted audience. Please log in to MediaSpace to access it if you have the necessary permissions.
Watch on Mediaspace