Changes in the stimulus‐preceding negativity and lateralized readiness potential during reinforcement learning

Xi Ren, Fernando Valle, Sergii Tukaiev, Steven A. Hackley

Published online on April 06, 2017

Abstract

According to reinforcement learning theory, dopamine‐dependent anticipatory processes play a critical role in learning from action outcomes such as feedback or reward. To better understand outcome anticipation, we examined variation in slow cortical potentials and assessed their changes over the course of motor‐skill acquisition. Healthy young adults learned a series of precisely timed, key press sequences. Feedback was delivered at a delay of either 2.5 or 8 s, to encourage use of either the striatally mediated, habit learning system or the hippocampus‐dependent, episodic memory system, respectively. During the 2.5‐s delay, the stimulus‐preceding negativity (SPN) was shown to decline in amplitude across trials, confirming previous results from a perceptual categorization task (Morís, Luque, & Rodríguez‐Fornells, 2013). This falsifies the hypothesis that SPN reflects specific outcome predictions, on the assumption that the ability to make such predictions should improve as a task is mastered. An SPN was also evident during the 8‐s delay, but it increased in amplitude across trials. At the conclusion of the 8‐s but not the 2.5‐s prefeedback interval, a reversed‐polarity lateralized readiness potential (LRP) was noted. It was suggested that this might indicate maintenance of an action representation for comparison with the feedback display. If so, this would constitute the first direct psychophysiological evidence for a popular hypothetical construct in quantitative models of reinforcement learning, the so‐called eligibility trace.