Please enable JavaScript.
Coggle requires JavaScript to display documents.
Task 6: (Taylor et al. (Reinforcement learning theory (Errors = negative,…
Task 6:
Taylor et al.
ERN - happend 50-100ms after an error. the magnitude responds to subjective importance of good performance
dACC is the source of ERN, processes errors, as part of general conflict/interference detection system
-
medial frontal feedback activity - 250ms after feedback, sensitive to the relative value of the outcome of task preformance
Error positivity - follows ERN 200-300ms after, originating in parietal cortex or anterior-rostral MFC/ACC
-
-
Conflict theory
cognitive conflict (interference) = when a strong response tendency competes with the intended response (e.g.: Stroop effect)
dACC monitors for the presence of cognitive conflict + passes a signal to the lateral PFC to increase cognitive control when conflict is high
errors generate high conflict when the stronger, but undesired, response tendency reaches a sufficient threshold to command an actual response
dACC activity:
-
predicted dlPFC activity in a subsequent trial + adjustments following an error (post- error slowing)
-
criticism: Several lines of research have challenged the contention that error signals reflect the same processes and neural systems as response conflict
-
-
Ullsperger et al.
rostral cingulate motor area (rCMA): error detection, generation of an error-specific event-related brain potential (ERP)and event-related negativity (ERN)
ERN results from disinhibited neuronal activity in the rCMA attributable to phasic depression of the dopaminergic activity on errors
self-detection of error + external negative feedback predicts the nonoccurrence of reward - should result in decreased dopamine release
habenula: modulatory between the limbic forebrain structures and the midbrain. VTA and SN recieve inhibiting neurons from this structure, located in dorsal medial thalamus. Recieves fibers from basal forebrain, medial striatum and anterior hypothalamus.
STUDY:
-
-
RESULTS:
EXP 2: (when introducing noninformative stimuli) in contrasting errors with informative (negative) feedback vs. correct trials with informative (positive) feedback, the results from experiment 1 were replicated
EXP1:
-
negative feedback - rCMA, inferior anterior insula + epitalamus (habenular complex)
rCMA reacted only to errors with negative feedback but not to errors without feedback (ruled out an influence of response conflict / uncertainty on its role in error detection by external signals)
DISCUSSION:
-
-
model:
decreased dopamine release during negative feedback errors can result in higher activity in the rCMAit is not rCMA that drives habenula, but the other way around!! We've been assuming that medial error detection drives habenula to inhibit midbrain DA neurons
for correct trials, reward expectancy was slightly higher than during errors because of higher certainty
difference in the habenular activity between informative and noninformative activity correlates with the error in reward prediction
-
rCMA – errors
rCMA is the generator not only of the ERN but also of the feedback-related negativity
CMA neurons respond only when reduced reward leads to a change in behavior
-
Rushworth et al.
ACC = action-outcome learning and representations (relating actions to their consequences, positive reinforcement outcomes and errors, guiding decisions about which actions are worth making)
pre-SMA: medial superior frontal gyrus (SFG) = action selection (task switching, motor sequencing, monitoring of response conflict before error commission)
- task switching - whenever action sets re initated/changed: activity when subjects are instructed to switch between two different sets of rules for f.e. selecting finger press responses to visual shapes
- movement sequences - when each action set is a set of response-response selection rules (sequencing paradigm)
- task switching - select superordinate sets of action-selection rules (action sets), necesseary when the set of action sets are changed/ first selected. anticipatory preparation and selection of a task set
-
STUDIES
STUDY1:
-
METHOD: go-no go in response to picture cues, where the associations of pictures with instructions were sometimes reversed. Also changing the rewards
STUDY 2:
METHOD: - taught monkeys to select one of two joystic movements after free delivery of one of two rewards correct movements were rewarded with a second similar reward
RESULTS: - ACC lesions disrupted performance on the reward guided action selection task without affecting visual stimulus selection
-
a) ACC neuron that encodes both a specific motor intention (No-go) + reward expectancy increases in activity when a visual cue is presented, although activity time is locked ,to the cues it is not specific to either cue. - But encodes the expectation of reward delivery on a No-go trial as opposed to the association of a reward on a Go trial or Go or No-go trials in which no reward is expected
STUDY 3
Method: - Rats performed a T-maze task in which the choice of one arm was always followed by a small reward whereas choosing the other arm resulted in a large reward large reward could only be obtained if the rats selected a more effortful action
RESULTS: Normal rats went for high reward, where ACC lesioned rats rarely did.
impaired ability to integrate both the expected costs and benefits of an action
rather than a simple insensitivity to reward differentials, failure to remember the size or positions of rewards, or difficulty climbing the barrier
Camille et al.
OFC - selecting preferred stimuli (objects, options, cues, envirnmental states) and stimulus value learning (linking stimuli to subjective relative values)
dACC - action-value learning (encode info that supports selection of action based on value) - impairment in selecting appropriate actions to obtain a reward
STUDY
Methods: patients with lesions in OFC and dACC were compared with control subjects on 1. stimulus-value task (choosing two decks of cards - win or loss of 50e) 2, action value task where participants had to choose between 2 movements
RESULTS
OFC damage = disrupted ability to sustain the correct choice of stimulus after positive feedback (not for action-value task) - more likely to shift the choice after win in stimulus-value task. no effect for action-value tasks
dACC damage= more likely to shift to the other option after a previously winning choice (win–shift) than were healthy controls in the action-value task
were no more likely to shift after a win in the stimulus-value task than were healthy subjects
-
-