• Non ci sono risultati.

The effect of counterfactual information on outcome value signal encoding: Evidence for fully-adaptive coding along the rostrocaudal axis of the medial prefrontal cortex

N/A
N/A
Protected

Academic year: 2021

Condividi "The effect of counterfactual information on outcome value signal encoding: Evidence for fully-adaptive coding along the rostrocaudal axis of the medial prefrontal cortex"

Copied!
2
0
0

Testo completo

(1)

Evidence for past and present subjecGve value signals in the human orbitofrontal cortex
 Wan-Yu Shih¹, Chien-Chen Chou², Jeng-Ren Duann³, Cheng-Chia Lee², Shih-Chieh Lin¹, Hsiang-Yu Yu², Paul Glimcher⁴, Shih-Wei Wu¹
 ¹NaQonal Yang-Ming University, ²Taipei Veterans General Hospital, ³NaQonal Central University, ⁴New York University
 There is accumula0ng evidence sugges0ng that the orbitofrontal cortex (OFC) represents the subjec0ve value (SV) of op0ons essen0al for value-based decision-making. A growing body of research further indicates that value representa0ons in OFC are sensi0ve to both past and present rewards and that OFC encodes rela0ve SV that depends on temporal context of experience. However, these findings are predominantly based on single-unit electrophysiology in non-human primates and have not been widely reported in humans. In this study, we inves0gated value representa0ons in the human OFC using intracranial electrophysiology (stereo-electroencephalography, SEEG). Four subjects implanted with mul0-contact depth electrodes performed a Becker-DeGroot-Marschack (BDM) auc0on task on snack food items to elicit and es0mate the subjec0ve value (SV) of rewards. Two 2500-ms 0me windows were extracted from the SEEG data: one 0me-locked to the s0mulus onset, the other 0me-locked to the response onset. The high gamma power (80-150 Hz) - previously shown to posi0vely correlate with the firing rate of single neurons - was extracted. For each SEEG contact on each subject, we regressed 50-ms 0me-binned high-gamma power against the SV of the present reward and of the reward in the previous trial. A contact was considered value-related if the regression coefficient is significant (p < 0.05) for three consecu0ve 50-ms 0me bins. All four subjects showed value-related representa0ons in the OFC: three subjects showed significant representa0on of current SV and three showed significant representa0on of previous SV. Two out of four subjects showed both current and previous SV representa0ons in the same 0me windows (s0mulus-locked and response-locked). Interes0ngly, both subjects had contacts that showed opposite signs of correla0on with previous and current SV, sugges0ng subtrac0on-based rela0ve SV computa0ons. In summary, using human SEEG, we found evidence that high-gamma ac0vity in the human OFC encodes current as well as previous SV for food rewards. Furthermore, our results provide support to the hypothesis that OFC computes rela0ve value that is sensi0ve to the temporal context of experience.

Session III Learning and Memory

The effect of counterfactual informaGon on outcome value signal encoding: Evidence for fully-adapGve coding along the rostrocaudal axis of the medial prefrontal cortex
 Doris Pischedda¹, Stefano Palminteri², Giorgio Coricelli³
 ¹University of Trento, ²InsQtut NaQonal de la Santé et de la Recherche Médicale, ³University of Southern California
 OBJECTIVE When we make a decision, usually we experience its consequences but we have no informa0on about poten0al outcomes of alterna0ve decisions we could have made. Yet, this addi0onal informa0on is useful to improve learning (e.g., Palminteri et al., 2015). In this study, we analyzed brain ac0vity related to outcome processing in loss and gain contexts to explore the neural coding of value for both factual and counterfactual outcomes, when either par0al or complete feedback about the outcomes is provided. The aim of the experiment is two-fold: 1) to inves0gate where factual and counterfactual informa0on is represented and 2) to unravel how outcome value is encoded in different feedback condi0ons. We hypothesize that counterfactual informa0on will produce rescaling of value 6

(2)

signal depending on the context, such that the value of a neutral outcome becomes posi0ve in a loss context (as absence of punishment) and nega0ve in a gain context (as absence of reward), thus reflec0ng fully-adap0ve coding of value signals. METHODS Twenty-eight par0cipants performed a probabilis0c instrumental learning task while undergoing func0onal magne0c resonance imaging scanning. On each trial, they had to choose between two symbols probabilis0cally associated with a certain reward (or punishment). Then, the outcome of the decision was revealed. Importantly, on half of the trials par0cipants received feedback only about the outcome of their decision (par0al feedback condi0on), while in the other half they were informed about both the factual and the counterfactual outcome (complete feedback condi0on). We used univariate analysis as well as mul0variate paPern classifica0on (Haynes, 2015) to explore outcome value processing and encoding of factual and counterfactual outcomes, in trials with either complete or par0al feedback. Furthermore, we assessed neural coding of outcome value in these two condi0ons. RESULTS Our results show that: 1) not only factual but also counterfactual informa0on can be decoded from local paPerns of brain ac0vity, 2) brain ac0va0on in regions along the rostrocaudal axis of the medial prefrontal cortex (mPFC) increases for processing of factual outcomes but decreases for counterfactual informa0on processing, 3) outcome decoding is significant in less rostral regions of mPFC, and 4) outcome value is represented with a fully-adap0ve code when complete feedback is provided, while it is encoded in an absolute way when feedback is par0al. CONCLUSION These findings suggest that outcome value processing is implemented through mul0ple coding mechanisms flexibly ac0vated depending on the specific choice segng at hand. Modeling structure in learning to self-regulate moGvaGon via veridical real-Gme fMRI neurofeedback from the ventral tegmental area
 Shabnam Hakimi¹, Jeffrey MacInnes², Kathryn Dickerson¹, R Adcock¹
 ¹Duke University, ²University of Washington
 INTRODUCTION: The ventral tegmental area (VTA) and its dopaminergic projec0ons are central to voli0onal behavior. Previous work from our group demonstrated that individuals can learn to self-ac0vate the VTA using self-generated mo0va0onal imagery during real-0me neurofeedback training, but only with veridical VTA feedback. Here, we inves0gate how the temporal structure of neurofeedback training predicts transfer. METHODS: Using a previously detailed task (MacInnes et al., 2016), par0cipants learned to voli0onally upregulate VTA response. Briefly, par0cipants completed a pre-test, 3 training runs, and a post-test. During training, they received real-0me fMRI neurofeedback from the VTA (N=19) via a graphical thermometer every 1 s over a total of 15 20-s trials (3 per run). Another group (N=14) received Gaussian noise instead of veridical VTA feedback. All par0cipants received minimal instruc0on about how to self-ac0vate and their strategies were not reported un0l aler the scan. Thus, our analyses were blind to the strategies used, instead focusing on the effect of context. To examine the structure of neurofeedback learning, we used MATLAB (v2016b), first extrac0ng mean VTA response during training (ie, the feedback signal) at the single-par0cipant level; we then resampled the signal to 2 Hz and rescaled it to [-1,1]. Next, we computed the slope of the feedback 0me series in three temporal contexts: single trial, scanner run, and full training session. This produced parameters reflec0ng learning during each context (trial: 15, run: 3, session: 1). We then used linear regression to examine how well these parameters predicted transfer, indicated by the change in average VTA response from pre- to post-test, figng one model per temporal context on the group-level. Finally, we es0mated and compared model evidence. RESULTS: The slope of VTA ac0va0on change over the en0re course of training predicted the magnitude of transfer (adj. R2=0.335, F2,17=8.56, p<0.01). An ini0al comparison of model evidence suggested that the full session was the best model for the data (AICc: -2.58, BIC: -1.44) rela0ve to other contexts (AICc, BIC; trial: 243, -13.9 and run: 10.8, 11.7). Further, only the full session explained 7

Riferimenti

Documenti correlati

Scopo dello studio WARFASA è stato valutare i benefici clinici della terapia con aspirina per la prevenzione delle recidive in pazienti con storia di tromboembolismo venoso

The first step of the dynamical comparison procedure relies on the iden- tification of sets of amino acids in the two enzymes that can be put in structural correspondence. For the

There are insufficient data to recommend on high- frequency oscillatory ventilation (HFOV) in obstructive airway (strong agreement), restrictive (strong agree- ment), mixed

The more interesting result is Theorem 4.15, which assures that isomorphic normal types have the same number of intersections and unions and that the atomic and arrow types are

In the third mesmerizing tale in Melissa Marr's #1 New York Times bestselling Wicked Lovely series, old friends become new enemies—and one wrong move could plunge the Earth

2.1 shows the composition of space debris: 59% of the objects orbiting the Earth are fragments generated by collisions or explosions; 16% are retired satellites; 12% are parts

They are obtained from the photo-oxidative-polymerization of hexafluoropropylene or tetrafluoroethylene and they were tested and then suggested as water repellents in

[r]