Lefner Merridee J, Moghaddam Bita
Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR 97239, USA.
Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR 97239, USA; Department of Psychiatry, Oregon Health & Science University, Portland, OR 97239, USA.
Curr Biol. 2025 Aug 18;35(16):3973-3985.e3. doi: 10.1016/j.cub.2025.07.021. Epub 2025 Jul 31.
In dynamic environments where stimuli predicting reward or punishment unexpectedly change, it is critical to flexibly update behavior while preserving recollection of previous associations. Dopamine and gamma-aminobutyric acid (GABA) neurons in the ventral tegmental area (VTA) are implicated in reward and punishment learning, yet little is known about how each population adapts when the predicted outcome valence changes. To address this, VTA GABA and dopamine population calcium activity fluctuations were measured with fiber photometry while male and female rats learned to associate three discrete auditory cues with three distinct outcomes: reward, punishment, or no outcome within the same session. Contingency learning was determined by quantifying conditioned responding and probability of approach behaviors. After learning, the reward and punishment cue-outcome contingencies were reversed and subsequently re-reversed. The dopamine population displayed the expected adaptation to learning and contingency reversals by increasing the response to appetitive stimuli and decreasing the response to aversive stimuli. By contrast, the GABA population increased activity to all sensory events regardless of valence. Reversing learned contingencies selectively influenced GABA responses to the reward-predictive cue, prolonging increased activity both within and across sessions. Trial-by-trial analysis further confirmed that sustained GABA activity tracks contingency reversal. The valence-specific dissociations in the directionality and temporal progression of VTA GABA and dopamine neuronal activity suggest that these populations serve distinct roles during reward or punishment associative learning and reversal. These findings also describe a novel role for VTA GABA in behavioral flexibility.
在刺激预测奖励或惩罚意外变化的动态环境中,在保留对先前关联的记忆的同时灵活更新行为至关重要。腹侧被盖区(VTA)中的多巴胺和γ-氨基丁酸(GABA)神经元与奖励和惩罚学习有关,但对于每种神经元群体在预测结果效价变化时如何适应却知之甚少。为了解决这个问题,在雄性和雌性大鼠学习将三种离散的听觉线索与三种不同结果(奖励、惩罚或同一会话内无结果)建立关联时,用纤维光度法测量了VTA中GABA和多巴胺神经元群体的钙活性波动。通过量化条件反应和接近行为的概率来确定偶然性学习。学习后,奖励和惩罚线索-结果的偶然性被颠倒,随后再颠倒回来(恢复原状)。多巴胺神经元群体通过增加对奖励性刺激的反应和减少对厌恶性刺激的反应,表现出对学习和偶然性逆转的预期适应。相比之下,GABA神经元群体对所有感觉事件的活性都增加,而不管其效价如何。颠倒已习得的偶然性会选择性地影响GABA对奖励预测线索的反应,在同一会话内和不同会话之间都延长了增加的活性。逐次试验分析进一步证实,持续的GABA活性跟踪偶然性逆转。VTA中GABA和多巴胺神经元活动的方向性和时间进程中的效价特异性分离表明,这些神经元群体在奖励或惩罚联想学习及逆转过程中发挥着不同的作用。这些发现还描述了VTA中GABA在行为灵活性方面的新作用。