Weber Lilian A, Yee Debbie M, Small Dana M, Petzschner Frederike H
Department of Psychiatry, University of Oxford, Oxford, UK; Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, UK.
Cognitive and Psychological Sciences, Brown University, Providence, RI, USA; Robert J. and Nancy D. Carney Institute for Brain Science, Brown University, Providence, RI, USA.
Trends Cogn Sci. 2025 Sep;29(9):840-854. doi: 10.1016/j.tics.2025.05.008. Epub 2025 Jun 10.
Rewards play a crucial role in sculpting all motivated behavior. Traditionally, research on reinforcement learning has centered on how rewards guide learning and decision-making. Here, we examine the origins of rewards themselves. Specifically, we discuss how the critical signal sustaining reinforcement for food is generated internally and subliminally during digestion. This calls for a shift in our understanding of primary rewards, from immediate sensory gratification to a state-dependent evaluation of an action's impact on vital physiological processes. We integrate this perspective into a revised reinforcement learning framework that recognizes the subliminal nature of biological rewards and their dependence on internal states and goals.