Predictive and reactive reward signals conveyed by climbing fiber inputs to cerebellar Purkinje cells

Kostadinov, Dimitar; Beau, Maxime; Blanco-Pozo, Marta; Häusser, Michael

doi:10.1038/s41593-019-0381-8

Article
Published: 29 April 2019

Predictive and reactive reward signals conveyed by climbing fiber inputs to cerebellar Purkinje cells

Nature Neuroscience volume 22, pages 950–962 (2019)Cite this article

14k Accesses
120 Citations
54 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 24 January 2020

This article has been updated

Abstract

There is increasing evidence for a cerebellar contribution to cognitive processing, but the specific input pathways conveying this information remain unclear. We probed the role of climbing fiber inputs to Purkinje cells in generating and evaluating predictions about associations between motor actions, sensory stimuli and reward. We trained mice to perform a visuomotor integration task to receive a reward and interleaved cued and random rewards between task trials. Using two-photon calcium imaging and Neuropixels probe recordings of Purkinje cell activity, we show that climbing fibers signal reward expectation, delivery and omission. These signals map onto cerebellar microzones, with reward delivery activating some microzones and suppressing others, and with reward omission activating both reward-activated and reward-suppressed microzones. Moreover, responses to predictable rewards are progressively suppressed during learning. Our findings elucidate a specific input pathway for cerebellar contributions to reward signaling and provide a mechanistic link between cerebellar activity and the creation and evaluation of predictions.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Population Purkinje cell complex spike imaging during a sensorimotor task.**

**Fig. 2: Reward-activated and suppressed Purkinje cells segregate to distinct cerebellar microzones.**

**Fig. 3: Predictability modulates reward responses in trained mice.**

**Fig. 4: Electrophysiological recordings of complex spikes during cued and random reward presentation.**

**Fig. 5: Modulation of reward-related responses develops with training.**

**Fig. 6: Fictive rewards on operant trials trigger error signals across microzones.**

**Fig. 7: Omission of cued rewards triggers feedback error signals.**

Cerebellar climbing fibers multiplex movement and reward signals during a voluntary movement task in mice

Article Open access 09 September 2023

Koji Ikezoe, Naoki Hidaka, … Kazuo Kitamura

Climbing fibers provide essential instructive signals for associative learning

Article Open access 02 April 2024

N. Tatiana Silva, Jorge Ramírez-Buriticá, … Megan R. Carey

Cerebellar dopamine D2 receptors regulate social behaviors

Article 16 June 2022

Laura Cutando, Emma Puighermanal, … Emmanuel Valjent

Data availability

The data that support the findings of this study are available from the corresponding authors upon reasonable request.

Code availability

The custom analysis code used in this study is available from the corresponding authors upon reasonable request.

Change history

24 January 2020
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

Wolpert, D. M., Miall, R. C. & Kawato, M. Internal models in the cerebellum. Trends Cogn. Sci. 2, 338–347 (1998).
Article CAS PubMed Google Scholar
Kawato, M., Furukawa, K. & Suzuki, R. A hierarchical neural-network model for control and learning of voluntary movement. Biol. Cybern. 57, 169–185 (1987).
Article CAS PubMed Google Scholar
Medina, J. F. The multiple roles of Purkinje cells in sensori-motor calibration: to predict, teach and command. Curr. Opin. Neurobiol. 21, 616–622 (2011).
Article CAS PubMed PubMed Central Google Scholar
Marr, D. A theory of cerebellar cortex. J. Physiol. 202, 437–470 (1969).
Article CAS PubMed PubMed Central Google Scholar
Albus, J. A. A theory of cerebellar function. Math. Biosci. 10, 25–61 (1971).
Article Google Scholar
Ito, M. Cerebellar long-term depression: characterization, signal transduction, and functional roles. Physiol. Rev. 81, 1143–1195 (2001).
Article CAS PubMed Google Scholar
Apps, R. & Garwicz, M. Anatomical and physiological foundations of cerebellar information processing. Nat. Rev. Neurosci. 6, 297–311 (2005).
Article CAS PubMed Google Scholar
Sugihara, I. & Shinoda, Y. Molecular, topographic, and functional organization of the cerebellar cortex: a study with combined aldolase C and olivocerebellar labeling. J. Neurosci. 24, 8771–8785 (2004).
Article CAS PubMed PubMed Central Google Scholar
Mathy, A. et al. Encoding of oscillations by axonal bursts in inferior olive neurons. Neuron 62, 388–399 (2009).
Article CAS PubMed PubMed Central Google Scholar
Llinas, R., Baker, R. & Sotelo, C. Electrotonic coupling between neurons in cat inferior olive. J. Neurophysiol. 37, 560–571 (1974).
Article CAS PubMed Google Scholar
Tang, T., Blenkinsop, T. A. & Lang, E. J. Complex spike synchrony dependent modulation of rat deep cerebellar nuclear activity. eLife 8, e40101 (2019).
Article PubMed PubMed Central Google Scholar
Welsh, J. P., Lang, E. J., Suglhara, I. & Llinás, R. Dynamic organization of motor control within the olivocerebellar system. Nature 374, 453–457 (1995).
Article CAS PubMed Google Scholar
Ozden, I., Dombeck, D. A., Hoogland, T. M., Tank, D. W. & Wang, S. S. Widespread state-dependent shifts in cerebellar activity in locomoting mice. PLoS One 7, e42650 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ghosh, K. K. et al. Miniaturized integration of a fluorescence microscope. Nat. Methods 8, 871–878 (2011).
Article CAS PubMed PubMed Central Google Scholar
Heffley, W. et al. Coordinated cerebellar climbing fiber activity signals learned sensorimotor predictions. Nat. Neurosci. 21, 1431–1441 (2018).
Article CAS PubMed PubMed Central Google Scholar
De Gruijl, J. R., Hoogland, T. M. & De Zeeuw, C. I. Behavioral correlates of complex spike synchrony in cerebellar microzones. J. Neurosci. 34, 8937–8947 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hoogland, T. M., De Gruijl, J. R., Witter, L., Canto, C. B. & De Zeeuw, C. I. Role of synchronous activation of cerebellar purkinje cell ensembles in multi-joint movement control. Curr. Biol. 25, 1157–1165 (2015).
Article CAS PubMed PubMed Central Google Scholar
Najafi, F., Giovannucci, A., Wang, S. S. & Medina, J. F. Coding of stimulus strength via analog calcium signals in Purkinje cell dendrites of awake mice. eLife 3, e03663 (2014).
Article CAS PubMed PubMed Central Google Scholar
Mukamel, E. A., Nimmerjahn, A. & Schnitzer, M. J. Automated analysis of cellular signals from large-scale calcium imaging data. Neuron 63, 747–760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Deverett, B., Koay, S. A., Oostland, M. & Wang, S. S. Cerebellar involvement in an evidence-accumulation decision-making task. eLife 7, e36781 (2018).
Article PubMed PubMed Central Google Scholar
Medina, J. F. & Lisberger, S. G. Links from complex spikes to local plasticity and motor learning in the cerebellum of awake-behaving monkeys. Nat. Neurosci. 11, 1185–1192 (2008).
Article CAS PubMed PubMed Central Google Scholar
Yang, Y. & Lisberger, S. G. Purkinje-cell plasticity and cerebellar motor learning are graded by complex-spike duration. Nature 510, 529–532 (2014).
Article CAS PubMed PubMed Central Google Scholar
Strick, P. L., Dum, R. P. & Fiez, J. A. Cerebellum and nonmotor function. Annu. Rev. Neurosci. 32, 413–434 (2009).
Article CAS PubMed Google Scholar
Rochefort, C. et al. Cerebellum shapes hippocampal spatial code. Science 334, 385–389 (2011).
Article CAS PubMed Google Scholar
Stoodley, C. J. & Schmahmann, J. D. Functional topography in the human cerebellum: a meta-analysis of neuroimaging studies. Neuroimage 44, 489–501 (2009).
Article PubMed Google Scholar
Wagner, M. J., Kim, T. H., Savall, J., Schnitzer, M. J. & Luo, L. Cerebellar granule cells encode the expectation of reward. Nature 544, 96–100 (2017).
Article CAS PubMed PubMed Central Google Scholar
Lee, K. H. et al. Circuit mechanisms underlying motor memory formation in the cerebellum. Neuron 86, 529–540 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ozden, I., Sullivan, M. R., Lee, H. M. & Wang, S. S. Reliable coding emerges from coactivation of climbing fibers in microbands of cerebellar Purkinje neurons. J. Neurosci. 29, 10463–10473 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kitamura, K. & Häusser, M. Dendritic calcium signaling triggered by spontaneous and sensory-evoked climbing fiber input to cerebellar Purkinje cells in vivo. J. Neurosci. 31, 10847–10858 (2011).
Article CAS PubMed PubMed Central Google Scholar
Schultz, S. R., Kitamura, K., Post-Uiterweer, A., Krupic, J. & Häusser, M. Spatial pattern coding of sensory information by climbing fiber-evoked calcium signals in networks of neighboring cerebellar Purkinje cells. J. Neurosci. 29, 8005–8015 (2009).
Article CAS PubMed PubMed Central Google Scholar
Gaffield, M. A., Bonnan, A. & Christie, J. M. Conversion of graded presynaptic climbing fiber activity into graded postsynaptic Ca²⁺ signals by Purkinje cell dendrites. Neuron https://doi.org/10.1016/j.neuron.2019.03.010 (2019).
Oscarsson, O. Functional units of the cerebellum - sagittal zones and microzones. Trends Neurosci. 2, 143–145 (1979).
Article Google Scholar
Ohmae, S. & Medina, J. F. Climbing fibers encode a temporal-difference prediction error during cerebellar learning in mice. Nat. Neurosci. 18, 1798–1803 (2015).
Article CAS PubMed PubMed Central Google Scholar
Rowan, M. J. M. et al. Graded control of climbing-fiber-mediated plasticity and learning by inhibition in the cerebellum. Neuron 99, 999–1015.e6 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ivry, R. B. & Keele, S. W. Timing functions of the cerebellum. J. Cogn. Neurosci 1, 136–152 (1989).
Article CAS PubMed Google Scholar
Lang, E. J. et al. The roles of the olivocerebellar pathway in motor learning and motor control. A consensus paper. Cerebellum 16, 230–252 (2017).
Article PubMed PubMed Central Google Scholar
Ten Brinke, M. M., Boele, H. J. & De Zeeuw, C. I. Conditioned climbing fiber responses in cerebellar cortex and nuclei. Neurosci. Lett. 688, 26–36 (2019).
Article CAS PubMed Google Scholar
Giovannucci, A. et al. Cerebellar granule cells acquire a widespread predictive feedback signal during motor learning. Nat. Neurosci. 20, 727–734 (2017).
Article CAS PubMed PubMed Central Google Scholar
Sutton, R. S. Learning to predict by methods of temporal differences. Mach. Learn. 3, 9–44 (1988).
Google Scholar
Schultz, W., Dayan, P. & Montague, P. R. A neural substrate of prediction and reward. Science 275, 1593–1599 (1997).
Article CAS PubMed Google Scholar
Watabe-Uchida, M., Eshel, N. & Uchida, N. Neural circuitry of reward prediction error. Annu. Rev. Neurosci. 40, 373–394 (2017).
Article CAS PubMed PubMed Central Google Scholar
Turecek, J. et al. NMDA receptor activation strengthens weak electrical coupling in mammalian brain. Neuron 81, 1375–1388 (2014).
Article CAS PubMed PubMed Central Google Scholar
Mathy, A., Clark, B. A. & Häusser, M. Synaptically induced long-term modulation of electrical coupling in the inferior olive. Neuron 81, 1290–1296 (2014).
Article CAS PubMed PubMed Central Google Scholar
Lefler, Y., Yarom, Y. & Uusisaari, M. Y. Cerebellar inhibitory input to the inferior olive decreases electrical coupling and blocks subthreshold oscillations. Neuron 81, 1389–1400 (2014).
Article CAS PubMed Google Scholar
Miyazaki, K., Miyazaki, K. W. & Doya, K. Activation of dorsal raphe serotonin neurons underlies waiting for delayed rewards. J. Neurosci. 31, 469–479 (2011).
Article CAS PubMed PubMed Central Google Scholar
Garden, D. L. F., Rinaldi, A. & Nolan, M. F. Active integration of glutamatergic input to the inferior olive generates bidirectional postsynaptic potentials. J. Physiol. 595, 1239–1251 (2017).
Article CAS PubMed Google Scholar
Carta, I., Chen, C. H., Schott, A. L., Dorizan, S. & Khodakhah, K. Cerebellar modulation of the reward circuitry and social behavior. Science 363, eaav0581 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gao, Z. et al. A cortico-cerebellar loop for motor planning. Nature 563, 113–116 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chabrol, F. P., Blot, A. & Mrsic-Flogel, T. D. Cerebellar contribution to preparatory activity in motor neocortex. Preprint at biorXiv https://doi.org/10.1101/335703 (2018).
Chen, C. H., Fremont, R., Arteaga-Bracho, E. E. & Khodakhah, K. Short latency cerebellar modulation of the basal ganglia. Nat. Neurosci. 17, 1767–1775 (2014).
Article CAS PubMed PubMed Central Google Scholar
Zhang, X. M. et al. Highly restricted expression of Cre recombinase in cerebellar Purkinje cells. Genesis 40, 45–51 (2004).
Article CAS PubMed Google Scholar
Chen, T. W. et al. Ultrasensitive fluorescent proteins for imaging neuronal activity. Nature 499, 295–300 (2013).
Article CAS PubMed PubMed Central Google Scholar
Aronov, D. & Tank, D. W. Engagement of neural circuits underlying 2D spatial navigation in a rodent virtual reality system. Neuron 84, 442–456 (2014).
Article CAS PubMed PubMed Central Google Scholar
Burgess, C. P. et al. High-yield methods for accurate two-alternative visual psychophysics in head-fixed mice. Cell Rep. 20, 2513–2524 (2017).
Article CAS PubMed PubMed Central Google Scholar
Slotnick, B. A simple 2-transistor touch or lick detector circuit. J. Exp. Anal. Behav. 91, 253–255 (2009).
Article PubMed PubMed Central Google Scholar
Jun, J. J. et al. Fully integrated silicon probes for high-density recording of neural activity. Nature 551, 232–236 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pachitariu, M. et al. Suite2p: beyond 10,000 neurons with standard two-photon microscopy. Preprint at biorXiv https://doi.org/10.1101/061507 (2017).
Deneux, T. et al. Accurate spike estimation from noisy calcium signals for ultrafast three-dimensional imaging of large neuronal populations in vivo. Nat. Commun. 7, 12190 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ozden, I., Lee, H. M., Sullivan, M. R. & Wang, S. S. Identification and clustering of event patterns from in vivo multiphoton optical recordings of neuronal ensembles. J. Neurophysiol. 100, 495–503 (2008).
Article PubMed PubMed Central Google Scholar
Tsutsumi, S. et al. Structure-function relationships between aldolase C/zebrin II expression and complex spike synchrony in the cerebellum. J. Neurosci. 35, 843–852 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ramirez, J. E. & Stell, B. M. Calcium imaging reveals coordinated simple spike pauses in populations of cerebellar Purkinje cells. Cell Rep. 17, 3125–3132 (2016).
Article CAS PubMed Google Scholar
Streng, M. L., Popa, L. S. & Ebner, T. J. Climbing fibers control Purkinje cell representations of behavior. J. Neurosci. 37, 1997–2009 (2017).
Article CAS PubMed PubMed Central Google Scholar
Watson, B. O., Yuste, R. & Packer, A. M. PackIO and EphysViewer: software tools for acquisition and analysis of neuroscience data. Preprint at biorXiv https://doi.org/10.1101/054080 (2016).
Pachitariu, M., Steinmetz, N. A., Kadir, S. N., Carandini, M. & Harris, K. D. Fast and accurate spike sorting of high-channel count probes with Kilosort. Adv. Neural Inf. Process. Syst. 29, 4448–4456 (2016).
Google Scholar
Armstrong, D. M. & Rawson, J. A. Activity patterns of cerebellar cortical neurones and climbing fibre afferents in the awake cat. J. Physiol. 289, 425–448 (1979).
Article CAS PubMed PubMed Central Google Scholar
Gao, H., Solages, Cd & Lena, C. Tetrode recordings in the cerebellar cortex. J. Physiol. Paris 106, 128–136 (2012).
Article PubMed Google Scholar
Dong, H.-W. The Allen Reference Atlas: A Digital Color Brain Atlas of the C57Bl/6J Male Mouse (Wiley, 2008).
Oh, S. W. et al. A mesoscale connectome of the mouse brain. Nature 508, 207–214 (2014).
Article CAS PubMed PubMed Central Google Scholar
Shamash, P., Carandini, M., Harris, K. & Steinmetz, N. A tool for analyzing electrode tracks from slice histology. Preprint at biorXiv https://doi.org/10.1101/447995 (2018).

Download references

Acknowledgements

We are grateful to P. Dayan, M. Fisek, S. Tsutsumi, C. Buetfering, B. Clark, Y. Chung and the members of the Häusser laboratory for discussions and comments on the manuscript. We would like to thank N. Steinmetz for help with Neuropixels recordings, M. Pachitariu for generously providing us access to Kilosort2 before general distribution and N. Smith for illustrations. This work was supported by the Wellcome Trust (M.H., PRF 201225), ERC (M.H., AdG 695709) and EMBO (D.K., ALTF 914-2015).

Author information

Marta Blanco-Pozo
Present address: Department of Experimental Psychology, University of Oxford, Oxford, UK

Authors and Affiliations

Wolfson Institute for Biomedical Research and Department of Neuroscience, Physiology and Pharmacology, University College London, London, UK
Dimitar Kostadinov, Maxime Beau, Marta Blanco-Pozo & Michael Häusser

Authors

Dimitar Kostadinov
View author publications
You can also search for this author in PubMed Google Scholar
Maxime Beau
View author publications
You can also search for this author in PubMed Google Scholar
Marta Blanco-Pozo
View author publications
You can also search for this author in PubMed Google Scholar
Michael Häusser
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.K. and M.H. conceived the project and wrote the manuscript. D.K., M.B. and M.B.-P. performed experiments and analysis.

Corresponding authors

Correspondence to Dimitar Kostadinov or Michael Häusser.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Integrated supplementary information

Supplementary Figure 1 Activity aligned to movement onset is preferentially related to movement and not to object appearance.

a. Top: Events extracted from population two-photon calcium imaging of Purkinje cells during the behavioral task, expressed as a heatmap. Activity in an example session is aligned to object appearance (dashed line) on trials in which the mouse did not immediately turn the wheel (turn latency >0.75 s). Bottom: summary of activity and behavior for all sessions. N = 1101 neurons, 6 mice, 6 sessions (1 session per mouse). Data are shown as mean ± s.e.m. b. Top: Heatmap of activity in an example session aligned to wheel turns (dashed line) that occurred within a trial (left) or outside of a trial (right). Bottom: summary of activity and behavior for all sessions. N = 1101 neurons, 6 mice, 6 sessions (1 session per mouse). Data are shown as mean ± s.e.m. c. Top: Heatmap of activity in an example session aligned to wheel turns (dashed line) that occurred early within a trial (turn latency ≤0.75 s) or late in a trial (turn latency >0.75 s). Bottom: summary of activity and behavior for all sessions. N = 859 neurons, 5 mice, 5 sessions (1 session per mouse). Note that one session was excluded from these analyses because it only had 4 trials in which it initiated a movement with turn latency >0.75 s. Data are shown as mean ± s.e.m. d. Comparison of first significant population-wide event after object appearance (defined as the time points when the mean event rate for a field of view exceeded the mean + 2 s.d. of activity during the withhold period on each trial) to the reaction time of the mouse on each trial. Dots represent individual trials and dots of the same color represent individual sessions (one session per mouse). Black line and shaded bar represent linear fit and 95% confidence interval through all data points (linear regression, p = 3 × 10⁻⁴²). N = 594 trials, 6 mice, 6 sessions (1 session per mouse). e. Same as panel d but for the last population-wide event before movement onset (linear regression, p = 8 ×1 0⁻¹⁹⁰). f. Summary of Pearson’s correlation coefficients of activity traces for individual neurons between wheel turns within trial and outside of trial (green bar, interval -300 to +300 ms relative to wheel movement, N = 1101 neurons, 6 mice, 6 sessions), between trial onsets without movement and wheel turns within a trial (gray bar, interval +100 to 700 ms for trial onset and -300 to +300 ms for wheel turns in order to align peaks of activity, N = 1101 neurons, 6 mice, 6 sessions), and between wheel turns that occurred early within a trial and movements that occurred late within a trial (blue bar, interval -300 to +300 ms relative to wheel movement, N = 859 neurons, 5 mice, 5 sessions). Intervals for correlation analysis were chosen to align the peaks of the responses for each condition. Data are shown as box plots: center line, median; box edges, interquartile ranges; whiskers, range without outliers (1.5 times the interquartile range from box edges); black points, outliers (Kruskal-Wallis test, H = 359, d.f. = 2, p = 9 × 10⁻⁷⁹, significance values for Bonferroni-corrected individual comparisons: wheel turns: within trial and outside of trial vs trial onset without movement and wheel turns within trial, p = 7 × 10⁻⁶³; wheel turns: within trial and outside of trial vs wheel turns within trial: early and late, p > 0.9; trial onset without movement and wheel turns within trial vs wheel turns within trial: early and late, p = 1 × 10⁻⁵⁴). Statistics summary: n.s. = not significant, ***p < 0.001.

Supplementary Figure 2 PCA-based clustering reveals correlated groups of Purkinje cell dendrites.

Workflow for microzone identification is shown: a. Significant principal components were identified by performing 1000 iterations of PCA on cell-wise jittered data (interval ± 400 ms, whole recording) and choosing first p principal components that explained significantly more variance than the jittered components. b. The appropriate number of clusters for each data set (spontaneous data only) were chosen from the interval [1:12] based on silhouette analysis. PCA projection in 3 dimensions shown before clustering (left, pseudocolored by coefficients of 4th component) and after initial clustering (right, colored by cluster). c. The initial clusters for each recording were mapped on anatomy and exclusion criteria were applied to obtain pure clusters. d. ROI centroids were projected on to the mediolateral anatomical axis and binned at ~80 μm (64 pixels). Secondary peaks were identified with a threshold of 40% of the maximum bin. If secondary peaks were found, individual ROIs were assigned to the closest peak (none found in example dataset). e. The median position for each cluster was computed and ROIs that were more than 3 median absolute deviations away from median were excluded as outliers. f. Final cluster designations were assigned, ROIs were organized spatially within each cluster and clusters were organized spatially relative to each other.

Supplementary Figure 3 Microzonal identification for each recorded dataset.

Anatomical maps (subpanel 1), clustering (subpanels 2 and 3), and correlation heatmaps (subpanels 4 and 5) for fields of view used in Figs. 1, 2. Analysis was done independently for spontaneous activity acquired during withhold periods (spont.) and for whole recording session (all data). Numbers of neurons for fields of view in panels a-f= 210, 124, 122, 273, 237, and 135, respectively.

Supplementary Figure 4 Mapping recorded fields of view onto cerebellar anatomy.

Coarse anatomical maps with landmarks are shown in grayscale and reward response is shown in color. Field of view number corresponds to numbers given in Supplementary Fig. 3. Purkinje cell dendrites with high reward activity (’reward-activated’) are shown in magenta, and Purkinje cell dendrites with low reward activity (’reward-suppressed’) are shown in cyan. Number of trial rewards per field of view: 120 (Field of view 2), 143 (Field of view 3), 156 (Field of view 4), and 52 (Field of view 5).

Supplementary Figure 5 Relationship between delay-period activity and predictive licking on operant trials.

a. Example of microzone from animal that exhibited predictive licking showing delay period activity on both correct (left, black) and incorrect (right, red) trials. N = 162 correct trials and 124 incorrect trials. b. Same as panel a but for animal that did not exhibit predictive licking. N = 156 correct trials and 115 incorrect trials. c. Comparison of mean microzonal activity during delay period (-500 to 0 ms relative to reward) in significantly activated microzones on correct and incorrect trials. Dots of the same color represent microzones from the same recording. N = 14 microzones from 6 mice. d. Comparison of degree of predictive licking on correct and incorrect trials. N = 6 sessions from 6 mice. e. Relationship between level of delay period activation and degree of predictive licking on correct trials. N = 14 microzones from 6 mice. f. Same as panel e but for incorrect trials. Traces in panels a and b and data points in panels d-f are shown as mean ± s.e.m. Black line and shaded bars in panels d-f represent linear fit and 95% confidence interval through all data points (linear regression; panel c, p = 2 × 10⁻⁶; panel d, p = 5 × 10⁻⁸; panel e, p = 0.5; panel f, p = 0.9).

Supplementary Figure 6 Initial observation of reward-response modulation.

a. Top: Trial-averaged population response in the first mouse in which we observed reward response modulation (Supplementary Fig. 3, FOV 1). Middle: Trial-averaged steering wheel velocity. Bottom: Trial-averaged licking response. Data are shown as mean ± s.e.m. across trials (n = 162 trial rewards and 5 random rewards). In this experiment, the mouse was not subjected to all of the potential reward contingencies, only to random and trial-dependent (operant) rewards, but still exhibited the characteristic reward response modulation. b. Scatter plot comparing reward-related responses over interval 0 to +100 ms post reward for random rewards and rewards earned on correct motor trials. c. Cell-wise average of Purkinje cell dendritic response to trial rewards and random rewards taken over interval 0 to +100 ms post reward. Data are shown as mean ± s.e.m. across cells (n = 210 cells, two-sided Wilcoxon signed-rank test, p = 3 × 10⁻¹⁸). Statistics summary: ***p < 0.001.

Supplementary Figure 7 Quantification of licking responses across reward conditions.

a. Comparison of delay period (reward-predictive) licking for tone-cued rewards and correct motor trials (-500 to 0 ms relative to reward in both conditions, which had the same delay interval). Mean licking responses in individual mice (n = 5) are as single colored dots with error bars showing across trial s.e.m. All mice exhibited higher predictive licking for tone-cued rewards than on correct motor trials (p = 3 × 10⁻²² (Mouse 1, yellow), p = 5 × 10⁻¹⁷ (Mouse 2, red), p = 1 × 10⁻¹⁶ (Mouse 3, green), p = 1 × 10⁻⁷ (Mouse 4, blue), p = 1 × 10⁻⁶ (Mouse 5, magenta), two-sided Wilcoxon rank sum test across trials). b. Normalized response to trial rewards (mean activity in interval 0 to +100 ms after reward in Purkinje cells from reward-activated microzones (top) and reward-suppressed microzones (bottom) as a function of degree of predictive licking. Reward-related responses and predictive licking were quantified on individual trials, after which responses in individual neurons were normalized to the mean response to random reward per neuron and all neuron-trial pairs were binned for plotting according level of predictive licking in each trial. c. Same as panel a but for cued rewards. Data are shown as mean ± s.e.m. In panels b and c, N = 38337 neuron-condition pairs (trial rewards, activated), 51888 pairs (trial rewards, suppressed), 11584 pairs (cued rewards, activated), and 11314 pairs (cued rewards, suppressed). The number of Purkinje cells =361 (reward-activated) and 450 (reward-suppressed) pooled from 5 mice. Black line represents linear fit through all data points (not binned).

Supplementary Figure 8 Stronger suppression of reward response on operant trials with an easier task.

a. Top: Trial-averaged population response of a representative field of view to random, operant, and tone-cued rewards on a different task version in which all wheel movements towards the midline were rewarded. ROIs are by mediolateral position within the field of view. Middle: Trial-averaged steering wheel velocity. Bottom: Trial-averaged licking response. Velocity and licking are shown as mean ± s.e.m. across trials (n = 11 random rewards, 210 trial rewards, and 20 tone-cued rewards). b. Scatter plots showing pairwise comparisons of response amplitude (computed as mean over 0 to +100 ms after each reward-related event) across different reward conditions. N = 556 neurons from 2 FOVs in 2 mice. Data points from representative field of view (panel a) are shown in darker gray. c. Relative response magnitude in neurons responsive to random reward (mean response over 0 to 100 ms after random reward > 2 s.d. above baseline) in mice trained on the task used throughout the majority of this study (black) and the easier task version (gray). Data are shown as mean ± s.e.m., and n = 400 neurons (of 891) for mice trained on the hard task and n = 417 neurons (of 556) for mice trained on the easy task (Kruskal-Wallis test, H = 2543, d.f. = 7, p < 1 × 10⁻⁹⁹, significance values for Bonferroni-corrected individual comparisons: hard task vs easy task (trial reward), p = 5 × 10⁻²⁵; hard task vs easy task (cued reward), p = 9 × 10⁻¹⁵). Statistics summary: ***p < 0.001.

Supplementary Figure 9 Recordings from Purkinje cells using Neuropixels probes.

a. Histological localization of the Neuropixels recording track in a coronal section of cerebellum (6.2 mm caudal from bregma). Blue: Neurotrace, Red: DiI (highlighting the electrode track). Dashed red line shows the projection of the electrode track onto this anatomical section. b. Examples of complex spikes (CSs) and simple spikes (SSs) recorded using Neuropixels probes. Each panel shows 100 overlaid raw traces (with mean and s.d. shown as black line and shaded area, respectively) on adjacent recording sites (20 µm vertical and horizontal separation), situated from -1890 µm to -1990 µm below the surface of the brain shown in panel a. Red: CS recordings showing channels located in the molecular (top channels) and Purkinje (middle channels) layers of the cerebellar cortex. Blue: corresponding SS recordings from the same putative Purkinje cells exhibiting post-CS pauses, as in Fig. 4c (SS1 and SS2 associated with CS1 and CS2, respectively). c. Example raw traces (gray) recorded on three adjacent vertically consecutive channels (20 µm separation) of a Neuropixels probe within the Purkinje cell layer. The spikes of the simple spike cluster (black) and complex spike cluster (red) shown in Fig. 4a–c are highlighted. Note that several other simple and complex spike clusters were identified within this Purkinje cell layer but are not highlighted for clarity.

Supplementary Figure 10 Video analysis of orofacial movements on omission trials.

To better understand how complex spiking responses upon reward omission might be reflected behaviorally, we analyzed videos of orofacial movements in a subset of mice presented with real and fictive trial reward and with delivered and omitted cued rewards. We observed that upon violations of reward expectations, mice exhibited larger orofacial movements, consistent with them searching for an expected reward. a. Two video frames (taken at 100 Hz) show mouse 500 ms before a random reward was delivered (left) and 220 ms after the random reward was delivered (right). White dashed boxes indicate regions of analysis (~4 × 8 mm). b. Orofacial movement signal recorded for a single random reward (left, corresponding to video frames in panel a) and the mean orofacial movement signal (right) across all random rewards in a session (n = 30). c. Mean orofacial movement signals from example mouse (same session as panels a and b) on operant trials (left) with real reward (black, n = 52) and fictive reward (red, n = 8), and signals on cued reward trials (right) on which reward was given (black, n = 40) and omitted (red, n = 14). d. Same as panel c but for a different mouse. N = 54 operant trials with real reward, 14 operant trials with fictive reward, 53 cue + reward trials, and 12 cue + omission trials. Data in panels b - d are shown as mean ± s.d. across trials.

Supplementary Figure 11 Modulation of reward responses calculated using unweighted event detection.

Data used in this figure are the same as in Fig. 3 of the main text, but all detected event amplitudes are set to a value of 1. a. Schematic of reward perturbation experiments: during each behavioral session, we randomly interspersed random rewards (10% of inter-trial intervals) or tone-cued rewards (also 10% of inter-trial intervals; 500 ms delay between cue onset and reward). b. Top: Trial-averaged population response of a representative field of view (same as Fig. 2) to random, operant, and tone-cued rewards. Dendritic calcium event matrix was binarized for this analysis, setting all event amplitudes to a value of 1. ROIs are sort first by medio-lateral position of identified microzones, then mediolaterally within each identified microzone. Color blocks adjacent to each heatmap denote microzonal designation, following the color scheme of Fig. 2 (gray = unclustered). Middle: Trial-averaged steering wheel velocity. Bottom: Trial-averaged licking response. Velocity and licking are shown as mean ± s.e.m. across trials. N = 30 random rewards, 156 trial rewards, and 30 tone-cued rewards. c. Scatter plots showing pairwise comparisons of response amplitude (computed as mean over 0 to +100 ms after each reward-related event) across different reward conditions. N = 891 neurons from 5 FOVs in 5 mice. Data points from representative field of view (panel b) are shown in darker gray. d. Cell-wise average of Purkinje cell dendritic response to each reward-related event. Data are shown as mean ± s.e.m. (n = 891 neurons from 5 FOVs in 5 mice, Kruskal-Wallis test, H = 428, d.f. = 3, p = 2 × 10⁻⁹², significance values for Bonferroni-corrected individual comparisons: random vs trial reward, p = 4 × 10⁻¹⁷; random vs cued reward, p = 1 × 10⁻³²; trial vs cued reward, p = 0.004; trial reward vs tone cue, p = 1 × 10⁻⁵¹; cued reward vs tone cue, p = 4 × 10⁻⁷⁷). e, Cell-wise comparison of Pearson’s correlations between pairs of reward-related events computed over 0 to +500 ms after each event. Data are shown as box plots: center line, median; box edges, interquartile ranges; whiskers, range without outliers (1.5 times the interquartile range from box edges); gray points, outliers (n = 891 neurons from 5 FOVs in 5 mice, Kruskal-Wallis test, H = 245, d.f. = 3, p = 8 × 10⁻⁵³, significance values for Bonferroni-corrected individual comparisons: random and trial reward vs random and cued reward, p = 1 × 10⁻³⁴; random and trial reward vs trial and cued reward, p = 6 × 10⁻³⁷; random and cued reward vs trial and cued reward, p > 0.9; random and cued reward vs random reward and tone cue, p = 5 × 10⁻¹⁷; trial and cued rewards vs random reward and tone cue, p = 1 × 10⁻¹⁸). f. Time course of mean responses across reward conditions for Purkinje cells in reward-activated microzones (top, n = 361 neurons) and reward-suppressed microzones (bottom, n = 470 neurons). Note that 60 neurons were not clustered into a microzone and so excluded from this analysis. Data are shown as mean ± s.e.m. Statistics summary: n.s. = not significant, **p < 0.01, ***p < 0.001.

Supplementary information

Supplementary Information

Supplementary Figs. 1–11.

Reporting Summary

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kostadinov, D., Beau, M., Blanco-Pozo, M. et al. Predictive and reactive reward signals conveyed by climbing fiber inputs to cerebellar Purkinje cells. Nat Neurosci 22, 950–962 (2019). https://doi.org/10.1038/s41593-019-0381-8

Download citation

Received: 30 July 2018
Accepted: 11 March 2019
Published: 29 April 2019
Issue Date: June 2019
DOI: https://doi.org/10.1038/s41593-019-0381-8

This article is cited by

The cerebellum directly modulates the substantia nigra dopaminergic activity
- Samantha Washburn
- Maritza Oñate
- Kamran Khodakhah
Nature Neuroscience (2024)
Cerebellar state estimation enables resilient coupling across behavioural domains
- Ensor Rafael Palacios
- Paul Chadderton
- Conor Houghton
Scientific Reports (2024)
Organization of reward and movement signals in the basal ganglia and cerebellum
- Noga Larry
- Gil Zur
- Mati Joshua
Nature Communications (2024)
A cerebro-cerebellar network for learning visuomotor associations
- Naveen Sendhilnathan
- Andreea C. Bostan
- Michael E. Goldberg
Nature Communications (2024)
Exploring unknown environments: motivated developmental learning for autonomous navigation of mobile robots
- Yuyang Zhou
- Dongshu Wang
- Lei Liu
Intelligent Service Robotics (2024)