{"month":"03","date_updated":"2023-08-04T10:53:14Z","title":"The remarkable robustness of surrogate gradient learning for instilling complex function in spiking neural networks","acknowledgement":"F.Z. was supported by the Wellcome Trust (110124/Z/15/Z) and the Novartis Research Foundation. T.P.V. was supported by a Wellcome Trust Sir Henry Dale Research fellowship (WT100000), a Wellcome Trust Senior Research Fellowship (214316/Z/18/Z), and an ERC Consolidator Grant SYNAPSEEK.","article_processing_charge":"No","isi":1,"scopus_import":"1","date_created":"2020-08-12T12:08:24Z","has_accepted_license":"1","citation":{"ama":"Zenke F, Vogels TP. The remarkable robustness of surrogate gradient learning for instilling complex function in spiking neural networks. <i>Neural Computation</i>. 2021;33(4):899-925. doi:<a href=\"https://doi.org/10.1162/neco_a_01367\">10.1162/neco_a_01367</a>","ieee":"F. Zenke and T. P. Vogels, “The remarkable robustness of surrogate gradient learning for instilling complex function in spiking neural networks,” <i>Neural Computation</i>, vol. 33, no. 4. MIT Press, pp. 899–925, 2021.","short":"F. Zenke, T.P. Vogels, Neural Computation 33 (2021) 899–925.","ista":"Zenke F, Vogels TP. 2021. The remarkable robustness of surrogate gradient learning for instilling complex function in spiking neural networks. Neural Computation. 33(4), 899–925.","mla":"Zenke, Friedemann, and Tim P. Vogels. “The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks.” <i>Neural Computation</i>, vol. 33, no. 4, MIT Press, 2021, pp. 899–925, doi:<a href=\"https://doi.org/10.1162/neco_a_01367\">10.1162/neco_a_01367</a>.","chicago":"Zenke, Friedemann, and Tim P Vogels. “The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks.” <i>Neural Computation</i>. MIT Press, 2021. <a href=\"https://doi.org/10.1162/neco_a_01367\">https://doi.org/10.1162/neco_a_01367</a>.","apa":"Zenke, F., &#38; Vogels, T. P. (2021). The remarkable robustness of surrogate gradient learning for instilling complex function in spiking neural networks. <i>Neural Computation</i>. MIT Press. <a href=\"https://doi.org/10.1162/neco_a_01367\">https://doi.org/10.1162/neco_a_01367</a>"},"type":"journal_article","oa_version":"Published Version","department":[{"_id":"TiVo"}],"article_type":"original","status":"public","ec_funded":1,"issue":"4","publisher":"MIT Press","abstract":[{"lang":"eng","text":"Brains process information in spiking neural networks. Their intricate connections shape the diverse functions these networks perform. In comparison, the functional capabilities of models of spiking networks are still rudimentary. This shortcoming is mainly due to the lack of insight and practical algorithms to construct the necessary connectivity. Any such algorithm typically attempts to build networks by iteratively reducing the error compared to a desired output. But assigning credit to hidden units in multi-layered spiking networks has remained challenging due to the non-differentiable nonlinearity of spikes. To avoid this issue, one can employ surrogate gradients to discover the required connectivity in spiking network models. However, the choice of a surrogate is not unique, raising the question of how its implementation influences the effectiveness of the method. Here, we use numerical simulations to systematically study how essential design parameters of surrogate gradients impact learning performance on a range of classification problems. We show that surrogate gradient learning is robust to different shapes of underlying surrogate derivatives, but the choice of the derivative’s scale can substantially affect learning performance. When we combine surrogate gradients with a suitable activity regularization technique, robust information processing can be achieved in spiking networks even at the sparse activity limit. Our study provides a systematic account of the remarkable robustness of surrogate gradient learning and serves as a practical guide to model functional spiking neural networks."}],"ddc":["000","570"],"year":"2021","publication_identifier":{"eissn":["1530-888X"],"issn":["0899-7667"]},"pmid":1,"author":[{"first_name":"Friedemann","orcid":"0000-0003-1883-644X","full_name":"Zenke, Friedemann","last_name":"Zenke"},{"first_name":"Tim P","id":"CB6FF8D2-008F-11EA-8E08-2637E6697425","orcid":"0000-0003-3295-6181","last_name":"Vogels","full_name":"Vogels, Tim P"}],"doi":"10.1162/neco_a_01367","_id":"8253","volume":33,"external_id":{"isi":["000663433900003"],"pmid":["33513328"]},"project":[{"call_identifier":"H2020","grant_number":"819603","name":"Learning the shape of synaptic plasticity rules for neuronal architectures and function through machine learning.","_id":"0aacfa84-070f-11eb-9043-d7eb2c709234"},{"_id":"c084a126-5a5b-11eb-8a69-d75314a70a87","name":"What’s in a memory? Spatiotemporal dynamics in strongly coupled recurrent neuronal networks.","grant_number":"214316/Z/18/Z"}],"date_published":"2021-03-01T00:00:00Z","publication_status":"published","day":"01","publication":"Neural Computation","user_id":"4359f0d1-fa6c-11eb-b949-802e58b17ae8","page":"899-925","file_date_updated":"2022-04-08T06:05:39Z","file":[{"date_updated":"2022-04-08T06:05:39Z","file_size":1611614,"content_type":"application/pdf","creator":"dernst","checksum":"eac5a51c24c8989ae7cf9ae32ec3bc95","file_id":"11131","date_created":"2022-04-08T06:05:39Z","success":1,"access_level":"open_access","relation":"main_file","file_name":"2021_NeuralComputation_Zenke.pdf"}],"intvolume":"        33","language":[{"iso":"eng"}],"quality_controlled":"1","oa":1,"tmp":{"legal_code_url":"https://creativecommons.org/licenses/by/4.0/legalcode","short":"CC BY (4.0)","name":"Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)","image":"/images/cc_by.png"}}