<?xml version="1.0" encoding="UTF-8"?>

<modsCollection xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
<mods version="3.3">

<genre>article</genre>

<titleInfo><title>Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks</title></titleInfo>


<note type="publicationStatus">published</note>


<note type="qualityControlled">yes</note>

<name type="personal">
  <namePart type="given">Torsten</namePart>
  <namePart type="family">Hoefler</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Dan-Adrian</namePart>
  <namePart type="family">Alistarh</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">4A899BFC-F248-11E8-B48F-1D18A9856A87</identifier><description xsi:type="identifierDefinition" type="orcid">0000-0003-3650-940X</description></name>
<name type="personal">
  <namePart type="given">Tal</namePart>
  <namePart type="family">Ben-Nun</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Nikoli</namePart>
  <namePart type="family">Dryden</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Elena-Alexandra</namePart>
  <namePart type="family">Peste</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">32D78294-F248-11E8-B48F-1D18A9856A87</identifier></name>







<name type="corporate">
  <namePart></namePart>
  <identifier type="local">DaAl</identifier>
  <role>
    <roleTerm type="text">department</roleTerm>
  </role>
</name>








<abstract lang="eng">The growing energy and performance costs of deep learning have driven the community to reduce the size of neural networks by selectively pruning components. Similarly to their biological counterparts, sparse networks generalize just as well as, and sometimes even better than, the original dense networks. Sparsity promises to reduce the memory footprint of regular networks to fit mobile devices, as well as shorten training time for ever-growing networks. In this paper, we survey prior work on sparsity in deep learning and provide an extensive tutorial of sparsification for both inference and training. We describe approaches to remove and add elements of neural networks, different training strategies to achieve model sparsity, and mechanisms to exploit sparsity in practice. Our work distills ideas from more than 300 research papers and provides guidance to practitioners who wish to utilize sparsity today, as well as to researchers whose goal is to push the frontier forward. We include the necessary background on mathematical methods in sparsification, describe phenomena such as early structure adaptation, the intricate relations between sparsity and the training process, and show techniques for achieving acceleration on real hardware. We also define a metric of pruned parameter efficiency that could serve as a baseline for comparison of different sparse networks. We close by speculating on how sparsity can improve future workloads and outline major open problems in the field.</abstract>

<relatedItem type="constituent">
  <location>
    <url displayLabel="2021_JMachLearnRes_Hoefler.pdf">https://research-explorer.ista.ac.at/download/10180/10192/2021_JMachLearnRes_Hoefler.pdf</url>
  </location>
  <physicalDescription><internetMediaType>application/pdf</internetMediaType></physicalDescription><accessCondition type="restrictionOnAccess">no</accessCondition>
</relatedItem><accessCondition type="use and reproduction">https://creativecommons.org/licenses/by/4.0/</accessCondition>
<originInfo><publisher>ML Research Press</publisher><dateIssued encoding="w3cdtf">2021</dateIssued>
</originInfo>
<language><languageTerm authority="iso639-2b" type="code">eng</languageTerm>
</language>



<relatedItem type="host"><titleInfo><title>Journal of Machine Learning Research</title></titleInfo>
  <identifier type="issn">1532-4435</identifier>
  <identifier type="eIssn">1533-7928</identifier>
  <identifier type="arXiv">2102.00554</identifier>
<part><detail type="volume"><number>22</number></detail><detail type="issue"><number>241</number></detail><extent unit="pages">1-124</extent>
</part>
</relatedItem>


<extension>
<bibliographicCitation>
<mla>Hoefler, Torsten, et al. “Sparsity in Deep Learning: Pruning and Growth for Efficient Inference and Training in Neural Networks.” &lt;i&gt;Journal of Machine Learning Research&lt;/i&gt;, vol. 22, no. 241, ML Research Press, 2021, pp. 1–124.</mla>
<ama>Hoefler T, Alistarh D-A, Ben-Nun T, Dryden N, Krumes A. Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks. &lt;i&gt;Journal of Machine Learning Research&lt;/i&gt;. 2021;22(241):1-124.</ama>
<short>T. Hoefler, D.-A. Alistarh, T. Ben-Nun, N. Dryden, A. Krumes, Journal of Machine Learning Research 22 (2021) 1–124.</short>
<apa>Hoefler, T., Alistarh, D.-A., Ben-Nun, T., Dryden, N., &amp; Krumes, A. (2021). Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks. &lt;i&gt;Journal of Machine Learning Research&lt;/i&gt;. ML Research Press.</apa>
<ista>Hoefler T, Alistarh D-A, Ben-Nun T, Dryden N, Krumes A. 2021. Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks. Journal of Machine Learning Research. 22(241), 1–124.</ista>
<ieee>T. Hoefler, D.-A. Alistarh, T. Ben-Nun, N. Dryden, and A. Krumes, “Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks,” &lt;i&gt;Journal of Machine Learning Research&lt;/i&gt;, vol. 22, no. 241. ML Research Press, pp. 1–124, 2021.</ieee>
<chicago>Hoefler, Torsten, Dan-Adrian Alistarh, Tal Ben-Nun, Nikoli Dryden, and Alexandra Krumes. “Sparsity in Deep Learning: Pruning and Growth for Efficient Inference and Training in Neural Networks.” &lt;i&gt;Journal of Machine Learning Research&lt;/i&gt;. ML Research Press, 2021.</chicago>
</bibliographicCitation>
</extension>
<recordInfo><recordIdentifier>10180</recordIdentifier><recordCreationDate encoding="w3cdtf">2021-10-24T22:01:34Z</recordCreationDate><recordChangeDate encoding="w3cdtf">2025-06-26T11:53:12Z</recordChangeDate>
</recordInfo>
</mods>
</modsCollection>
