TY  - CONF
AB  - We introduce Clover, a new library for efficient computation using low-precision data, providing mathematical routines required by fundamental methods in optimization and sparse recovery. Our library faithfully implements variants of stochastic quantization that guarantee convergence at low precision, and supports data formats from 4-bit quantized to 32-bit IEEE-754 on current Intel processors. In particular, we show that 4-bit can be implemented efficiently using Intel AVX despite the lack of native support for this data format. Experimental results with dot product, matrix-vector multiplication (MVM), gradient descent (GD), and iterative hard thresholding (IHT) demonstrate that the attainable speedups are in many cases close to linear with respect to the reduction of precision due to reduced data movement. Finally, for GD and IHT, we show examples of absolute speedup achieved by 4-bit versus 32-bit, by iterating until a given target error is achieved.
AU  - Stojanov, Alen
AU  - Smith, Tyler Michael
AU  - Alistarh, Dan-Adrian
AU  - Puschel, Markus
ID  - 6031
T2  - 2018 IEEE International Workshop on Signal Processing Systems
TI  - Fast quantized arithmetic on x86: Trading compute for data movement
VL  - 2018-October
ER  -