1.1 KiB

Raw Permalink Blame History

Optimum

The Optimum library supports quantization for Intel, Furiosa, ONNX Runtime, GPTQ, and lower-level PyTorch quantization functions. Consider using Optimum for quantization if you're using specific and optimized hardware like Intel CPUs, Furiosa NPUs or a model accelerator like ONNX Runtime.

1.1 KiB Raw Permalink Blame History

Optimum

1.1 KiB

Raw Permalink Blame History