transformers/tests/quantization
Andrei Panferov e3fc90ae68
Cleaner Cache `dtype` and `device` extraction for CUDA graph generation for quantizers compatibility (#29079)
* input_layernorm as the beacon of hope

* cleaner dtype extraction

* AQLM + CUDA graph test

* is available check

* shorter text test
2024-02-27 09:32:39 +01:00
..
aqlm_integration Cleaner Cache `dtype` and `device` extraction for CUDA graph generation for quantizers compatibility (#29079) 2024-02-27 09:32:39 +01:00
autoawq `HfQuantizer` class for quantization-related stuff in `modeling_utils.py` (#26610) 2024-01-30 02:48:25 +01:00
bnb FIX [`bnb` / `tests`] Propagate the changes from #29092 to 4-bit tests (#29122) 2024-02-20 11:11:15 +01:00
gptq [GPTQ] Fix test (#28018) 2024-01-15 11:22:54 -05:00