transformers/docs/source/en/internal
Raushan Turganbay d583f1317b
Quantized KV Cache (#30483)
* clean-up

* Update src/transformers/cache_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/cache_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/cache_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup

* Update tests/quantization/quanto_integration/test_quanto.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/generation/configuration_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* more suggestions

* mapping if torch available

* run tests & add 'support_quantized' flag

* fix jamba test

* revert, will be fixed by another PR

* codestyle

* HQQ and versatile cache classes

* final update

* typo

* make tests happy

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
2024-05-23 17:25:20 +05:00
..
audio_utils.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
file_utils.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
generation_utils.md Quantized KV Cache (#30483) 2024-05-23 17:25:20 +05:00
image_processing_utils.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
modeling_utils.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
pipelines_utils.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
time_series_utils.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
tokenization_utils.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
trainer_utils.md translate internal folder files to chinese (#27638) 2023-12-04 10:04:28 -08:00