transformers

Commit Graph

Author	SHA1	Message	Date
Yih-Dar	5fd5ef7624	Fix docker file (#28452 ) fix docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-01-11 15:34:05 +01:00
Joao Gante	ee2482b6f8	CI: limit natten version (#28432 )	2024-01-10 12:39:05 +00:00
Patrick von Platen	8604dd308d	[SDPA] Make sure attn mask creation is always done on CPU (#28400 ) * [SDPA] Make sure attn mask creation is always done on CPU * Update docker to 2.1.1 * revert test change	2024-01-09 11:05:19 +01:00
Younes Belkada	fa21ead73d	[`Awq`] Enable the possibility to skip quantization for some target modules (#27950 ) * v1 * add docstring * add tests * add awq 0.1.8 * oops * fix test	2023-12-25 11:06:56 +01:00
Younes Belkada	fdb85be40f	Faster generation using AWQ + Fused modules (#27411 ) * v1 fusing modules * add fused mlp support * up * fix CI * block save_pretrained * fixup * small fix * add new condition * add v1 docs * add some comments * style * fix nit * adapt from suggestion * add check * change arg names * change variables name * Update src/transformers/integrations/awq.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * style * split up into 3 different private methods * more conditions * more checks * add fused tests for custom models * fix * fix tests * final update docs * final fixes * fix importlib metadata * Update src/transformers/utils/quantization_config.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * change it to `do_fuse` * nit * Update src/transformers/utils/quantization_config.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * Update src/transformers/utils/quantization_config.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * Update src/transformers/utils/quantization_config.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * few fixes * revert * fix test * fix copies * raise error if model is not quantized * add test * use quantization_config.config when fusing * Update src/transformers/modeling_utils.py --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>	2023-12-05 12:14:45 +01:00
Yih-Dar	3b59621310	Install `python-Levenshtein` for `nougat` in CI image (#27465 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-11-13 16:38:13 +01:00
Younes Belkada	26d8d5f211	Fix autoawq docker image (#27339 ) * Update Dockerfile * Update docker/transformers-all-latest-gpu/Dockerfile	2023-11-07 11:21:04 +01:00
Yih-Dar	d788d37d24	Fix daily CI image build (#27307 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-11-06 11:27:22 +01:00
Younes Belkada	ae093eef01	[`core` / `Quantization` ] AWQ integration (#27045 ) * working v1 * oops * Update src/transformers/modeling_utils.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * fixup * oops * push * more changes * add docs * some fixes * fix copies * add v1 doc * added installation guide * relax constraints * revert * attempt llm-awq * oops * oops * fixup * raise error when incorrect cuda compute capability * nit * add instructions for llm-awq * fixup * fix copies * fixup and docs * change * few changes + add demo * add v1 tests * add autoawq in dockerfile * finalize * Update tests/quantization/autoawq/test_awq.py * fix test * fix * fix issue * Update src/transformers/integrations/awq.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update docs/source/en/main_classes/quantization.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update docs/source/en/main_classes/quantization.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/integrations/awq.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/integrations/awq.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * add link to example script * Update docs/source/en/main_classes/quantization.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * add more content * add more details * add link to quantization docs * camel case + change backend class name * change to string * fixup * raise errors if libs not installed * change to `bits` and `group_size` * nit * nit * Apply suggestions from code review Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * disable training * address some comments and fix nits * fix * final nits and fix tests * adapt to our new runners * make fix-copies * Update src/transformers/utils/quantization_config.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/utils/quantization_config.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/integrations/awq.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/integrations/awq.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * move to top * add conversion test * final nit * add more elaborated test --------- Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-11-01 09:06:31 +01:00
Yih-Dar	b219ae6bd4	Update docker files to use `torch==2.1.0` (#26735 ) Update docker files to use torch 2.1 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-10-11 16:23:36 +02:00
Younes Belkada	584eeb5387	[`AutoGPTQ`] Add correct installation of GPTQ library + fix slow tests (#25713 ) * add correct installation of GPTQ library * update tests values	2023-08-24 14:57:16 +02:00
Younes Belkada	faed2ca46f	[`PEFT`] Peft integration alternative design (#25077 ) * a draft version * v2 integration * fix * make it more generic and works for IA3 * add set adapter and multiple adapters support * fixup * adapt a bit * oops * oops * oops * adapt more * fix * add more refactor * now works with model class * change it to instance method as it causes issues with `jit`. * add CR * change method name * add `add_adapter` method * clean up * Update src/transformers/adapters/peft_mixin.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * add moe utils * fixup * Update src/transformers/adapters/peft_mixin.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * adapt * oops * fixup * add is_peft_available * remove `requires_backend` * trainer compatibility * fixup + docstring * more details * trigger CI * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_utils.py * fixup + is_main_process * added `save_peft_format` in save_pretrained * up * fix nits here and there * nits here and there. * docs * revert `encoding="utf-8"` * comment * added slow tests before the PEFT release. * fixup and nits * let's be on the safe zone * added more comments * v1 docs * add remaining docs * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * move to `lib_integrations` * fixup * this time fixup * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address final comments * refactor to use `token` * add PEFT to DockerFile for slow tests. * added pipeline support. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-08-18 19:08:03 +02:00
Younes Belkada	d4c0aa1443	[`Tests`] Fix failing 8bit test (#25564 ) * fix failing 8bit test * trigger CI	2023-08-17 17:34:25 +02:00
Marc Sun	55db70c63d	GPTQ integration (#25062 ) * GTPQ integration * Add tests for gptq * support for more quantization model * fix style * typo * fix method * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add dataclass and fix quantization_method * fix doc * Update tests/quantization/gptq/test_gptq.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * modify dataclass * add gtpqconfig import * fix typo * fix tests * remove dataset as req arg * remove tokenizer import * add offload cpu quantization test * fix check dataset * modify dockerfile * protect trainer * style * test for config * add more log * overwrite torch_dtype * draft doc * modify quantization_config docstring * fix class name in docstring * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * more warning * fix 8bit kwargs tests * peft compatibility * remove var * fix is_gptq_quantized * remove is_gptq_quantized * fix wrap * Update src/transformers/modeling_utils.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * add exllama * skip test * overwrite float16 * style * fix skip test * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix docsting formatting * add doc * better test --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>	2023-08-10 16:06:29 -04:00
Yih-Dar	b0f23036f1	Update TF pin in docker image (#25343 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-07 12:32:34 +02:00
Yih-Dar	0fd8d2aa2c	Fix docker image build failure (#25214 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-31 20:13:15 +02:00
Yih-Dar	906afa1d5c	Revert "Unpin protobuf in docker file (for daily CI)" (#24800 ) Revert "Unpin protobuf in docker file (for daily CI) (#24761)" This reverts commit `45025d92f8`.	2023-07-13 04:19:45 +02:00
Yih-Dar	45025d92f8	Unpin protobuf in docker file (for daily CI) (#24761 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-11 23:55:55 +02:00
Yih-Dar	22a0769933	Update 3 docker files to use cu118 (#23406 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-17 14:26:50 +02:00
Yih-Dar	cf11493dce	Use cu118 with cudnn >= 8.6 in docker file (#23339 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-12 21:58:15 +02:00
Yih-Dar	8c8744a94a	Fix docker image (caused by `tensorflow_text`) (#23321 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-12 13:37:37 +02:00
Yih-Dar	ba71d9e94c	unpin tf prob (#23293 ) * unpin tf prob --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-11 21:28:08 +02:00
Yih-Dar	5f26a23d03	pin `tensorflow-probability` in docker files (#23260 ) * pong TF prob * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-10 16:21:09 +02:00
fxmarty	3042c63a95	Add methods to PreTrainedModel to use PyTorch's BetterTransformer (#21259 ) * fix mess * better documentation * typo * fix doc * update * add test * fix test * more tests * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * move to utils * Apply suggestions from code review Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * nit --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>	2023-04-27 11:03:42 +02:00
Yih-Dar	01203475c9	Update docker files to use official torch 2.0.0 (#22357 ) * update docker files to use official torch 2.0.0 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-03-24 14:29:05 +01:00
Yih-Dar	bec075612a	Revert "Use `dash==2.8.1` for now for daily CI" (#22233 ) Revert "Use `dash==2.8.1` for now for daily CI (#22227)" This reverts commit `53218671d9`.	2023-03-17 16:54:27 +01:00
Yih-Dar	53218671d9	Use `dash==2.8.1` for now for daily CI (#22227 ) Use dash 2.8.1 for now Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-03-17 13:27:14 +01:00
Yih-Dar	ba9e0191de	Prepare daily CI for torch 2.0.0 (#22135 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-03-13 22:21:15 +01:00
amyeroberts	3412f5979d	Use PyAV instead of Decord in examples (#21572 ) * Use PyAV instead of Decord * Get frame indices * Fix number of frames * Update src/transformers/models/videomae/image_processing_videomae.py * Fix up * Fix copies * Update timesformer doctests * Update docstrings	2023-03-02 12:30:38 +00:00
Yih-Dar	db572b3854	Use torch `1.13.1` in push/schedule CI (#21421 ) Use torch 1.13.1 in push/scheduled CI Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-02 14:58:52 +01:00
Yih-Dar	d4bf9ee1ff	Update CI to torch 1.13.0 (#20687 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-12 20:04:56 +01:00
Yih-Dar	147fa37fb1	pin TF 2.11 in docker files (#20642 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-07 15:46:48 +01:00
Yih-Dar	f68796bd60	Fix `natten` installation in docker file (#20632 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-06 22:23:06 +01:00
Yih-Dar	8639cfb4c2	Install `natten` with CUDA version (#20546 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-05 15:08:32 +01:00
Yih-Dar	dd6fb1319b	Add `natten` for CI (#20511 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-30 19:49:34 +01:00
Yih-Dar	f10cdba22e	Pin TF 2.10.1 for Push CI (#20319 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-18 18:24:35 +01:00
Bartosz Szmelczynski	78a471ff71	Fix tapas scatter (#20149 ) * First draft * Remove scatter dependency * Add require_torch * update vectorized sum test, add clone call * remove artifacts * fix style * fix style v2 * remove "scatter" mentions from the code base * fix isort error Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-14 01:04:26 -05:00
raghavanone	7829c890db	Change the import of kenlm from github to pypi (#19770 ) * Change the import of kenlm from github to pypi * Change the import of kenlm from github to pypi in circleci config * Fix code quality issues * Fix isort issue, add kenlm in extras for audio * Add kenlm to deps * Add kenlm to deps * Commit 'make fixup' changes * Remove version from kenlm deps * commit make fixup changes * Remove manual installation of kenlm * Remove manual installation of kenlm * Remove manual installation of kenlm	2022-10-26 17:06:46 +02:00
Yih-Dar	15fd39ea0e	Install tf2onnx dev version (#19755 ) * pin tf2onnx<=1.12.0 * Install tf2onnx main * Pin to a specific commit Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-20 20:24:39 +02:00
Yih-Dar	d7dc774a79	Fix `TFGroupViT` CI (#19461 ) * Fix TFGroupViT CI Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-11 14:29:15 +02:00
Joao Gante	1182b945a6	TF: TF 2.10 unpin + related onnx test skips (#18995 )	2022-09-12 19:30:27 +01:00
Sylvain Gugger	a26114777e	Revert "TF: unpin maximum TF version (#18917 )" (#18972 ) This reverts commit `d8cf3b2087`.	2022-09-10 09:11:46 -04:00
Joao Gante	d8cf3b2087	TF: unpin maximum TF version (#18917 )	2022-09-10 13:33:01 +01:00
Yih-Dar	6690ba3f4d	pin TF 2.9.1 for self-hosted CIs (#18925 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-07 19:46:14 +02:00
Yih-Dar	84beb8a49b	Unpin detectron2 (#18727 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-23 11:10:07 +02:00
Yih-Dar	30992ef0d9	[Hotfix] pin detectron2 5aeb252 to avoid test fix (#18701 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-20 00:37:38 +02:00
Younes Belkada	6d175c1129	[bnb] Minor modifications (#18631 ) * bnb minor modifications - refactor documentation - add troubleshooting README - add PyPi library on DockerFile * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review * put in one block - put bash instructions in one block * update readme - refactor a bit hardware requirements * change text a bit * Apply suggestions from code review Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * apply suggestions Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * add link to paper * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update tests/mixed_int8/README.md * Apply suggestions from code review * refactor a bit * add instructions Turing & Amperer Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * add A6000 * clarify a bit * remove small part * Update tests/mixed_int8/README.md Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2022-08-17 00:48:10 +02:00
Yih-Dar	510c2a0b32	Change scheduled CIs to use torch 1.12.1 (#18644 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-16 13:41:37 +02:00
Younes Belkada	4a51075a96	`bitsandbytes` - `Linear8bitLt` integration into `transformers` models (#17901 ) * first commit * correct replace function * add final changes - works like charm! - cannot implement tests yet - tested * clean up a bit * add bitsandbytes dependencies * working version - added import function - added bitsandbytes utils file * small fix * small fix - fix import issue * fix import issues * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor a bit - move bitsandbytes utils to utils - change comments on functions * reformat docstring - reformat docstring on init_empty_weights_8bit * Update src/transformers/__init__.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * revert bad formatting * change to bitsandbytes * refactor a bit - remove init8bit since it is useless * more refactoring - fixed init empty weights issue - added threshold param * small hack to make it work * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py * revmoe the small hack * modify utils file * make style + refactor a bit * create correctly device map * add correct dtype for device map creation * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply suggestions - remove with torch.grad - do not rely on Python bool magic! * add docstring - add docstring for new kwargs * add docstring - comment `replace_8bit_linear` function - fix weird formatting * - added more documentation - added new utility function for memory footprint tracking - colab demo to add * few modifs - typo doc - force cast into float16 when load_in_8bit is enabled * added colab link * add test architecture + docstring a bit * refactor a bit testing class * make style + refactor a bit * enhance checks - add more checks - start writing saving test * clean up a bit * male style * add more details on doc * add more tests - still needs to fix 2 tests * replace by "or" - could not fix it from GitHub GUI Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor a bit testing code + add readme * make style * fix import issue * Update src/transformers/modeling_utils.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * add few comments * add more doctring + make style * more docstring * raise error when loaded in 8bit * make style * add warning if loaded on CPU * add small sanity check * fix small comment * add bitsandbytes on dockerfile * Improve documentation - improve documentation from comments * add few comments * slow tests pass on the VM but not on the CI VM * Fix merge conflict * make style * another test should pass on a multi gpu setup * fix bad import in testing file * Fix slow tests - remove dummy batches - no more CUDA illegal memory errors * odify dockerfile * Update docs/source/en/main_classes/model.mdx * Update Dockerfile * Update model.mdx * Update Dockerfile * Apply suggestions from code review * few modifications - lm head can stay on disk/cpu - change model name so that test pass * change test value - change test value to the correct output - torch bmm changed to baddmm in bloom modeling when merging * modify installation guidelines * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * replace `n`by `name` * merge `load_in_8bit` and `low_cpu_mem_usage` * first try - keep the lm head in full precision * better check - check the attribute `base_model_prefix` instead of computing the number of parameters * added more tests * Update src/transformers/utils/bitsandbytes.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Merge branch 'integration-8bit' of https://github.com/younesbelkada/transformers into integration-8bit * improve documentation - fix typos for installation - change title in the documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>	2022-08-10 09:13:36 +02:00
NielsRogge	82bb682643	[VideoMAE] Add model to doc tests (#18523 ) * Add videomae to doc tests * Add pip install decord Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-08-08 19:28:51 +02:00


60 Commits