Commit Graph

16041 Commits

Author SHA1 Message Date
ydshieh 48681e6e5e build 2024-05-31 17:21:06 +02:00
ydshieh a51a3fbc48 build 2024-05-31 17:12:01 +02:00
ydshieh 4da667edfd build 2024-05-31 15:35:19 +02:00
ydshieh 65bce8f0f9 build 2024-05-31 14:25:07 +02:00
ydshieh 4ef4539d89 build 2024-05-31 14:23:33 +02:00
ydshieh 0918d509b9 build 2024-05-31 14:18:01 +02:00
ydshieh e3ed8d91c8 build 2024-05-31 14:01:49 +02:00
ydshieh 1c76378693 build 2024-05-31 13:55:26 +02:00
ydshieh 01f8532d51 build 2024-05-31 13:23:18 +02:00
ydshieh 57bf79956f build 2024-05-31 12:38:55 +02:00
ydshieh 231aed279f build 2024-05-31 12:25:56 +02:00
ydshieh 100e0dc336 build 2024-05-30 16:59:59 +02:00
ydshieh 8c2a653809 build 2024-05-30 16:46:19 +02:00
ydshieh ef197e6197 benchmark workflow 2024-05-30 16:40:09 +02:00
ydshieh edfc006d93 benchmark workflow 2024-05-30 16:29:58 +02:00
ydshieh 55e826eb02 benchmark workflow 2024-05-30 16:22:59 +02:00
ydshieh 058cecf0a2 benchmark workflow 2024-05-30 16:12:14 +02:00
Younes Belkada 5e5c4d629d
FIX / Quantization: Add extra validation for bnb config (#31135)
add validation for bnb config
2024-05-30 11:45:03 +02:00
Yih-Dar 2b9e252b16
Cleanup docker build (#31119)
* remove

* build

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-05-29 19:43:51 +02:00
Dhruv Pai 5c88253556
Add on_optimizer_step to callback options (#31095)
* Modified test

* Added on_optimizer_step to callbacks

* Move callback after step is called

* Added on optimizer step callback
2024-05-29 16:20:59 +02:00
Joao Gante 4af705c6ce
Add VLM generation default contributor (#31115)
* add Raushan

* add Raushan
2024-05-29 15:17:14 +01:00
Younes Belkada cb879c5801
FIX / Docs: Fix GPTQ expected number of bits (#31111)
Update overview.md
2024-05-29 15:56:28 +02:00
Yih-Dar 1f84141391
Fix nightly circleci (#31114)
* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-05-29 15:42:39 +02:00
Zach Mueller d16053c867
Rm maintainer + migrate (#31089) 2024-05-29 09:35:37 -04:00
Matt 0bef4a2738
Fix faulty rstrip in module loading (#31108) 2024-05-29 13:33:26 +01:00
Matt 97a58a5d2c
Fix env.py in cases where torch is not present (#31113)
* Fix env.py in cases where torch is not present

* Simplify the fix (and avoid some issues)
2024-05-29 13:20:36 +01:00
Huazhong Ji c8861376ad
Improve `transformers-cli env` reporting (#31003)
* Improve `transformers-cli env` reporting

* move the line `"Using GPU in script?": "<fill in>"` to in if conditional
statement

* same option for npu
2024-05-29 11:57:54 +01:00
Lucain c3044ec2f3
Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016)
* Fix has_file in offline mode

* harmonize env variable for offline mode

* Switch to HF_HUB_OFFLINE

* fix test

* revert test_offline to test TRANSFORMERS_OFFLINE

* Add new offline test

* merge conflicts

* docs
2024-05-29 11:55:43 +01:00
Younes Belkada bfe6f513b9
FEAT: Add mistral v3 conversion script (#30981)
* add mistral v3 conversion script

* Update src/transformers/models/mistral/convert_mistral_weights_to_hf.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-05-29 11:43:54 +02:00
Raushan Turganbay d521ba5797
Quantized KV cache: update quanto (#31052)
* quanto latest version was refactored

* add error msg

* incorrect compare sign

* Update src/transformers/cache_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-05-29 14:25:44 +05:00
amyeroberts a564d10afe
Deprecate low use models (#30781)
* Deprecate models
- graphormer
- time_series_transformer
- xlm_prophetnet
- qdqbert
- nat
- ernie_m
- tvlt
- nezha
- mega
- jukebox
- vit_hybrid
- x_clip
- deta
- speech_to_text_2
- efficientformer
- realm
- gptsan_japanese

* Fix up

* Fix speech2text2 imports

* Make sure message isn't indented

* Fix docstrings

* Correctly map for deprecated models from model_type

* Uncomment out

* Add back time series transformer and x-clip

* Import fix and fix-up

* Fix up with updated ruff
2024-05-28 18:07:07 +01:00
Younes Belkada 7f08817be4
Docs / Quantization: Redirect deleted page (#31063)
Update _redirects.yml
2024-05-28 18:29:22 +02:00
Younes Belkada 3264be4114
TST: Fix instruct-blip tests (#31088)
* fix flan t5 tests

* better format
2024-05-28 18:29:11 +02:00
Jonny Li 476890e9ae
Fix DeepSpeed compatibility with weight_norm (#30881) (#31018) 2024-05-28 17:25:15 +01:00
Albert Villanova del Moral aada568f73
Fix PretrainedConfig docstring with deprecated resume_download (#31014) 2024-05-28 17:47:35 +02:00
Yih-Dar 3af7bf30ad
skip `test_multi_gpu_data_parallel_forward` for `vit` and `deit` (#31086)
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-05-28 17:44:52 +02:00
Younes Belkada ab19f907fd
FIX / OPT: Fix OPT multi-GPU training for `OPTForQuestionAnswering` (#31092)
Update modeling_opt.py
2024-05-28 17:06:00 +02:00
Younes Belkada 94d416f018
FIX: Add `accelerate` as a hard requirement (#31090)
add accelerate
2024-05-28 17:05:44 +02:00
Sigbjørn Skjæret 22dab246c5
Render chat template tojson filter as unicode (#31041)
* Render chat template tojson filter as unicode

* ruff--
2024-05-28 15:02:51 +01:00
Younes Belkada 4f98b14465
Docs / PEFT: Add PEFT API documentation (#31078)
* add peft references

* add peft references

* Update docs/source/en/peft.md

* Update docs/source/en/peft.md
2024-05-28 15:04:43 +02:00
Raushan Turganbay 779bc360ff
Watermark: fix tests (#30961)
* fix tests

* style

* Update tests/generation/test_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-05-28 17:07:42 +05:00
Lysandre Debut a3c7b59e31
Fix failing tokenizer tests (#31083)
* Fix failing tokenizer tests

* Use small tokenizer

* Fix remaining reference
2024-05-28 13:34:23 +02:00
NielsRogge 90da0b1c9f
[SuperPoint, PaliGemma] Update docs (#31025)
* Update docs

* Add PaliGemma resources

* Address comment

* Update docs
2024-05-28 13:22:06 +02:00
Sina Taslimi 66add161dc
Fix typo in trainer.py (#31048) 2024-05-28 12:09:32 +01:00
Pavel Iakubovskii 98e2d48e9a
Fix OWLv2 post_process_object_detection for multiple images (#31082)
* Add test for multiple images

* [run slow] owlv2

* Fix box rescaling

* [run slow] owlv2
2024-05-28 12:06:06 +01:00
Pavel Iakubovskii c31473ed44
Remove float64 cast for OwlVit and OwlV2 to support MPS device (#31071)
Remove float64
2024-05-28 11:41:40 +01:00
oOraph 936ab7bae5
fix from_pretrained in offline mode when model is preloaded in cache (#31010)
* Unit test to verify fix

Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

* fix from_pretrained in offline mode when model is preloaded in cache

Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

* minor: fmt

Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

---------

Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>
Co-authored-by: Raphael Glon <oOraph@users.noreply.github.com>
2024-05-28 11:56:05 +02:00
Hengwen Tong 537deb7869
Remove redundant backend checks in training_args.py (#30999)
* Remove backend checks in training_args.py

* Expilicit initialize the device

---------

Co-authored-by: tonghengwen <tonghengwen@cambricon.com>
2024-05-28 11:52:47 +02:00
AP dd4654eab7
Update quicktour.md to fix broken link to Glossary (#31072)
Update quicktour.md to fix broken link

Missing '/' in attention mask link in the transformers quicktour
2024-05-28 11:50:45 +02:00
Clint Adams e18da4e3f2
fix "piano" typo (#31027) 2024-05-28 11:48:23 +02:00