Commit Graph

15369 Commits

Author SHA1 Message Date
ydshieh 6ad1e309b5 update 2024-03-20 15:47:57 +01:00
ydshieh 0670a0110b update 2024-03-20 15:03:41 +01:00
ydshieh 185435c607 update 2024-03-20 11:47:28 +01:00
ydshieh 5f98ee7b46 update 2024-03-20 11:22:22 +01:00
ydshieh ff3b978e5a update 2024-03-20 10:38:13 +01:00
ydshieh 472ce78baf update 2024-03-18 10:48:09 +01:00
ydshieh 2a04bd3386 update 2024-03-18 10:27:09 +01:00
ydshieh 93d6ccaae0 update 2024-03-18 10:26:27 +01:00
ydshieh f25b457534 update 2024-03-18 10:18:11 +01:00
ydshieh a5070e90af update 2024-03-18 10:17:29 +01:00
ydshieh 751c4a4f86 update 2024-03-18 10:17:02 +01:00
ydshieh ec2a34a5af update 2024-03-18 10:16:23 +01:00
ydshieh 35ff6456c2 update 2024-03-18 10:11:20 +01:00
ydshieh cd2db2fdec update 2024-03-15 18:42:58 +01:00
ydshieh bb5da0421d update 2024-03-15 18:26:19 +01:00
ydshieh 01f89c9256 update 2024-03-15 17:07:50 +01:00
ydshieh 65224b5357 update 2024-03-15 16:54:38 +01:00
Yih-Dar 37e178847a
Update src/transformers/benchmark/benchmark_utils_generic.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-03-13 19:38:48 +01:00
Yih-Dar 47837ddd28
Update src/transformers/benchmark/from_pretrained_benchmark.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-03-13 19:38:36 +01:00
Yih-Dar e825a4169e
Update src/transformers/benchmark/from_pretrained_benchmark.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-03-13 19:38:12 +01:00
ydshieh d556c587fa update 2024-03-13 19:23:09 +01:00
ydshieh f9a86a78f1 update 2024-03-13 14:38:59 +01:00
ydshieh 324ef81649 update 2024-03-13 14:38:59 +01:00
ydshieh 2eb4a9d45b save 2024-03-13 14:38:59 +01:00
ydshieh d7dfdf7281 save 2024-03-13 14:38:59 +01:00
ydshieh 335d241be1 save 2024-03-13 14:38:59 +01:00
ydshieh 101b639651 style 2024-03-13 14:38:59 +01:00
ydshieh b00a094787 fix 2024-03-13 14:38:59 +01:00
ydshieh c27bdbe8e3 rename classes 2024-03-13 14:38:59 +01:00
ydshieh 5a7bf5c937 rename 2024-03-13 14:38:59 +01:00
ydshieh b4fe856387 update 2024-03-13 14:38:58 +01:00
ydshieh 4c4010b136 update 2024-03-13 14:38:58 +01:00
ydshieh dc47f69d3f update 2024-03-13 14:38:58 +01:00
ydshieh 28aedd00b7 benchmark 2024-03-13 14:38:58 +01:00
Younes Belkada 9acce7de1c
Core: Fix copies on main (#29624)
fix fix copies
2024-03-13 09:16:59 +01:00
bytebarde be3fd8a262
[Flash Attention 2] Add flash attention 2 for GPT-J (#28295)
* initial implementation of flash attention for gptj

* modify flash attention and overwrite test_flash_attn_2_generate_padding_right

* update flash attention support list

* remove the copy line in the `CodeGenBlock`

* address copy mechanism

* Update src/transformers/models/gptj/modeling_gptj.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Add GPTJ attention classes

* add expected outputs in the gptj test

* Ensure repo consistency with 'make fix-copies'

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-03-13 08:43:00 +01:00
Younes Belkada d522afea13
[`Gemma`] Supports converting directly in half-precision (#29529)
* Update convert_gemma_weights_to_hf.py

* Update src/transformers/models/gemma/convert_gemma_weights_to_hf.py

* fixup
2024-03-12 22:44:49 +01:00
Joao Gante d47966536c
Examples: check `max_position_embeddings` in the translation example (#29600)
check max_position_embeddings
2024-03-12 18:58:12 +00:00
Bharat Ramanathan 6b660d5ed5
Fix: handle logging of scalars in Weights & Biases summary (#29612)
fix: handle logging of scalars in wandb summary

fixes:  #29430
2024-03-12 18:26:09 +00:00
Raushan Turganbay 8e64ba2890
Add tests for batching support (#29297)
* add tests for batching support

* Update src/transformers/models/fastspeech2_conformer/modeling_fastspeech2_conformer.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/models/fastspeech2_conformer/modeling_fastspeech2_conformer.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update tests/test_modeling_common.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update tests/test_modeling_common.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update tests/test_modeling_common.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* fixes and comments

* use cosine distance for conv models

* skip mra model testing

* Update tests/models/vilt/test_modeling_vilt.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* finzalize  and make style

* check model type by input names

* Update tests/models/vilt/test_modeling_vilt.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fixed batch size for all testers

* Revert "fixed batch size for all testers"

This reverts commit 525f3a0a05.

* add batch_size for all testers

* dict from model output

* do not skip layoutlm

* bring back some code from git revert

* Update tests/test_modeling_common.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/test_modeling_common.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* clean-up

* where did minus go in tolerance

* make whisper happy

* deal with consequences of losing minus

* deal with consequences of losing minus

* maskformer needs its own test for happiness

* fix more models

* tag flaky CV models from Amy's approval

* make codestyle

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-03-12 17:46:19 +00:00
Furkan Akkurt 11163fff58
Fix typo ; Update quantization.md (#29615)
Update quantization.md
2024-03-12 16:32:50 +00:00
Yih-Dar a15bd3af4e
Update flava tests (#29611)
* update

* update

* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-03-12 17:04:53 +01:00
Matt df1542581e
Set env var to hold Keras at Keras 2 (#29598)
* Set env var to hold Keras at Keras 2

* Add Amy's update

* make fixup

* Use a warning instead
2024-03-12 13:49:57 +00:00
Hilco van der Wilk b6404866cd
Update legacy Repository usage in various example files (#29085)
* Update legacy Repository usage in `examples/pytorch/text-classification/run_glue_no_trainer.py`

Marked for deprecation here https://huggingface.co/docs/huggingface_hub/guides/upload#legacy-upload-files-with-git-lfs

* Fix import order

* Replace all example usage of deprecated Repository

* Fix remaining repo call and rename args variable

* Revert removing creation of gitignore files and don't change research examples
2024-03-12 13:20:49 +00:00
tomigee f1a565a39f
Implemented add_pooling_layer arg to TFBertModel (#29603)
Implemented add_pooling_layer argument
2024-03-12 13:01:55 +00:00
Kola 50ec493363
Fix typo (determine) (#29606)
* Fix type (determine)

* ruff

* Update src/transformers/models/mamba/configuration_mamba.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-03-12 12:56:51 +00:00
Matt 81ec8028f9
Stop passing None to compile() in TF examples (#29597)
* Fix examples to stop passing None to compile(), rework example invocation for run_text_classification.py

* Add Amy's fix
2024-03-12 12:22:29 +00:00
Dries Verachtert 73efe896df
Fix minor typo: softare => software (#29602) 2024-03-12 10:39:56 +00:00
Raushan Turganbay 6cc5411d81
Fix Fuyu doc typos (#29601)
fix fuyu docs
2024-03-12 10:16:21 +00:00
Pedro Cuenca b382a09e28
Experimental loading of MLX files (#29511)
* Experimental loading of MLX files

* Update exception message

* Add test

* Style

* Use model from hf-internal-testing
2024-03-11 18:42:06 +00:00