transformers

Commit Graph

Author	SHA1	Message	Date
ydshieh	6ad1e309b5	update	2024-03-20 15:47:57 +01:00
ydshieh	0670a0110b	update	2024-03-20 15:03:41 +01:00
ydshieh	185435c607	update	2024-03-20 11:47:28 +01:00
ydshieh	5f98ee7b46	update	2024-03-20 11:22:22 +01:00
ydshieh	ff3b978e5a	update	2024-03-20 10:38:13 +01:00
ydshieh	472ce78baf	update	2024-03-18 10:48:09 +01:00
ydshieh	2a04bd3386	update	2024-03-18 10:27:09 +01:00
ydshieh	93d6ccaae0	update	2024-03-18 10:26:27 +01:00
ydshieh	f25b457534	update	2024-03-18 10:18:11 +01:00
ydshieh	a5070e90af	update	2024-03-18 10:17:29 +01:00
ydshieh	751c4a4f86	update	2024-03-18 10:17:02 +01:00
ydshieh	ec2a34a5af	update	2024-03-18 10:16:23 +01:00
ydshieh	35ff6456c2	update	2024-03-18 10:11:20 +01:00
ydshieh	cd2db2fdec	update	2024-03-15 18:42:58 +01:00
ydshieh	bb5da0421d	update	2024-03-15 18:26:19 +01:00
ydshieh	01f89c9256	update	2024-03-15 17:07:50 +01:00
ydshieh	65224b5357	update	2024-03-15 16:54:38 +01:00
Yih-Dar	37e178847a	Update src/transformers/benchmark/benchmark_utils_generic.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-03-13 19:38:48 +01:00
Yih-Dar	47837ddd28	Update src/transformers/benchmark/from_pretrained_benchmark.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-03-13 19:38:36 +01:00
Yih-Dar	e825a4169e	Update src/transformers/benchmark/from_pretrained_benchmark.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-03-13 19:38:12 +01:00
ydshieh	d556c587fa	update	2024-03-13 19:23:09 +01:00
ydshieh	f9a86a78f1	update	2024-03-13 14:38:59 +01:00
ydshieh	324ef81649	update	2024-03-13 14:38:59 +01:00
ydshieh	2eb4a9d45b	save	2024-03-13 14:38:59 +01:00
ydshieh	d7dfdf7281	save	2024-03-13 14:38:59 +01:00
ydshieh	335d241be1	save	2024-03-13 14:38:59 +01:00
ydshieh	101b639651	style	2024-03-13 14:38:59 +01:00
ydshieh	b00a094787	fix	2024-03-13 14:38:59 +01:00
ydshieh	c27bdbe8e3	rename classes	2024-03-13 14:38:59 +01:00
ydshieh	5a7bf5c937	rename	2024-03-13 14:38:59 +01:00
ydshieh	b4fe856387	update	2024-03-13 14:38:58 +01:00
ydshieh	4c4010b136	update	2024-03-13 14:38:58 +01:00
ydshieh	dc47f69d3f	update	2024-03-13 14:38:58 +01:00
ydshieh	28aedd00b7	benchmark	2024-03-13 14:38:58 +01:00
Younes Belkada	9acce7de1c	Core: Fix copies on main (#29624 ) fix fix copies	2024-03-13 09:16:59 +01:00
bytebarde	be3fd8a262	[Flash Attention 2] Add flash attention 2 for GPT-J (#28295 ) * initial implementation of flash attention for gptj * modify flash attention and overwrite test_flash_attn_2_generate_padding_right * update flash attention support list * remove the copy line in the `CodeGenBlock` * address copy mechanism * Update src/transformers/models/gptj/modeling_gptj.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Add GPTJ attention classes * add expected outputs in the gptj test * Ensure repo consistency with 'make fix-copies' --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-03-13 08:43:00 +01:00
Younes Belkada	d522afea13	[`Gemma`] Supports converting directly in half-precision (#29529 ) * Update convert_gemma_weights_to_hf.py * Update src/transformers/models/gemma/convert_gemma_weights_to_hf.py * fixup	2024-03-12 22:44:49 +01:00
Joao Gante	d47966536c	Examples: check `max_position_embeddings` in the translation example (#29600 ) check max_position_embeddings	2024-03-12 18:58:12 +00:00
Bharat Ramanathan	6b660d5ed5	Fix: handle logging of scalars in Weights & Biases summary (#29612 ) fix: handle logging of scalars in wandb summary fixes: #29430	2024-03-12 18:26:09 +00:00
Raushan Turganbay	8e64ba2890	Add tests for batching support (#29297 ) * add tests for batching support * Update src/transformers/models/fastspeech2_conformer/modeling_fastspeech2_conformer.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/fastspeech2_conformer/modeling_fastspeech2_conformer.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update tests/test_modeling_common.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update tests/test_modeling_common.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update tests/test_modeling_common.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * fixes and comments * use cosine distance for conv models * skip mra model testing * Update tests/models/vilt/test_modeling_vilt.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * finzalize and make style * check model type by input names * Update tests/models/vilt/test_modeling_vilt.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * fixed batch size for all testers * Revert "fixed batch size for all testers" This reverts commit `525f3a0a05`. * add batch_size for all testers * dict from model output * do not skip layoutlm * bring back some code from git revert * Update tests/test_modeling_common.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update tests/test_modeling_common.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * clean-up * where did minus go in tolerance * make whisper happy * deal with consequences of losing minus * deal with consequences of losing minus * maskformer needs its own test for happiness * fix more models * tag flaky CV models from Amy's approval * make codestyle --------- Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-03-12 17:46:19 +00:00
Furkan Akkurt	11163fff58	Fix typo ; Update quantization.md (#29615 ) Update quantization.md	2024-03-12 16:32:50 +00:00
Yih-Dar	a15bd3af4e	Update flava tests (#29611 ) * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-03-12 17:04:53 +01:00
Matt	df1542581e	Set env var to hold Keras at Keras 2 (#29598 ) * Set env var to hold Keras at Keras 2 * Add Amy's update * make fixup * Use a warning instead	2024-03-12 13:49:57 +00:00
Hilco van der Wilk	b6404866cd	Update legacy Repository usage in various example files (#29085 ) * Update legacy Repository usage in `examples/pytorch/text-classification/run_glue_no_trainer.py` Marked for deprecation here https://huggingface.co/docs/huggingface_hub/guides/upload#legacy-upload-files-with-git-lfs * Fix import order * Replace all example usage of deprecated Repository * Fix remaining repo call and rename args variable * Revert removing creation of gitignore files and don't change research examples	2024-03-12 13:20:49 +00:00
tomigee	f1a565a39f	Implemented add_pooling_layer arg to TFBertModel (#29603 ) Implemented add_pooling_layer argument	2024-03-12 13:01:55 +00:00
Kola	50ec493363	Fix typo (determine) (#29606 ) * Fix type (determine) * ruff * Update src/transformers/models/mamba/configuration_mamba.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-03-12 12:56:51 +00:00
Matt	81ec8028f9	Stop passing None to compile() in TF examples (#29597 ) * Fix examples to stop passing None to compile(), rework example invocation for run_text_classification.py * Add Amy's fix	2024-03-12 12:22:29 +00:00
Dries Verachtert	73efe896df	Fix minor typo: softare => software (#29602 )	2024-03-12 10:39:56 +00:00
Raushan Turganbay	6cc5411d81	Fix Fuyu doc typos (#29601 ) fix fuyu docs	2024-03-12 10:16:21 +00:00
Pedro Cuenca	b382a09e28	Experimental loading of MLX files (#29511 ) * Experimental loading of MLX files * Update exception message * Add test * Style * Use model from hf-internal-testing	2024-03-11 18:42:06 +00:00

15369 Commits