transformers

Commit Graph

Author	SHA1	Message	Date
ydshieh	48681e6e5e	build	2024-05-31 17:21:06 +02:00
ydshieh	a51a3fbc48	build	2024-05-31 17:12:01 +02:00
ydshieh	4da667edfd	build	2024-05-31 15:35:19 +02:00
ydshieh	65bce8f0f9	build	2024-05-31 14:25:07 +02:00
ydshieh	4ef4539d89	build	2024-05-31 14:23:33 +02:00
ydshieh	0918d509b9	build	2024-05-31 14:18:01 +02:00
ydshieh	e3ed8d91c8	build	2024-05-31 14:01:49 +02:00
ydshieh	1c76378693	build	2024-05-31 13:55:26 +02:00
ydshieh	01f8532d51	build	2024-05-31 13:23:18 +02:00
ydshieh	57bf79956f	build	2024-05-31 12:38:55 +02:00
ydshieh	231aed279f	build	2024-05-31 12:25:56 +02:00
ydshieh	100e0dc336	build	2024-05-30 16:59:59 +02:00
ydshieh	8c2a653809	build	2024-05-30 16:46:19 +02:00
ydshieh	ef197e6197	benchmark workflow	2024-05-30 16:40:09 +02:00
ydshieh	edfc006d93	benchmark workflow	2024-05-30 16:29:58 +02:00
ydshieh	55e826eb02	benchmark workflow	2024-05-30 16:22:59 +02:00
ydshieh	058cecf0a2	benchmark workflow	2024-05-30 16:12:14 +02:00
Younes Belkada	5e5c4d629d	FIX / Quantization: Add extra validation for bnb config (#31135 ) add validation for bnb config	2024-05-30 11:45:03 +02:00
Yih-Dar	2b9e252b16	Cleanup docker build (#31119 ) * remove * build --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-05-29 19:43:51 +02:00
Dhruv Pai	5c88253556	Add on_optimizer_step to callback options (#31095 ) * Modified test * Added on_optimizer_step to callbacks * Move callback after step is called * Added on optimizer step callback	2024-05-29 16:20:59 +02:00
Joao Gante	4af705c6ce	Add VLM generation default contributor (#31115 ) * add Raushan * add Raushan	2024-05-29 15:17:14 +01:00
Younes Belkada	cb879c5801	FIX / Docs: Fix GPTQ expected number of bits (#31111 ) Update overview.md	2024-05-29 15:56:28 +02:00
Yih-Dar	1f84141391	Fix nightly circleci (#31114 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-05-29 15:42:39 +02:00
Zach Mueller	d16053c867	Rm maintainer + migrate (#31089 )	2024-05-29 09:35:37 -04:00
Matt	0bef4a2738	Fix faulty rstrip in module loading (#31108 )	2024-05-29 13:33:26 +01:00
Matt	97a58a5d2c	Fix env.py in cases where torch is not present (#31113 ) * Fix env.py in cases where torch is not present * Simplify the fix (and avoid some issues)	2024-05-29 13:20:36 +01:00
Huazhong Ji	c8861376ad	Improve `transformers-cli env` reporting (#31003 ) * Improve `transformers-cli env` reporting * move the line `"Using GPU in script?": "<fill in>"` to in if conditional statement * same option for npu	2024-05-29 11:57:54 +01:00
Lucain	c3044ec2f3	Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016 ) * Fix has_file in offline mode * harmonize env variable for offline mode * Switch to HF_HUB_OFFLINE * fix test * revert test_offline to test TRANSFORMERS_OFFLINE * Add new offline test * merge conflicts * docs	2024-05-29 11:55:43 +01:00
Younes Belkada	bfe6f513b9	FEAT: Add mistral v3 conversion script (#30981 ) * add mistral v3 conversion script * Update src/transformers/models/mistral/convert_mistral_weights_to_hf.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-05-29 11:43:54 +02:00
Raushan Turganbay	d521ba5797	Quantized KV cache: update quanto (#31052 ) * quanto latest version was refactored * add error msg * incorrect compare sign * Update src/transformers/cache_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-05-29 14:25:44 +05:00
amyeroberts	a564d10afe	Deprecate low use models (#30781 ) * Deprecate models - graphormer - time_series_transformer - xlm_prophetnet - qdqbert - nat - ernie_m - tvlt - nezha - mega - jukebox - vit_hybrid - x_clip - deta - speech_to_text_2 - efficientformer - realm - gptsan_japanese * Fix up * Fix speech2text2 imports * Make sure message isn't indented * Fix docstrings * Correctly map for deprecated models from model_type * Uncomment out * Add back time series transformer and x-clip * Import fix and fix-up * Fix up with updated ruff	2024-05-28 18:07:07 +01:00
Younes Belkada	7f08817be4	Docs / Quantization: Redirect deleted page (#31063 ) Update _redirects.yml	2024-05-28 18:29:22 +02:00
Younes Belkada	3264be4114	TST: Fix instruct-blip tests (#31088 ) * fix flan t5 tests * better format	2024-05-28 18:29:11 +02:00
Jonny Li	476890e9ae	Fix DeepSpeed compatibility with weight_norm (#30881 ) (#31018 )	2024-05-28 17:25:15 +01:00
Albert Villanova del Moral	aada568f73	Fix PretrainedConfig docstring with deprecated resume_download (#31014 )	2024-05-28 17:47:35 +02:00
Yih-Dar	3af7bf30ad	skip `test_multi_gpu_data_parallel_forward` for `vit` and `deit` (#31086 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-05-28 17:44:52 +02:00
Younes Belkada	ab19f907fd	FIX / OPT: Fix OPT multi-GPU training for `OPTForQuestionAnswering` (#31092 ) Update modeling_opt.py	2024-05-28 17:06:00 +02:00
Younes Belkada	94d416f018	FIX: Add `accelerate` as a hard requirement (#31090 ) add accelerate	2024-05-28 17:05:44 +02:00
Sigbjørn Skjæret	22dab246c5	Render chat template tojson filter as unicode (#31041 ) * Render chat template tojson filter as unicode * ruff--	2024-05-28 15:02:51 +01:00
Younes Belkada	4f98b14465	Docs / PEFT: Add PEFT API documentation (#31078 ) * add peft references * add peft references * Update docs/source/en/peft.md * Update docs/source/en/peft.md	2024-05-28 15:04:43 +02:00
Raushan Turganbay	779bc360ff	Watermark: fix tests (#30961 ) * fix tests * style * Update tests/generation/test_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-05-28 17:07:42 +05:00
Lysandre Debut	a3c7b59e31	Fix failing tokenizer tests (#31083 ) * Fix failing tokenizer tests * Use small tokenizer * Fix remaining reference	2024-05-28 13:34:23 +02:00
NielsRogge	90da0b1c9f	[SuperPoint, PaliGemma] Update docs (#31025 ) * Update docs * Add PaliGemma resources * Address comment * Update docs	2024-05-28 13:22:06 +02:00
Sina Taslimi	66add161dc	Fix typo in trainer.py (#31048 )	2024-05-28 12:09:32 +01:00
Pavel Iakubovskii	98e2d48e9a	Fix OWLv2 post_process_object_detection for multiple images (#31082 ) * Add test for multiple images * [run slow] owlv2 * Fix box rescaling * [run slow] owlv2	2024-05-28 12:06:06 +01:00
Pavel Iakubovskii	c31473ed44	Remove float64 cast for OwlVit and OwlV2 to support MPS device (#31071 ) Remove float64	2024-05-28 11:41:40 +01:00
oOraph	936ab7bae5	fix from_pretrained in offline mode when model is preloaded in cache (#31010 ) * Unit test to verify fix Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> * fix from_pretrained in offline mode when model is preloaded in cache Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> * minor: fmt Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> --------- Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> Co-authored-by: Raphael Glon <oOraph@users.noreply.github.com>	2024-05-28 11:56:05 +02:00
Hengwen Tong	537deb7869	Remove redundant backend checks in training_args.py (#30999 ) * Remove backend checks in training_args.py * Expilicit initialize the device --------- Co-authored-by: tonghengwen <tonghengwen@cambricon.com>	2024-05-28 11:52:47 +02:00
AP	dd4654eab7	Update quicktour.md to fix broken link to Glossary (#31072 ) Update quicktour.md to fix broken link Missing '/' in attention mask link in the transformers quicktour	2024-05-28 11:50:45 +02:00
Clint Adams	e18da4e3f2	fix "piano" typo (#31027 )	2024-05-28 11:48:23 +02:00

1 2 3 4 5 ...

16041 Commits All Branches Search

16041 Commits

All Branches