transformers

Commit Graph

Author	SHA1	Message	Date
Sylvain Gugger	94056b57be	New version of Accelerate for the Trainer (#23204 )	2023-05-08 09:47:08 -04:00
Sylvain Gugger	fd6970bc56	Skip failing test	2023-05-08 08:52:44 -04:00
Orr Zohar	843fdf2e42	Fixing class embedding selection in owl-vit (#23157 ) fixing class embedding selection in owl-vit	2023-05-08 07:35:04 -04:00
Joao Gante	bbfb9fc22b	Generate: starcoder 🤜 🤛 assisted generation (#23182 ) * starcoder has joined the chat * indexing that works for all	2023-05-08 10:45:40 +01:00
Robert Baruch	dbc12269ed	Fix hf_argparser.parse_json_file to open file with utf-8 encoding, close file when finished (#23194 ) * Open json args in utf-8 encoding, close file when finished * black formatted	2023-05-07 19:06:24 -04:00
Bartosz Szmelczynski	6f8a02844a	fix random attention for pytorch's bigbird/pegasus_bigbird (#23056 ) * fix random attention usage for bigbird and pegasus_bigbird * remove staticmethod, update tests target valus * revert style changes	2023-05-07 18:55:04 -04:00
Ashwin Mathur	ef0c380c12	Update LLaMA docs with arxiv link (#23191 ) * Update docs with arxiv link * Update llama model docs	2023-05-07 18:52:44 -04:00
cyy	ef42c2c487	search buffers for dtype (#23159 )	2023-05-06 11:41:08 -04:00
raghavanone	312b104ff6	Add FlaxWhisperForAudioClassification model (#23173 ) * Add FlaxWhisperForAudioClassification model * Add models to init * Add models to init * Fix copies * Fix automapping * Fix failing test	2023-05-05 13:23:46 -04:00
Ashwin Mathur	fc6c8b0eaa	Add `no_trainer` scripts to pre-train Vision Transformers (#23156 ) * Add run_mim_no_trainer.py draft from #20412 Add parse_args method and copy over other dependencies Add Method call for sending telemetry Initialize Accelerator Make one log on every process Set seed and Handle repository creation Initialize dataset and Set validation split Create Config Adapt Config Update Config Create Feature Extractor Create model Set column names Create transforms Create mask generator Create method to preprocess images Shuffle datasets if needed and set transforms Create Dataloaders Add optimizer Add learning rate scheduler Prepare everything with our accelerator Tie weights for TPU training Recalculate training steps and training epochs Set accelerator checkpointing steps Initialize trackers and store configuration Set total batch size Fix typo: mlm -> mim Log info at the start of training Load in the weights and states from previous save update the progress_bar if load from checkpoint Define train loop Add evaluation loop to training Add to parse_args method Push repo to hub Save accelerator state End training and save model and feature extractor Remove unused imports Fix trailing whitespace * Update code based on comments, Rename feature_extractor to image_processor * Fix linting * Add argument for learning rate * Add argument for setting number of training epochs * Remove incorrect logger argument * Convert max_train_steps to int for tqdm --------- Co-authored-by: Saad Mahmud <shuvro.mahmud79@gmail.com>	2023-05-05 13:22:49 -04:00
Connor Henderson	17083b9b84	fix: Passing language as acronym to Whisper generate (#23141 ) * add fix * address comments * remove error formatting	2023-05-05 11:52:19 -04:00
Gabriel Yang	40082d598b	🌐 [i18n-KO] docs: ko: Translate `multiple_choice.mdx` (#23064 ) * update doctree * doc: ko: translate multiple choice * Update reviews	2023-05-05 11:36:56 -04:00
Andrei Filatov	77412343c8	fixed whisper positional encoding (#23167 )	2023-05-05 11:36:15 -04:00
Perry Huang	1b9c352e55	Add TrOCR resources (#23142 ) * Add TrOCR resources * Made fixes suggested by stevhliu	2023-05-05 11:29:20 -04:00
Sylvain Gugger	01734dba84	Revert "Add FlaxWhisperForAudioClassification model" (#23154 ) Revert "Add FlaxWhisperForAudioClassification model (#22883)" This reverts commit `c8f2c5c56e`.	2023-05-04 13:47:07 -04:00
Joao Gante	b369e507aa	Generate: text generation pipeline no longer emits `max_length` warning when it is not set (#23139 )	2023-05-04 18:36:23 +01:00
Maria Khalusova	516dc6305f	[docs] Text to speech task guide (#23107 ) * First draft * Some polishing * Text polishing * added TOC entry for TTS * make style * added links to images * fixed links to images * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * feedback addressed * feedback from Matthijs addresed * Update docs/source/en/tasks/text-to-speech.mdx Co-authored-by: Matthijs Hollemans <mail@hollance.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Matthijs Hollemans <mail@hollance.com>	2023-05-04 13:17:13 -04:00
raghavanone	c8f2c5c56e	Add FlaxWhisperForAudioClassification model (#22883 ) * Add FlaxWhisperForAudioClassification model * Add models to init * Add models to init * Fix copies * Fix automapping	2023-05-04 13:00:16 -04:00
Sylvain Gugger	3341bb41cd	Pin urllib3	2023-05-04 12:00:22 -04:00
Younes Belkada	57ffd8ab4c	[`GPT-J`] Fix causal mask dtype (#23147 ) * fix #23136 * better fix * same fix for `masked_bias`	2023-05-04 16:31:19 +02:00
peter-sk	83b38fbea8	GPTNeoXForQuestionAnswering (#23059 ) * first draft - gives index error in question_answering.py * maturing * no labels * pipeline should know about QA * fixing checks * formatting * fixed docstring * initial commit * formatting * adding the class to many places * towards less unhappy checks * nearly there * and gpt neox for qa * use right model * forgot this one * base_model_prefix is "gpt_neox" for GPTNeoX* models * unnecessary stuff * Update src/transformers/models/gpt_neox/modeling_gpt_neox.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * format * Update src/transformers/models/gpt_neox/modeling_gpt_neox.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * removed gpt2 stuff --------- Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-05-04 10:15:15 -04:00
peter-sk	510ad0a8b8	gpt2 multi-gpu fix (#23149 ) Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>	2023-05-04 09:58:38 -04:00
Qingyang Wu	adb0760b5f	fix resume fsdp (#23111 ) * fix resume fsdp * fix rank 0 loading * fix style and quality	2023-05-04 09:57:32 -04:00
Victor Geislinger	3b74889e8f	Remove typo in perf_train_gpu_many.mdx (#23144 ) - Excess `w` in the word `bottom`	2023-05-04 09:56:45 -04:00
digger-yu	5eeb556484	fix spelling error (#23143 ) change referrred to referred	2023-05-04 09:56:28 -04:00
amyeroberts	90e8263d91	Add methods to update and verify out_features out_indices (#23031 ) * Add methods to update and verify out_features out_indices * Safe update for config attributes * Fix function names * Save config correctly * PR comments - use property setters * PR comment - directly set attributes * Update test * Add updates to recently merged focalnet backbone	2023-05-04 10:15:06 +01:00
peter-sk	78b7debf56	GPTNeoForQuestionAnswering (#23057 ) * first draft - gives index error in question_answering.py * maturing * no labels * pipeline should know about QA * fixing checks * formatting * fixed docstring * initial commit * formatting * adding the class to many places * towards less unhappy checks * nearly there * Update src/transformers/models/gpt_neo/modeling_gpt_neo.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * avoid error * moving to device of star/end_logits --------- Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-05-03 15:59:19 -04:00
Robert Stone	b6933d76d2	Tidy Pytorch GLUE benchmark example (#23134 ) Migration to Evaluate for metric is not quite complete	2023-05-03 15:50:41 -04:00
Alara Dirik	b0a78091a5	Remove redundant print statements (#23133 ) remove redundant print statements	2023-05-03 18:04:48 +01:00
regisss	e3ee45aa54	Enable to use custom tracer in FX `symbolic_trace` (#23105 ) * Enable to use custom tracer in FX `symbolic_trace` * Integrate feedback from review * Formatting Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-05-03 12:47:36 -04:00
Alara Dirik	441658dd6c	Add focalnet backbone (#23104 ) Adds FocalNet backbone to return features from all stages	2023-05-03 19:32:42 +03:00
Julien Chaumond	ca7eb27ed5	[doc] Try a few ≠ ways of linking to Papers, users, and org profiles (#22611 ) * [doc] Try a few ≠ ways of linking to Papers, users, and org profiles * Empty commit * Empty commit now that the backend is fixed --------- Co-authored-by: Lysandre <lysandre@huggingface.co>	2023-05-03 18:23:09 +02:00
Nayeon Han	fbe0178f08	docs: ko: update `_toctree.yml` (#23112 ) * docs: ko: update `_toctree.yml` * fix: ko: update toc * fix: resolve suggestions * fix: resolve build issue --------- Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>	2023-05-03 11:04:58 -04:00
Mayank Agarwal	c4e32e206f	Add support for beam search's num_return_sequencs flag in flax (#23082 ) * add code for numReturnSeq * add flax support for num return sequences * Make Fix up for changes * add test for num return sequences * lint	2023-05-03 10:50:34 -04:00
Xuehai Pan	ee4bc07474	Support union types `X | Y` syntax for `HfArgumentParser` for Python 3.10+ (#23126 ) * Support union types `X | Y` syntax for `HfArgumentParser` for Python 3.10+ * Add tests for PEP 604 for `HfArgumentParser` * Reorganize tests	2023-05-03 10:49:54 -04:00
Alara Dirik	56b8d49ddf	Fix ConvNext V2 paramater naming issue (#23122 ) Fixes the parameter naming issue in ConvNextV2GRN module	2023-05-03 17:21:27 +03:00
Samin Yasar	b53004fdce	Add resources for LayoutLmV2 and reformat documentation resources (#23115 ) * add resources for layoutlmv2 * remove 🌎 from some resources	2023-05-03 09:53:00 -04:00
Joao Gante	3a08dc63fd	Generate: better warnings with pipelines (#23128 )	2023-05-03 14:43:17 +01:00
Manuel	2a16d8b275	improve unclear documentation (#23123 )	2023-05-03 09:36:30 -04:00
Joao Gante	a0bd464776	Generate: correct beam search length on score calculation for multi batch generation (#23127 )	2023-05-03 14:29:55 +01:00
Joao Gante	ce31e3c8bf	Generate: slow assisted generation test (#23125 )	2023-05-03 14:24:50 +01:00
Younes Belkada	b61d5b47f6	[`Doctest`] Fix pix2struct doctest (#23121 ) fix pix2struct doctest	2023-05-03 11:21:59 +02:00
Sylvain Gugger	4b6aecb48e	Pin numba for now (#23118 )	2023-05-02 22:02:39 -04:00
Gregory (Gabriel) Barello	3ff89f29f5	Fixed default config for `Pix2Struct` model to set `Pix2StructTextModel` to `is_decoder=True` (#23051 ) added as default keyword arg. to in order to correctly configure the decoder	2023-05-02 13:40:41 -04:00
Alex Punnen	805db1fe13	num_noise_spans should be <= num_items #22246 (#22938 )	2023-05-02 13:07:30 -04:00
Michael Benayoun	9ade58f055	[ONNX] Sam fix (#23110 ) * [WIP] Fix for the ONNX export * Apply changes * Remove commented code * Resolve todo * empty -> zeros * fix slow tests --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com>	2023-05-02 17:20:02 +02:00
Younes Belkada	4baa34c18f	[`Flava`] Fix flava `torch.distributed.nn.functional import all_gather` issue (#23108 ) * fix flava `torch.distributed.nn.functional import all_gather` issue * more comments	2023-05-02 15:35:57 +02:00
Wing Lian	c6c6658499	Fix check for backword_pos (#23075 )	2023-05-02 09:32:42 -04:00
Sohyun Sim	f31a510bb3	🌐 [i18n-KO] Translated `torchscript.mdx` to Korean (#23060 ) * docs: ko: torchscript.mdx * feat: gpt and deepl draft * fix: manual edits * fix: edit anchor link * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> * fix: resolve suggestions --------- Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>	2023-05-02 09:27:59 -04:00
peter-sk	2b0c924568	GPT2ForQuestionAnswering (#23030 ) * first draft - gives index error in question_answering.py * maturing * no labels * pipeline should know about QA * fixing checks * formatting * fixed docstring * make sure legacy code executes * comment * like this --------- Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>	2023-05-02 09:25:46 -04:00


12846 Commits, All Branches
