transformers

Commit Graph

Author	SHA1	Message	Date
Anton Lozhkov	196cce6e9b	Add a device argument to the eval script (#15371 ) * Device argument for the eval script * Default to none * isort	2022-01-27 15:58:55 +01:00
François REMY	19732cc07a	Fix 'eval_split_name' described as defaulting to 'train' (#15348 ) The default is correct (`test`) but the description is not.	2022-01-26 10:19:38 -05:00
Patrick von Platen	457dd4392b	[Examples] Correct run ner label2id for fine-tuned models (#15017 ) * up * up * make style * apply sylvains suggestions * apply changes to accelerate as well * more changes * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-01-24 21:18:04 +01:00
Patrick von Platen	4bf97415a4	Update eval.py (#15310 )	2022-01-24 11:46:38 +01:00
Sylvain Gugger	4cff3fae11	Second failing test	2022-01-21 12:19:28 -05:00
Sylvain Gugger	f6253147df	Skip failing test	2022-01-21 12:03:21 -05:00
Patrick von Platen	11afb709ec	[Robust Speech Challenge] Add timeline (#15274 )	2022-01-21 17:12:09 +01:00
lewtun	833635e259	Move BART + ONNX example to research_projects (#15271 ) * Move BART + ONNX example to research_projects * Add author information	2022-01-21 14:47:34 +01:00
NielsRogge	6c7b68d414	[ViTMAE] Add image pretraining script (#15242 ) * Add script * Improve script * Fix data collator * Update README * Add label_names argument * Apply suggestions from code review * Add config parameters * Update script * Fix bug * Improve README * Improve README and add test * Fix import * Add image_column_name	2022-01-21 12:11:08 +01:00
Anton Lozhkov	85ea462c08	Update README.md (#15246 ) Clarify OVH instruction	2022-01-20 13:40:26 +03:00
Anton Lozhkov	e57468b8a8	Update README.md (#15239 ) Add an OVHcloud tutorial URL for the Robust Speech Challenge	2022-01-20 11:46:50 +03:00
Patrick von Platen	691878ee2f	Update README.md (#15233 )	2022-01-19 18:03:17 +01:00
Suraj Patil	2a5a384970	fix speech event readme (#15227 )	2022-01-19 15:30:03 +01:00
Patrick von Platen	6d92c429c7	Update README.md (#15226 )	2022-01-19 15:23:00 +01:00
Patrick von Platen	19c217b4b7	Update README.md	2022-01-19 15:21:03 +01:00
Patrick von Platen	5439cda7f0	Update README.md	2022-01-19 15:19:57 +01:00
Kamal Raj	d1f5ca1afd	[FLAX] glue training example refactor (#13815 ) * refactor run_flax_glue.py * updated readme * rm unused import and args typo fix * refactor * make consistent arg name across task * has_tensorboard check * argparse -> argument dataclasses * refactor according to review * fix	2022-01-19 12:04:51 +01:00
Patrick von Platen	e118e085ea	[Robust Speech Event] Add guides (#15155 ) * up * improve readme * up * up * more info * up * up * Apply suggestions from code review Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * add more stuff for eval * update * up * Update README.md * Update examples/research_projects/xls_r/README.md Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> * apply omar's suggestions Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com>	2022-01-18 18:44:48 +01:00
Sylvain Gugger	6f0a9b41ef	Remove dependency to quiet Dependabot (#15205 )	2022-01-18 09:44:35 -05:00
Sylvain Gugger	531336bbfd	Fix deprecation warnings for int div (#15180 ) * Fix deprecation warnings for int div Co-authored-by: mgoldey <matthew.goldey@gmail.com> * Fix import * ensure that tensor output is python scalar * make backward compatible * make code more readable * adapt test functions Co-authored-by: mgoldey <matthew.goldey@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-01-18 07:28:53 -05:00
Sylvain Gugger	96881729ce	Remove assert on optional arg	2022-01-13 17:34:41 -05:00
Stas Bekman	762416ffa8	[examples/flax/language-modeling] set loglevel (#15129 )	2022-01-13 15:17:28 +01:00
Edoardo Federici	9a94bb8e21	mBART support for run_summarization.py (#15125 ) * Update run_summarization.py * Fixed languages and added missing code * fixed obj, docs, removed source_lang and target_lang * make style, run_summarization.py reformatted	2022-01-12 16:39:33 -05:00
Leandro von Werra	aa0135f2e0	fix: switch from slow to generic tokenizer class (#15122 )	2022-01-12 09:12:43 -05:00
Russell Klopfer	27b819b0e3	use block_size instead of max_seq_length in tf run_clm example (#15036 ) * use block_size instead of max_seq_length * fixup * remove pad_to_block_size Co-authored-by: Russell Klopfer <russell@kloper.us>	2022-01-12 08:57:00 -05:00
Patrick von Platen	d72343d2b8	[Wav2Vec2 Speech Event] Add speech event v2 (#15083 ) * up * up * up * up * up * up * improve * up * up * Update src/transformers/trainer.py * up * up * up	2022-01-10 10:46:21 +01:00
flozi00	b67f345d00	Update run_speech_recognition_seq2seq.py (#14967 )	2022-01-06 19:26:45 +03:00
Yih-Dar	9f89fa02ed	Add Flax image captioning example (#14864 ) * add image captioning example * update README * fix style & quality * simplify * apply review suggestions * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * Apply review suggestions * add comments about using np instead jax array * remove unused lines * add model creation script * only support from_pretrained * fix style * fix * not use cache_dir when creating model * fix tokenizer creation * update README * fix quality * apply suggestion * simplify some blocks * Update examples/flax/image-captioning/README.md * Update examples/flax/image-captioning/run_image_captioning_flax.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * apply suggestion Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-01-06 14:00:54 +01:00
flozi00	774ed4a027	Fix Code block (#14983 )	2022-01-04 12:59:20 +01:00
Patrick von Platen	600496fa50	[Wav2Vec2] Rename model's feature extractor to feature encoder (#14959 ) * rename classes * clean up more namings * remove bogus file * Apply suggestions from code review * Apply suggestions from code review * replace more names * more regex replace * make style * correct * correct more * make style * finish * correct more in wav2vec2 * make style * improve freeze_extractor * add aliases * add tf aliases	2021-12-28 20:33:23 +01:00
Patrick von Platen	f80775df2b	Update README.md (#14965 )	2021-12-28 13:41:27 +01:00
Patrick von Platen	1c121916f3	Add Speech Seq2Seq Training script (#14792 ) * start * add gradient checkpointing and feature extractor freezing * Apply suggestions from code review * up * up * up * correct * up * more changes * up * up * up * remove rst	2021-12-28 10:20:51 +01:00
Leandro von Werra	1d651868d6	add custom stopping criteria to human eval script (#14897 )	2021-12-23 14:59:11 +01:00
lewtun	355dc0ce67	Fix installation instructions for BART ONNX example (#14885 )	2021-12-23 04:05:32 -05:00
Patrick von Platen	fa39ff9fc4	Docs for v4.16.0dev0	2021-12-22 20:39:44 +01:00
Patrick von Platen	05fa1a7ac1	Release: v4.15.0	2021-12-22 18:43:15 +01:00
Mario Šaško	1045a36c1f	Fix pytorch image classification example (#14883 ) * Update example * Remove skip in tests	2021-12-22 14:42:19 +01:00
Sylvain Gugger	e51c7b5872	Skip failing test	2021-12-21 15:15:17 -05:00
Stas Bekman	033c3ed95a	[examples/summarization] deal with None in data records (#14816 ) * [examples/summarization] deal with None in data records * rewrite to use a simpler (slower) variant	2021-12-21 09:17:28 -08:00
Patrick von Platen	7ae6f07004	[ASR example] Improve example + add more examples (#14848 ) * up * load up * up	2021-12-21 13:12:22 +01:00
Patrick von Platen	c4a96cecbc	Wav2Vec2 meets phonemes (#14353 ) * up * add tokenizer * improve more * finish tokenizer * finish * adapt speech recognition script * adapt convert * more fixes * more fixes * update phonemizer wav2vec2 * better naming * fix more tests * more fixes swedish * correct tests * finish * improve script * remove file * up * lets get those 100 model architectures until the end of the month * make fix-copies * correct more * correct script * more fixes * more fixes * add to docs * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * replace assert * fix copies * fix docs * new try docs * boom boom * update * add phonemizer to audio tests * make fix-copies * up * upload models * some changes * Update tests/test_tokenization_wav2vec2_phoneme.py Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * more fixes * remove @ Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>	2021-12-17 19:56:44 +01:00
Lysandre	7c9c41f43c	Docs for v4.14.0	2021-12-15 18:29:53 +01:00
Lysandre	960d8cb41d	Release: v4.14.0	2021-12-15 18:20:35 +01:00
Yih-Dar	a94105f95f	Fix preprocess_function in run_summarization_flax.py (#14769 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2021-12-15 11:36:28 +01:00
Benjamin Minixhofer	2a606f9974	Make data shuffling in `run_clm_flax.py` respect global seed (#13410 ) * use jax and jnp instead of numpy in data_loader * return batches as np.ndarray	2021-12-14 11:04:43 +01:00
Josué Nascimento	971e36667a	Change how to load config of XLNetLMHeadModel (#14746 )	2021-12-13 12:34:26 -05:00
Nathan Cooper	48bf7e47a0	Code parrot minor fixes/niceties (#14666 ) * Add some nicety flags for better controlling evaluation. * Fix dependency issue with outdated requirement * Add additional flag to example to ensure eval is done * Wrap code into main function for accelerate launcher to find * Fix valid batch size flag in readme * Add note to install git-lfs when initializing/training the model * Update examples/research_projects/codeparrot/scripts/arguments.py Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> * Update examples/research_projects/codeparrot/README.md Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> * Revert "Wrap code into main function for accelerate launcher to find" This reverts commit `ff11df1c81`. * Fix formatting issue * Move git-lfs instructions to installation section * Add a quick check before code generation for code evaluation * Fix styling issue * Update examples/research_projects/codeparrot/scripts/human_eval.py Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> * Make iterable dataset use passed in tokenizer rather than globally defined one Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> Co-authored-by: ncoop57 <nac33@students.uwf.edu>	2021-12-13 09:30:50 +01:00
Suraj Patil	6a025487a6	[Flax examples] remove dependancy on pytorch training args (#14636 ) * use custom training arguments * update tests	2021-12-12 09:19:12 +05:30
Lysandre	ab31b3e41b	Docs for v4.14.0dev0	2021-12-09 17:09:23 +01:00
Lysandre	4da3a696e4	Release: v4.13.0	2021-12-09 16:55:21 +01:00

1 2 3 4 5 ...

1910 Commits