transformers

Commit Graph

Author	SHA1	Message	Date
Sylvain Gugger	c76de1053e	Add generate kwargs to Seq2SeqTrainingArguments (#13339 ) * Add generate kwargs to Seq2SeqTrainingArguments * typo * Address review comments + doc * Style	2021-08-31 08:42:00 -04:00
Matt	702f4a49cd	Fixed CLM model still using MODEL_FOR_MASKED_LM_MAPPING (#13002 )	2021-08-31 13:21:39 +01:00
Lysandre	aa08a34669	[Flax tests] NVIDIA-SMI failure should continue	2021-08-31 14:18:20 +02:00
Matt	854260ca44	TF/Numpy variants for all DataCollator classes (#13105 ) * Adding a TF variant of the DataCollatorForTokenClassification to get feedback * Added a Numpy variant and a post_init check to fail early if a missing import is found * Fixed call to Numpy variant * Added a couple more of the collators * Update src/transformers/data/data_collator.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fixes, style pass, finished DataCollatorForSeqToSeq * Added all the LanguageModeling DataCollators, except SOP and PermutationLanguageModeling * Adding DataCollatorForPermutationLanguageModeling * Style pass * Add missing `__call__` for PLM * Remove `post_init` checks for frameworks because the imports inside them were making us fail code quality checks * Remove unused imports * First attempt at some TF tests * A second attempt to make any of those tests actually work * TF tests, round three * TF tests, round four * TF tests, round five * TF tests, all enabled! * Style pass * Merging tests into `test_data_collator.py` * Merging tests into `test_data_collator.py` * Fixing up test imports * Fixing up test imports * Trying shuffling the conditionals around * Commenting out non-functional old tests * Completed all tests for all three frameworks * Style pass * Fixed test typo * Style pass * Move standard `__call__` method to mixin * Rearranged imports for `test_data_collator` * Fix data collator typo "torch" -> "pt" * Fixed the most embarrassingly obvious bug * Update src/transformers/data/data_collator.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Renaming mixin * Updating docs Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Dalton Walker <dalton_walker@icloud.com> Co-authored-by: Andrew Romans <andrew.romans@hotmail.com>	2021-08-31 13:06:48 +01:00
Sylvain Gugger	74b3344fbc	Clean up test file	2021-08-31 07:06:49 -04:00
Jongheon Kim	ef8d6f2b4a	Set missing seq_length variable when using inputs_embeds with ALBERT & Remove code duplication (#13152 ) * Set seq_length variable when using inputs_embeds * remove code duplication	2021-08-31 06:51:25 -04:00
Jake Tae	180c6de6a6	docs: fix minor typo (#13289 ) `at` should be `a1`	2021-08-31 06:49:05 -04:00
Stas Bekman	066fd047cc	correct TP implementation resources (#13248 ) fix a few implementation links	2021-08-31 06:47:23 -04:00
Sylvain Gugger	4d10474fa5	Handle nested dict/lists of tensors as inputs in the Trainer (#13338 )	2021-08-31 06:34:31 -04:00
Kamal Raj	3efcfeab67	Deberta_v2 tf (#13120 ) * Deberta_v2 tf * added new line at the end of file, make style * +V2, typo * remove never executed branch of code * rm cmnt and fixed typo in url filter * cleanup according to review comments * added #Copied from	2021-08-31 06:32:47 -04:00
Apoorv Garg	286ccefb48	doc mismatch fixed (#13345 )	2021-08-31 06:28:37 -04:00
tucan9389	41c559415a	Add GPT2ForTokenClassification (#13290 ) * Add GPT2ForTokenClassification * Fix dropout exception for GPT2 NER * Remove sequence label in test * Change TokenClassifierOutput to TokenClassifierOutputWithPast * Fix for black formatter * Remove dummy * Update docs for GPT2ForTokenClassification * Fix check_inits ci fail * Update dummy_pt_objects after make fix-copies * Remove TokenClassifierOutputWithPast * Fix tuple input issue Co-authored-by: danielsejong55@gmail.com <danielsejong55@gmail.com>	2021-08-31 12:19:04 +02:00
Serhiy-Shekhovtsov	11fbc32e3e	Fixing a typo in the data_collator documentation (#13309 )	2021-08-31 06:01:12 -04:00
Patrick von Platen	062300ba7f	[Testing] Add Flax Tests on GPU, Add Speech and Vision to Flax & TF tests (#13313 ) * up * finish * Apply suggestions from code review * apply Lysandres suggestions * adapt circle ci as well * finish * Update setup.py	2021-08-31 11:08:22 +02:00
Sylvain Gugger	8b2de0e483	Tests fetcher tests (#13340 ) * Incorporate tests dependencies in tests_fetcher * Harder modif * Debug * Loop through all files * Last modules * Remove debug statement	2021-08-31 03:57:01 -04:00
Olatunji Ruwase	42f359d015	Use DS callable API to allow hf_scheduler + ds_optimizer (#13216 ) * Use DS callable API to allow hf_scheduler + ds_optimizer * Preserve backward-compatibility * Restore backward compatibility * Tweak arg positioning * Tweak arg positioning * bump the required version * Undo indent * Update src/transformers/trainer.py * style Co-authored-by: Stas Bekman <stas@stason.org> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2021-08-30 10:01:06 -07:00
Laura Hanu	35236b870e	Add missing module __spec__ (#13321 ) * added missing __spec__ to _LazyModule * test __spec__ is not None after module import * changed module_spec arg to be optional in _LazyModule * fix style issue * added module spec test to test_file_utils	2021-08-30 12:39:05 -04:00
Sylvain Gugger	4ebe798ff2	Fix release utils (#13337 ) * Fix release utils * Update docs/source/conf.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-08-30 12:09:14 -04:00
Sylvain Gugger	c4ecd234f2	Fix AutoTokenizer when no fast tokenizer is available (#13336 ) * Fix AutoTokenizer when a tokenizer has no fast version * Add test	2021-08-30 11:55:18 -04:00
Li-Huai (Allan) Lin	ffecfea949	Correct wrong function signatures on the docs website (#13198 ) * Correct outdated function signatures on website. * Upgrade sphinx to 3.5.4 (latest 3.x) * Test * Test * Test * Test * Test * Test * Revert unnecessary changes. * Change sphinx version to 3.5.4" * Test python 3.7.11	2021-08-30 11:40:25 -04:00
Kamal Raj	98e409abb3	albert flax (#13294 ) * albert flax * year -> 2021 * docstring updated for flax * removed head_mask * removed from_pt * removed passing attention_mask to embedding layer	2021-08-30 17:29:27 +02:00
Ben Nimmo	ee5b24573b	the use_auth_token has not been set up early enough in the model_kwargs. Fixes #12941 (#13205 )	2021-08-30 11:19:50 -04:00
Maxwell Forbes	0305673098	Fall back to `observed_batch_size` when the `dataloader` does not know the `batch_size`. (#13188 )	2021-08-30 11:12:35 -04:00
Nathan Raw	ce6add8ecc	🐛 fix small model card bugs (#13310 ) * 🐛 fix small model card bugs * 💄 style	2021-08-30 08:45:57 -06:00
Sylvain Gugger	139e830158	Update label2id in the model config for run_glue (#13334 )	2021-08-30 10:35:09 -04:00
fcakyon	6f3c99acca	add ability to connect a neptune.ai run (#13319 ) when `NEPTUNE_RUN_ID` environmetnt variable is set, neptune will log into the previous run with id `NEPTUNE_RUN_ID`	2021-08-30 09:59:17 -04:00
Sylvain Gugger	f4f4e6b2d3	Use existing functionality for #13251 (#13333 )	2021-08-30 09:43:23 -04:00
Li-Huai (Allan) Lin	d50649531f	Check None before going through iteration (#13250 ) * Check None before going through iteration * Format	2021-08-30 08:18:51 -04:00
Kamal Raj	774760e6f3	distilbert-flax (#13324 ) * distilbert-flax * added missing self * docs fix * removed tied kernal extra init * updated docs * x -> hidden states * removed head_mask * removed from_pt, +FLAX * updated year	2021-08-30 14:16:18 +02:00
arfy slowy	01977466f4	fix: typo spelling grammar (#13212 ) * fix: typo spelling grammar * fix: make fixup	2021-08-30 08:09:14 -04:00
Navjot	ef83dc4f0c	Improve documentation of pooler_output in ModelOutput (#13228 ) * update documentation of pooler_output in modeling_outputs, making it more clear and available for generic usage * Update src/transformers/modeling_outputs.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_outputs.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * run make style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-08-30 08:08:16 -04:00
Falk Puschner	7828194ebe	✨ add citation file (#13214 )	2021-08-30 07:46:55 -04:00
NielsRogge	b6ddb08a66	Add LayoutLMv2 + LayoutXLM (#12604 ) * First commit * Make style * Fix dummy objects * Add Detectron2 config * Add LayoutLMv2 pooler * More improvements, add documentation * More improvements * Add model tests * Add clarification regarding image input * Improve integration test * Fix bug * Fix another bug * Fix another bug * Fix another bug * More improvements * Make more tests pass * Make more tests pass * Improve integration test * Remove gradient checkpointing and add head masking * Add integration test * Add LayoutLMv2ForSequenceClassification to the tests * Add LayoutLMv2ForQuestionAnswering * More improvements * More improvements * Small improvements * Fix _LazyModule * Fix fast tokenizer * Move sync_batch_norm to a separate method * Replace dummies by requires_backends * Move calculation of visual bounding boxes to separate method + update README * Add models to main init * First draft * More improvements * More improvements * More improvements * More improvements * More improvements * Remove is_split_into_words * More improvements * Simply tesseract - no use of pandas anymore * Add LayoutLMv2Processor * Update is_pytesseract_available * Fix bugs * Improve feature extractor * Fix bug * Add print statement * Add truncation of bounding boxes * Add tests for LayoutLMv2FeatureExtractor and LayoutLMv2Tokenizer * Improve tokenizer tests * Make more tokenizer tests pass * Make more tests pass, add integration tests * Finish integration tests * More improvements * More improvements - update API of the tokenizer * More improvements * Remove support for VQA training * Remove some files * Improve feature extractor * Improve documentation and one more tokenizer test * Make quality and small docs improvements * Add batched tests for LayoutLMv2Processor, remove fast tokenizer * Add truncation of labels * Apply suggestions from code review * Improve processor tests * Fix failing tests and add suggestion from code review * Fix tokenizer test * Add detectron2 CI job * Simplify CI job * Comment out non-detectron2 jobs and specify number of processes * Add pip install torchvision * Add durations to see which tests are slow * Fix tokenizer test and make model tests smaller * Frist draft * Use setattr * Possible fix * Proposal with configuration * First draft of fast tokenizer * More improvements * Enable fast tokenizer tests * Make more tests pass * Make more tests pass * More improvements * Addd padding to fast tokenizer * Mkae more tests pass * Make more tests pass * Make all tests pass for fast tokenizer * Make fast tokenizer support overflowing boxes and labels * Add support for overflowing_labels to slow tokenizer * Add support for fast tokenizer to the processor * Update processor tests for both slow and fast tokenizers * Add head models to model mappings * Make style & quality * Remove Detectron2 config file * Add configurable option to label all subwords * Fix test * Skip visual segment embeddings in test * Use ResNet-18 backbone in tests instead of ResNet-101 * Proposal * Re-enable all jobs on CI * Fix installation of tesseract * Fix failing test * Fix index table * Add LayoutXLM doc page, first draft of code examples * Improve documentation a lot * Update expected boxes for Tesseract 4.0.0 beta * Use offsets to create labels instead of checking if they start with ## * Update expected boxes for Tesseract 4.1.1 * Fix conflict * Make variable names cleaner, add docstring, add link to notebooks * Revert "Fix conflict" This reverts commit a9b46ce9afe47ebfcfe7b45e6a121d49e74ef2c5. * Revert to make integration test pass * Apply suggestions from @LysandreJik's review * Address @patrickvonplaten's comments * Remove fixtures DocVQA in favor of dataset on the hub Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2021-08-30 12:35:42 +02:00
Hwijeen Ahn	439e7abd2d	use float 16 in causal mask and masked bias (#13194 )	2021-08-30 06:09:24 -04:00
Nicolas Patry	8be921f9de	Announcing the default model used by the pipeline (with a link). (#13276 )	2021-08-30 06:04:30 -04:00
Patrick von Platen	a75db353c4	[Slow tests] Disable Wav2Vec2 pretraining test for now (#13303 ) * fix_torch_device_generate_test * remove @ * wav2vec2 pretraining Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-08-30 06:03:02 -04:00
Patrick von Platen	4362ee298a	correct (#13304 )	2021-08-30 06:02:08 -04:00
Stefan Schweter	4046e66e40	examples: only use keep_linebreaks when reading TXT files (#13320 ) * examples: only use keep_linebreaks when reading TXT files for all CLM examples * examples: only use keep_linebreaks when reading TXT files for all CLM examples * examples: only use keep_linebreaks when reading TXT files for all CLM examples	2021-08-28 16:22:29 +02:00
Anton Lozhkov	b6f332ecaf	Add Wav2Vec2 & Hubert ForSequenceClassification (#13153 ) * Add hubert classifier + tests * Add hubert classifier + tests * Dummies for all classification tests * Wav2Vec2 classifier + ER test * Fix hubert integration tests * Add hubert IC * Pass tests for all classification tasks on Hubert * Pass all tests + copies * Move models to the SUPERB org	2021-08-27 20:52:51 +03:00
Patrick von Platen	2bef3433e5	[Flax] Correct all return tensors to numpy (#13307 ) * fix_torch_device_generate_test * remove @ * finish find and replace	2021-08-27 17:38:34 +02:00
Nicolas Patry	8aa67fc192	Fixing mbart50 with `return_tensors` argument too. (#13301 ) * Fixing mbart50 with `return_tensors` argument too. * Adding mbart50 tokenization tests.	2021-08-27 17:22:06 +02:00
Nicolas Patry	b89a964d3f	Moving `zero-shot-classification` pipeline to new testing. (#13299 ) * Moving `zero-shot-classification` pipeline to new testing. * Cleaning up old mixins. * Fixing tests `sshleifer/tiny-distilbert-base-uncased-finetuned-sst-2-english` is corrupted in PT. * Adding warning.	2021-08-27 15:46:11 +02:00
NielsRogge	cc27ac1a87	Fix BeitForMaskedImageModeling (#13275 ) * First pass * Fix docs of bool_masked_pos * Add integration script * Fix docstring * Add integration test for BeitForMaskedImageModeling * Remove file * Fix docs	2021-08-27 09:09:57 -04:00
Nicolas Patry	a3f96f366a	Moving `translation` pipeline to new testing scheme. (#13297 ) * Moving `translation` pipeline to new testing scheme. * Update tokenization mbart tests.	2021-08-27 12:26:17 +02:00
Stefan Schweter	319d840b46	examples: add keep_linebreaks option to CLM examples (#13150 ) * examples: add keep_linebreaks option to text dataset loader for all CLM examples * examples: introduce new keep_linebreaks option as data argument in CLM examples	2021-08-27 11:35:45 +02:00
Nicolas Patry	45a8eb66bb	Moving `token-classification` pipeline to new testing. (#13286 ) * Moving `token-classification` pipeline to new testing. * Fix tests.	2021-08-27 11:24:56 +02:00
Nicolas Patry	a6e36558ef	Moving `text-generation` pipeline to new testing framework. (#13285 ) * Moving `text-generation` pipeline to new testing framework. * Keep check_model_type but log instead of raise Exception. * warning -> error.	2021-08-26 17:30:03 +02:00
NielsRogge	0759f2510c	Add DINO conversion script (#13265 ) * First commit * Add interpolation of patch embeddings * Comment out code * Fix bug * Fix another bug * Fix bug * Fix another bug * Remove print statements * Update conversion script * Use the official vit implementation * Add support for converting dino_vits8 * Add DINO to docs of ViT * Remove assertion * Add interpolation of position encodings * Fix bug * Add align_corners * Add interpolate_pos_encoding option to forward pass of ViTModel * Improve interpolate_pos_encoding method * Add docstring	2021-08-26 17:25:20 +02:00
Nicolas Patry	14e52783f6	Moving `text2text-generation` to new pipeline testing mecanism. (#13283 )	2021-08-26 16:26:58 +02:00
Nicolas Patry	662b143b71	Hotfixing master tests. (#13282 )	2021-08-26 10:09:53 -04:00

1 2 3 4 5 ...

7841 Commits All Branches Search

7841 Commits

All Branches