transformers

Commit Graph

Author	SHA1	Message	Date
NielsRogge	fa84540e98	Vit deit fixes (#11309 ) * Improve docs of DeiT and ViT, add community notebook * Add gitignore for test_samples * Add notebook with Trainer Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-05-12 11:46:02 -04:00
Lysandre	d77eb0cf92	Docs for v4.7.0.dev0	2021-05-12 17:08:35 +02:00
Lysandre	64e78564a5	Release: v4.6.0	2021-05-12 17:03:03 +02:00
Patrick von Platen	fd6204b2a7	[Lazy init] Force fall back to slow init for composite models (#11705 ) * fix encoder-decoder & RAG * finalize * Update src/transformers/models/encoder_decoder/modeling_encoder_decoder.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/models/rag/modeling_rag.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Patrick von Platen <patrick@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-05-12 10:52:54 -04:00
Suraj Patil	5c1cda9d3c	fix example in config doc (#11696 )	2021-05-12 09:48:52 -04:00
Philip May	77f4c46b50	remove defaults to None if optional (#11703 )	2021-05-12 09:11:10 -04:00
Marc van Zee	6797cdc077	Updates README and fixes bug (#11701 )	2021-05-12 13:52:52 +01:00
Suraj Patil	f063c56d94	Fix clip docs (#11694 ) * fix doc url * fix example	2021-05-12 15:28:30 +05:30
Suraj Patil	8719afa1ad	CLIP (#11445 ) * begin second draft * fix import, style * add loss * fix embeds, logits_scale, and projection * fix imports * add conversion script * add feature_extractor and processor * style * add tests for tokenizer, extractor and processor * add vision model tests * add weight init * add more tests * fix save_load test * model output, dosstrings, causal mask * config doc * add clip model tests * return dict * bigin integration test * add integration tests * fix-copies * fix init * Clip => CLIP * fix module name * docs * fix doc * output_dim => projection_dim * fix checkpoint names * remoe fast tokenizer file * fix conversion script * fix tests, quality * put causal mask on device * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix attribute test * style * address sylvains comments * style * fix docstrings * add qucik_gelu in activations, docstrings * clean-up attention test * fix act fun * fix config * fix torchscript tests * even batch_size * remove comment * fix ouput tu_tuple * fix save load tests * fix add tokens test * add fast tokenizer * update copyright * new processor API * fix docs * docstrings * docs * fix doc * fix doc * fix tokenizer * fix import in doc example * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * check types of config * valhalla => openai * load image using url * fix test * typo Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-05-12 13:48:15 +05:30
Marc van Zee	4ce6bcc310	Adds Flax BERT finetuning example on GLUE (#11564 ) * Adds Flax BERT finetuning example * fix traced jax tensor type * Use Optax losses and learning schedulers * Add 1GPU training results * merge into master & make style * fix input * del file * Fix bug in loss and add torch runs * finish bert flax fine-tune * Update examples/flax/text-classification/README.md * Update examples/flax/text-classification/run_flax_glue.py * add requirements * finalize * finalize Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-11 19:02:59 +01:00
Sylvain Gugger	f13f1f8fb8	Test checkpointing (#11682 ) * Add test and see where CI is unhappy * Load with strict=False	2021-05-11 12:02:48 -04:00
Julien Plu	d9b286272c	Fix TF Roberta for mixed precision training (#11675 )	2021-05-11 12:01:03 -04:00
Sylvain Gugger	a135f59536	Auto modelcard (#11599 ) * Autogenerate model cards from the Trainer * ModelCard deprecated * Fix test * Style * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Address review comments * Quality * With all metadata * Metadata * Post-merge conflict mess * Data args and all examples * Default license and languages when possible Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-05-11 11:30:34 -04:00
Matt	b3429ab678	Grammar and style edits for the frontpage README (#11679 ) * Grammar and style edits for the frontpage README * Going all-in on em-dashes because you only live once * Update README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-05-11 15:49:34 +01:00
nxznm	901153c61e	Fix docstring of description about input_ids (#11672 )	2021-05-11 08:12:02 -04:00
Jonathan Chang	64232bc0df	Add --text_column to run_summarization_no_trainer (#11673 )	2021-05-11 07:58:38 -04:00
Julien Plu	024cd19bb7	Add MacOS TF version (#11674 ) Co-authored-by: Julien Plu <jplu@argos.local>	2021-05-11 05:42:21 -04:00
Pavel Soriano	9120ae7d66	Fixes NoneType exception when topk is larger than one coupled with a small context in the Question-Answering pipeline (#11628 ) * added fix to decode function. added test to qa pipeline tests * completed topk docstring * fixed formatting with black * applied style_doc to fix line length	2021-05-10 13:28:10 -04:00
Patrick von Platen	dcb0e61430	push (#11667 )	2021-05-10 17:38:17 +01:00
Sylvain Gugger	05a930671f	Save scaler state dict when checkpointing (#11663 )	2021-05-10 10:58:30 -04:00
Matt	ef8d32c5ea	Fix suggested by @bhadreshpsavani (#11660 )	2021-05-10 14:28:04 +01:00
Vasudev Gupta	575c979144	Update community.md (#11654 )	2021-05-10 09:48:21 +01:00
Tanmay Laud	f7f872955d	Big Bird Fast Tokenizer implementation (#11075 ) * Added Big Bird Fast Tokenizer initial file * style fixes * flake fixes * Added big bird fast tokenizer to init files * Added big bird fast to Auto tokenization * fix styles * minor quality fixes * Added initial test code * Fix SpmConverter when precompiled_charsmap doesn't exist * fixed post processor * minor style fix * minor fix input names * Actually fix identity normalization * style * Added token type ids to fast tokenizer * style * flake fix * fix copies Co-authored-by: Anthony MOI <m.anthony.moi@gmail.com>	2021-05-10 03:01:23 -04:00
Bhavitvya Malik	80da304a0f	updated user permissions based on umask (#11119 ) * updated user permissions based on umask * updated user permissions based on umask * changes as per suggestions * minor changes	2021-05-10 02:45:29 -04:00
Quentin Lhoest	1a0b41781d	Update requirements.txt (#11634 )	2021-05-10 11:19:52 +05:30
NielsRogge	f785c51692	Update code example (#11631 ) * Update code example * Code review	2021-05-10 11:18:43 +05:30
Tommy Chiang	7e406f4a65	[Examples] Fix invalid links after reorg (#11650 )	2021-05-10 11:16:48 +05:30
Tommy Chiang	f2ffcaf49f	[Examples] Check key exists in datasets first (#11503 )	2021-05-09 15:42:38 -04:00
Stas Bekman	ba0d50f214	[examples] fix sys.path in conftest.py (#11636 ) * restore conftest.py * fix conftest and make copies * remove unneeded parts * remove unwanted files	2021-05-07 14:44:22 -07:00
Stas Bekman	cd9b8d7efe	[self-push CI] sync with self-scheduled (#11637 ) forgot to add the missing `libaio-dev` to this workflow	2021-05-07 14:06:33 -07:00
Lysandre Debut	da37eb8e43	Reduce to 1 worker and set timeout for GPU TF tests (#11633 )	2021-05-07 11:55:20 -04:00
Lysandre Debut	39084ca663	Add the ImageClassificationPipeline (#11598 ) * Add the ImageClassificationPipeline * Code review Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com> * Have `load_image` at the module level Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>	2021-05-07 08:08:40 -04:00
Patrick von Platen	e7bff0aabe	make fix copy (#11627 )	2021-05-07 07:48:51 -04:00
Vasudev Gupta	dc3f6758cf	Add BigBirdPegasus (#10991 ) * init bigbird pegasus * add debugging nb ; update config * init conversion * update conversion script * complete conversion script * init forward() * complete forward() * add tokenizer * add some slow tests * commit current * fix copies * add docs * add conversion script for bigbird-roberta-summarization * remove TODO * small fixups * correct tokenizer * add bigbird core for now * fix config * fix more * revert pegasus-tokenizer back * make style * everything working for pubmed; yayygit status * complete tests finally * remove bigbird pegasus tok * correct tokenizer * correct tests * add tokenizer files * finish make style * fix test * update * make style * fix tok utils base file * make fix-copies * clean a bit * small update * fix some suggestions * add to readme * fix a bit, clean tests * fix more tests * Update src/transformers/__init__.py * Update src/transformers/__init__.py * make fix-copies * complete attn switching, auto-padding left * make style * fix auto-padding test * make style * fix batched attention tests * put tolerance at 1e-1 for stand-alone decoder test * fix docs * fix tests * correct slow tokenizer conversion * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * complete remaining suggestions * fix test Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-05-07 09:27:43 +02:00
Jonathan Chang	6f40e31766	Fix comment in run_clm_no_trainer.py (#11624 )	2021-05-07 12:32:30 +05:30
Sylvain Gugger	33fd83bc01	Fix RNG saves in distributed mode. (#11620 ) * Fix RNG saves in distributed mode. * Update src/transformers/trainer.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2021-05-06 17:14:12 -04:00
Stas Bekman	619200cc42	[cuda ext tests] fixing tests (#11619 ) * fixing tests * cleanup	2021-05-06 13:35:28 -07:00
Patrick von Platen	44c5621db0	fix tests (#11615 )	2021-05-06 20:42:51 +02:00
Sylvain Gugger	7eee950ac3	Re-styling in seq2seq attention (#11613 )	2021-05-06 14:24:19 -04:00
Eldar Kurtic	cf409e5594	Fix docstring typo (#11611 )	2021-05-06 17:09:28 +05:30
Vipul Raheja	f594090a93	fix typo in command (#11605 )	2021-05-06 12:32:54 +05:30
Lysandre Debut	079557c1c5	Fix Python version (#11607 )	2021-05-06 02:50:11 -04:00
baeseongsu	c1780ce7a4	fix head_mask for albert encoder part(`AlbertTransformer`) (#11596 ) * fix head mask for albert encoder part * fix head_mask for albert encoder part	2021-05-06 02:18:02 -04:00
Mats Sjöberg	864c1dfe34	Accept tensorflow-rocm package when checking TF availability (#11595 )	2021-05-05 14:44:29 -04:00
Patrick von Platen	3e3e41ae20	Pytorch - Lazy initialization of models (#11471 ) * lazy_init_weights * remove ipdb * save int * add necessary code * remove unnecessary utils * Update src/transformers/models/t5/modeling_t5.py * clean * add tests * correct * finish tests * finish tests * fix some more tests * fix xlnet & transfo-xl * fix more tests * make sure tests are independent * fix tests more * finist tests * final touches * Update src/transformers/modeling_utils.py * Apply suggestions from code review * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * clean tests * give arg positive name * add more mock weights to xlnet Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2021-05-05 17:22:20 +02:00
Lysandre	8fa8e19429	Skip Funnel test	2021-05-05 12:38:01 +02:00
Deepali	83e59d8e0b	add importlib_metadata and huggingface_hub as dependency in the conda recipe (#11591 ) * add importlib_metadata as dependency (#11490) Co-authored-by: Deepali Chourasia <deepch23@us.ibm.com> * add huggingface_hub dependency Co-authored-by: Deepali Chourasia <deepch23@us.ibm.com>	2021-05-05 03:36:18 -04:00
Stas Bekman	bf0dfa98d3	copies need to be fixed too (#11585 )	2021-05-05 03:35:15 -04:00
Stas Bekman	c065025c47	[trainer] document resume randomness (#11588 ) * document resume randomness * fix link * reword * fix * reword * style	2021-05-04 14:17:11 -07:00
Sylvain Gugger	6b241e0e3b	Reproducible checkpoint (#11582 ) * Set generator in dataloader * Use generator in all random samplers * Checkpoint all RNG states * Final version * Quality * Test * Address review comments * Quality * Remove debug util * Add python and numpy RNGs * Split states in different files in distributed * Quality * local_rank for TPUs * Only use generator when accepted * Add test * Set seed to avoid flakiness * Make test less flaky * Quality	2021-05-04 16:20:56 -04:00

1 2 3 4 5 ...

7175 Commits All Branches Search

7175 Commits

All Branches