Commit Graph

6902 Commits

Author SHA1 Message Date
Stas Bekman 3c27d246e5
[vulnerability] fix dependency (#10914)
this PR fixes https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/PyYAML/open
2021-03-26 09:06:11 -04:00
Tomy Hsieh 4b2b50aa7b
Rename NLP library to Datasets library (#10920)
* Rename NLP library to Datasets library

* Update github template

* Fix styling
2021-03-26 08:07:59 -04:00
lexhuismans 86c6f8a8b1
Fix comment (#10886) 2021-03-25 21:23:56 +03:00
Sylvain Gugger 9856c9213d Reorder init imports 2021-03-25 12:51:43 -04:00
Sylvain Gugger e70068a719 Fix typo 2021-03-25 12:40:25 -04:00
Sylvain Gugger f183a7a3c3 Sort init imports 2021-03-25 12:38:54 -04:00
Amir Tahmasbi 4684bfc757
LayoutLM TF 2 (#10636)
* Added embeddings layer

* Added layoutlm layers, main model, maskedlm and token classification classes

* Added model classes to tf auto models

* Added model to PT to TF conversion script

* Added model to doc README

* Added tests

* Removed unused imports

* Added layoutlm model, test, and doc for sequence classification, and fixed imports in __init__.py

* Made tests pass!

* Fixed typos in imports and docs

* Fixed a typo in embeddings layer

* Removed imports

* Fixed formatting issues, imports, tests

* Fixed small formatting issues

* Removed duplicates import from main __init__.py

* Changed default arg to true for adding pooling layer to tf layoutlm

* Fixed formatting issues

* Style

* Added copied from to classes copied from bert

* Fixed doc strings examples to work with layoutlm inputs

* Removed PyTorch reference in doc strings example

* Added integration tests

* Cleaned up initialization file

* Updated model checkpoint identifiers

* Fixed imports

Co-authored-by: Amir Tahmasbi <amir@ehsai.ca>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
2021-03-25 12:32:38 -04:00
Philipp Schmid 1a3e0c4fe6
make local setup clearer and add missing links (#10899) 2021-03-25 09:01:31 -04:00
Jethro Kuan 5f1491d3b3
run_glue_no_trainer: datasets -> raw_datasets (#10898)
Use the correct variable (raw_datasets) instead of the module (datasets)
where appropriate.
2021-03-25 08:28:17 -04:00
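The fix above is about not indexing the `datasets` module where the loaded dataset object is meant; a minimal sketch of the pattern, assuming an illustrative GLUE task rather than the exact run_glue_no_trainer.py code:

```
# `datasets` refers to the Datasets library (a module); the loaded data must
# live under a different name so the object, not the module, is indexed.
import datasets

raw_datasets = datasets.load_dataset("glue", "mrpc")

# Wrong: `datasets` is the module and cannot be subscripted.
# train_split = datasets["train"]

# Correct: index the loaded DatasetDict.
train_split = raw_datasets["train"]
print(train_split.column_names)
```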
Sidd Karamcheti 1c06240e1b
Update training args ignore_skip_data -> ignore_data_skip (#10891) 2021-03-24 16:44:51 -04:00
Sylvain Gugger 3b20e910b4
Remove version warning in pretrained BART models (#10890)
* Remove version warning in pretrained BART models

* Put it at the base model
2021-03-24 15:21:40 -04:00
Lysandre Debut 3c12e3c1c4
Fix overflowing bad word ids (#10889)
* Removes overflowing bad word IDs

* Raise warning
2021-03-24 15:13:56 -04:00
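A hedged sketch of the guard this describes, with illustrative names rather than the exact code in generate(): banned word IDs outside the vocabulary are dropped with a warning instead of overflowing an index.

```
import logging
from typing import List

logger = logging.getLogger(__name__)

def filter_overflowing_bad_words_ids(
    bad_words_ids: List[List[int]], vocab_size: int
) -> List[List[int]]:
    """Drop any banned sequence containing a token id outside [0, vocab_size)."""
    kept, dropped = [], []
    for seq in bad_words_ids:
        if all(0 <= token_id < vocab_size for token_id in seq):
            kept.append(seq)
        else:
            dropped.append(seq)
    if dropped:
        logger.warning(
            "Ignoring %d bad_words_ids sequences with ids outside the vocabulary.",
            len(dropped),
        )
    return kept

print(filter_overflowing_bad_words_ids([[5, 7], [100000]], vocab_size=50257))
```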
Eliza Szczechla 1f5ea9e04a
Add notebook on fine-tuning Bart (#10883)
Co-authored-by: Eliza <eliza@habanero.tiger.com.pl>
2021-03-24 11:03:37 -04:00
imzhengzx f81077fcf3
Fix tokenizer type annotation in __init__ definition (#10879)
The original code on line 246 is
```
tokenizer: Optional["PreTrainedTokenizerBase"] = None,
```

it should be
```
tokenizer: Optional[PreTrainedTokenizerBase] = None,
```
2021-03-24 11:00:14 -04:00
Sylvain Gugger 1aed2b908e
Add new notebook links in the docs (#10876) 2021-03-24 09:45:08 -04:00
Sylvain Gugger a735f727cc
Fix test_trainer_distributed (#10875) 2021-03-23 19:03:06 -04:00
Philipp Schmid 8c297cdb30
SageMaker Trainer SMP init fix (#10870)
* rewrote is_sagemaker_model_parallel_available

* added is_sagemaker_model_parallel_available to SageMakerTrainer

* removed unnecessary mp_parameters as TrainingArguments

* make style happy

* added mp_parameters again to parse mp-specific args.
2021-03-23 20:07:55 +01:00
RafaelWO d4d4447d53
fixed prefix_allowed_tokens_fn docstring in generate() (#10862) 2021-03-23 13:48:22 -04:00
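For context, `prefix_allowed_tokens_fn` is the generate() hook that docstring covers: at each decoding step it receives the batch index and the tokens generated so far, and returns the token ids allowed next. A minimal usage sketch with a toy constraint (model choice and constraint are illustrative):

```
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Toy constraint: only ever allow the token(s) for " Paris".
allowed_ids = tokenizer(" Paris", add_special_tokens=False).input_ids

def prefix_allowed_tokens_fn(batch_id, input_ids):
    return allowed_ids

inputs = tokenizer("The capital of France is", return_tensors="pt")
output = model.generate(
    **inputs,
    max_new_tokens=2,
    num_beams=2,
    prefix_allowed_tokens_fn=prefix_allowed_tokens_fn,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```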
Bhadresh Savani 7ef40120a0
[Examples] Added predict stage and Updated Example Template (#10868)
* added predict stage

* added test keyword in exception message

* removed example specific saving predictions

* fixed f-string error

* removed extra line

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
2021-03-23 10:37:59 -07:00
Stas Bekman fb2b89840b
[file_utils] import refactor (#10859)
* import refactor

* fix the fallback
2021-03-23 09:41:41 -07:00
Lysandre 3f48b2bc3e Update stable docs 2021-03-23 11:01:16 -04:00
Philipp Schmid 77ffd5edd5
Amazon SageMaker Documentation (#10867)
* added finished documentation

* changed version from 1.6 to 1.6.0 for distributed

* updated versions

* updated urls
2021-03-23 10:56:44 -04:00
Sylvain Gugger bf1f43fbd7
Update the example template for a no Trainer option (#10865) 2021-03-23 10:02:39 -04:00
Marta Maślankowska 2eb596f085
Fix p_mask cls token masking in qa pipeline (#10863) 2021-03-23 09:08:39 -04:00
Bhadresh Savani eb330e8904
fixed typo (#10861) 2021-03-23 08:15:28 -04:00
Stas Bekman e21f89f64c
fix nan in full-fp16 label_smoothing eval (#10815) 2021-03-22 19:23:24 -07:00
Sylvain Gugger b5b957a65c
Make convert_to_onnx runnable as script again (#10857) 2021-03-22 22:16:39 -04:00
Patrick von Platen 77bf3fe787
[Generate] Add save mode logits processor to remove nans and infs if necessary (#10769)
* push

* finish

* finish

* make fix copies

* change name
2021-03-23 01:00:05 +03:00
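Roughly, the processor replaces nan scores and clamps infinite ones before sampling so generation cannot crash on them; a hedged sketch of such a processor, not necessarily the exact class added by the PR:

```
import torch
from transformers import LogitsProcessor, LogitsProcessorList

class RemoveNanInfLogitsProcessor(LogitsProcessor):
    """Replace nan with 0 and clamp +/-inf to the dtype's finite extremes."""

    def __call__(self, input_ids, scores):
        return torch.nan_to_num(
            scores,
            nan=0.0,
            posinf=torch.finfo(scores.dtype).max,
            neginf=torch.finfo(scores.dtype).min,
        )

processors = LogitsProcessorList([RemoveNanInfLogitsProcessor()])
scores = torch.tensor([[0.5, float("nan"), float("inf"), float("-inf")]])
print(processors(torch.zeros(1, 1, dtype=torch.long), scores))
```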
Eliza Szczechla 9f8fa4e973
Use DataCollatorForSeq2Seq in run_summarization in all cases (#10856)
Co-authored-by: Eliza <eliza@habanero.tiger.com.pl>
2021-03-22 15:05:39 -04:00
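For reference, `DataCollatorForSeq2Seq` dynamically pads both inputs and labels per batch; a usage sketch along these lines (checkpoint and toy features are illustrative):

```
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer, DataCollatorForSeq2Seq

tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")

data_collator = DataCollatorForSeq2Seq(
    tokenizer,
    model=model,              # lets the collator prepare decoder_input_ids
    label_pad_token_id=-100,  # padded label positions are ignored by the loss
)

features = [
    {"input_ids": tokenizer("summarize: a short document").input_ids,
     "labels": tokenizer("a summary").input_ids},
    {"input_ids": tokenizer("summarize: a much longer document to pad against").input_ids,
     "labels": tokenizer("another summary").input_ids},
]
batch = data_collator(features)
print(batch["input_ids"].shape, batch["labels"].shape)
```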
Ruan Chaves a8d4d6776d
Modify the Trainer class to handle simultaneous execution of Ray Tune and Weights & Biases (#10823)
* Modify the _hp_search_setup method on the Trainer class to handle the wandb argument passed by Ray Tune to model config.

* Reformat single quotes as double quotes.
2021-03-22 14:04:51 -04:00
Boris Dayma 125ccead71
feat(wandb): logging and configuration improvements (#10826)
* feat: ensure unique artifact id

* feat: allow manual init

* fix: simplify reinit logic

* fix: no dropped value + immediate commits

* fix: wandb use in sagemaker

* docs: improve documentation and formatting

* fix: typos

* docs: improve formatting
2021-03-22 10:45:17 -04:00
Sidd Karamcheti b230181d41
Add simple one character fix so that on_step_begin and on_step_end are called at the right times (#10839) 2021-03-22 09:15:39 -04:00
Stas Bekman 24ab5b08a3
[makefile] autogenerate target (#10814)
* autogenerate target

* clarify comment
2021-03-22 09:14:22 -04:00
Sebastian Olsson 2c6684239f
Correct AutoConfig call docstrings (#10822) 2021-03-22 09:12:44 -04:00
Stas Bekman 8fb4671811
[vulnerability] fix in example deps (#10817)
Takes care of:
https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/jinja2/open

@LysandreJik

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-03-22 09:05:24 -04:00
dependabot[bot] dbfe379514
Bump jinja2 from 2.11.2 to 2.11.3 in /examples/research_projects/lxmert (#10818)
Bumps [jinja2](https://github.com/pallets/jinja) from 2.11.2 to 2.11.3.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/master/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/2.11.2...2.11.3)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-03-22 08:54:50 -04:00
Qiushi Pan 29904a967b
Update FINE_TUNE_XLSR_WAV2VEC2.md (#10849)
Fix typo.
2021-03-22 07:58:59 -04:00
Patrick von Platen 0f226f78ce
push (#10846) 2021-03-22 10:32:21 +03:00
Suraj Patil 82b8d8c7b0
Update FINE_TUNE_XLSR_WAV2VEC2.md 2021-03-21 22:47:09 +05:30
Patrick von Platen af6125ffdb
Update FINE_TUNE_XLSR_WAV2VEC2.md 2021-03-21 12:31:33 +03:00
Patrick von Platen 5aaf6e1460
small improvements for wav2vec2 info script (#10829) 2021-03-21 11:41:44 +03:00
Eric Lam be87b84276
Add new community notebook - wav2vec2 with GPT (#10794)
* Add new community notebook - wav2vec2 with GPT

* Update community.md, add new notebook
* feat: notebook of wav2vec xlsr ctc decoding with gpt logit adjustment
* Update: Wav2vec2 CTC decoding with gpt2 adjustment

* Update docs/source/community.md

Co-authored-by: Suraj Patil <surajp815@gmail.com>
2021-03-21 13:29:53 +05:30
Suraj Patil 68b55885ed
add doc for Local machine (#10828) 2021-03-21 13:25:34 +05:30
Sylvain Gugger 21e86f99e6
Sort init import (#10801)
* Initial script

* Add script to properly sort imports in init.

* Add to the CI

* Update utils/custom_init_isort.py

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Separate scripts that change content from quality

* Move class_mapping_update to style_checks

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-03-19 16:17:13 -04:00
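The script enforces a deterministic ordering of the names declared in the library's `__init__.py`; a simplified sketch of the kind of rule involved, assuming a constants-then-classes-then-functions grouping (the actual logic lives in utils/custom_init_isort.py):

```
def sort_import_names(names):
    """Group names as constants, classes, then functions, alphabetically within each group."""
    def group(name):
        if name.isupper():
            return 0  # constants such as CONFIG_MAPPING
        if name[0].isupper():
            return 1  # classes such as AutoConfig
        return 2      # functions such as pipeline

    return sorted(names, key=lambda name: (group(name), name.lower()))

print(sort_import_names(["pipeline", "AutoConfig", "CONFIG_MAPPING", "AddedToken"]))
# ['CONFIG_MAPPING', 'AddedToken', 'AutoConfig', 'pipeline']
```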
Julien Chaumond 1438c487df
wav2vec doc tweaks (#10808)
* wording/typos tweaks

* Make model upload instructions simpler
2021-03-19 12:48:54 -04:00
Patrick von Platen b9570a813c
Update FINE_TUNE_XLSR_WAV2VEC2.md 2021-03-19 19:45:28 +03:00
Philipp Schmid f2b744f690
Add transformers id to hub requests (#10811)
* add uuid.hex to user_agent

* add log

* changed order of it

* renamed as session id

* renamed variable

* reverted naming of the const
2021-03-19 16:26:32 +01:00
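In effect the change tags hub HTTP requests with a per-process session identifier so requests from one run can be correlated; a minimal sketch of the idea, with illustrative names and version string:

```
import uuid

SESSION_ID = uuid.uuid4().hex  # generated once per Python process

def http_user_agent(library_version: str = "4.5.0") -> str:
    # Attach the session id alongside the library version in the user-agent string.
    return f"transformers/{library_version}; session_id/{SESSION_ID}"

print(http_user_agent())
```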
Sylvain Gugger 946400fb68
Expand a bit the presentation of examples (#10799)
* Expand a bit the presentation of examples

* Apply suggestions from code review

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Address review comments

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
2021-03-19 10:06:08 -04:00
Bhadresh Savani fd1d9f1ab8
[Example] Updating Question Answering examples for Predict Stage (#10792)
* added prediction stage and eval fix

* style correction

* removed extra lines
2021-03-19 09:42:17 -04:00
Patrick von Platen e8968bd03a
[XLSR-Wav2Vec2 Info doc] Add a couple of lines (#10806)
* finish

* fix

* fix

* fix

* fix
2021-03-19 12:52:54 +03:00