transformers

Commit Graph

Author	SHA1	Message	Date
Stas Bekman	ef032ddd1e	[docs] [testing] gpu decorators table (#8422 ) * gpu decorators table * whitespace * Update docs/source/testing.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * whitespace Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-11-09 14:27:42 -05:00
Sam Shleifer	a8339b9ecc	Fix bart shape comment (#8423 )	2020-11-09 13:25:33 -05:00
Sam Shleifer	46509d1c19	[docs] remove sshleifer from issue-template :( (#8418 )	2020-11-09 12:51:38 -05:00
Patrick von Platen	9c83b96e62	[Tests] Add Common Test for Training + Fix a couple of bugs (#8415 ) * add training tests * correct longformer * fix docs * fix some tests * fix some more train tests * remove ipdb * fix multiple edge case model training * fix funnel and prophetnet * clean gpt models * undo renaming of albert	2020-11-09 18:24:41 +01:00
Sylvain Gugger	52040517b8	Deprecate old data/metrics functions (#8420 )	2020-11-09 12:10:09 -05:00
Stas Bekman	d4d1fbfc5a	[fsmt convert script] fairseq broke chkpt data - fixing that (#8377 ) * fairseq broke chkpt data - fixing that * style * support older bpecodes filenames - specifically "code" in iwslt14	2020-11-09 11:57:42 -05:00
Sylvain Gugger	5c766ecb50	Fix typo	2020-11-09 11:50:51 -05:00
Sylvain Gugger	908a28894c	Add new token classification example (#8340 ) * Add new token classification example * Remove txt file * Add test * With actual testing done * Less warmup is better * Update examples/token-classification/run_ner_new.py Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Address review comments * Fix test * Make Lysandre happy * Last touches and rename * Rename in tests * Address review comments * More run_ner -> run_ner_old Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-11-09 11:39:55 -05:00
Sylvain Gugger	c7cb1aa26c	Bump tokenizers (#8419 )	2020-11-09 11:32:10 -05:00
Stas Bekman	78d706f3ae	[fsmt tokenizer] support lowercase tokenizer (#8389 ) * support lowercase tokenizer * fix arg pos	2020-11-09 10:41:39 -05:00
Shashank Gupta	1e2acd0dcf	Bug fix for permutation language modelling (#8409 )	2020-11-09 10:23:26 -05:00
Philip May	bf8625e70b	add evaluate doc - trainer.evaluate returns 'epoch' from training (#8273 ) * add evaluate doc * fix style with utils/style.doc * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-11-09 09:00:59 -05:00
Sam Shleifer	ebde57acac	examples/docs: caveat that PL examples don't work on TPU (#8309 )	2020-11-09 08:55:22 -05:00
Julien Plu	76e7a44dee	Fix some tooling for windows (#8359 ) * Fix some tooling for windows * Fix conflict * Trigger CI	2020-11-09 13:50:38 +01:00
dartrevan	507dfb40c3	Update README.md (#8406 )	2020-11-09 16:44:43 +08:00
smanjil	7247d0b4ea	updating tag for exbert viz (#8408 )	2020-11-09 16:43:55 +08:00
Stas Bekman	4ab5617b0b	comet_ml temporary fix(#8410 )	2020-11-09 16:36:06 +08:00
Sam Shleifer	e6d9cdaafe	[s2s/distill] remove run_distiller.sh, fix xsum script (#8412 )	2020-11-08 16:57:43 -05:00
Stas Bekman	66582492d3	[s2s test_finetune_trainer] failing multigpu test (#8400 )	2020-11-08 16:45:40 -05:00
Stas Bekman	f62755a600	[s2s examples test] fix data path (#8398 )	2020-11-08 16:44:18 -05:00
Jonathan Chang	4a53e8e9e4	Fix DataCollatorForWholeWordMask again (#8397 )	2020-11-08 09:53:01 -05:00
Manav Rathod	610730998f	fixed default labels for QA model (#8399 )	2020-11-08 09:08:14 -05:00
Chengxi Guo	0b02489b2c	Add gpt2-medium-chinese model card (#8402 ) * Create README.md * Update model_cards/mymusise/gpt2-medium-chinese/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-08 05:00:19 -05:00
Stas Bekman	187554366f	fix md table (#8395 )	2020-11-08 04:25:14 -05:00
Jonathan Chang	77a257fc21	Fix DataCollatorForWholeWordMask (#8379 ) * Fix DataCollatorForWholeWordMask * Replace all tensorize_batch in data_collator.py	2020-11-07 12:51:56 -05:00
Stas Bekman	517eaf460b	[make] rewrite modified_py_files in python to be cross-platform (#8371 ) * rewrite modified_py_files in python to be cross-platform * try a different way to test for variable not being "" * improve comment	2020-11-07 18:45:16 +01:00
Patrick von Platen	07708793f2	fix encoder outputs (#8368 )	2020-11-06 21:03:25 +01:00
Yossi Synett	bc0d26d1de	[All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add cross-attention weights to outputs (#8071 ) * Output cross-attention with decoder attention output * Update src/transformers/modeling_bert.py * add cross-attention for t5 and bart as well * fix tests * correct typo in docs * add sylvains and sams comments * correct typo Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-11-06 19:34:48 +01:00
hassoudi	30f2507a07	Update README.md (#8360 ) Fix websitr address	2020-11-06 11:45:46 -05:00
Jonathan Chang	5807ba3fa9	Fix typo (#8351 )	2020-11-06 11:19:41 -05:00
hassoudi	82146496b6	Update README.md (#8338 ) fixes	2020-11-06 06:20:58 -05:00
ktrapeznikov	9e5c4d39ab	Create README.md (#8312 ) * Create README.md * Update model_cards/ktrapeznikov/gpt2-medium-topic-news/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 06:19:59 -05:00
hasantanvir79	06ebc37967	Create README.md (#8255 ) * Create README.md Initial commit * Updated Read me Updated * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 03:34:24 -05:00
Karthik Uppuluri	41cd031cf2	Create README.md (#8169 )	2020-11-06 03:26:07 -05:00
Karthik Uppuluri	f932ddeff5	Create README.md (#8170 )	2020-11-06 03:25:52 -05:00
Karthik Uppuluri	08b92f78fa	Create README.md (#8168 ) * Create README.md * Update README.md	2020-11-06 03:25:33 -05:00
Karthik Uppuluri	77d62e78b0	Create README.md (#8167 ) * Create README.md Telugu BERTU Readme file * Update model_cards/kuppuluri/telugu_bertu/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 03:24:31 -05:00
Yifan Peng	dd6bfcaefb	Create README.md (#8327 )	2020-11-06 03:22:52 -05:00
smanjil	ddeecf08e6	german medbert model details (#8266 ) * model details * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 03:21:13 -05:00
Jiaxin Pei	96baaafd34	Create README.md (#8258 )	2020-11-06 03:19:12 -05:00
Stefan Schweter	185259c261	[model_cards] Update Italian BERT models and introduce new Italian XXL ELECTRA model 🎉 (#8343 )	2020-11-06 03:17:03 -05:00
Manuel Romero	34bbf60bf8	Model card: GPT-2 fine-tuned on CommonGen (#8248 )	2020-11-06 03:15:11 -05:00
Manuel Romero	973218fd3b	Model card: CodeBERT fine-tuned for Insecure Code Detection (#8247 ) * Model card: CodeBERT fine-tuned for Insecure Code Detection * Update model_cards/mrm8488/codebert-base-finetuned-detect-insecure-code/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 03:13:45 -05:00
Manuel Romero	f833ca418b	Model card: T5-base fine-tuned on QuaRel (#8334 )	2020-11-06 03:09:55 -05:00
Stas Bekman	9edafaebef	[s2s] test_bash_script.py - actually learn something (#8318 ) * use decorator * remove hardcoded paths * make the test use more data and do real quality tests * shave off 10 secs * add --eval_beams 2, reformat * reduce train size, use smaller custom dataset	2020-11-05 23:15:14 -05:00
Leandro von Werra	17450397a7	Docs bart training ref (#8330 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-11-05 17:20:57 -05:00
Stas Bekman	d787935a14	[s2s] test_distributed_eval (#8315 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-11-05 16:01:15 -05:00
Sylvain Gugger	04e442d575	Make Trainer evaluation handle dynamic seq_length (#8336 ) * Make Trainer evaluation handle dynamic seq_length * Document behavior. * Fix test * Better fix * Fixes for realsies this time * Address review comments * Without forgetting to save...	2020-11-05 15:13:51 -05:00
Guillaume Filion	27b402cab0	Output global_attentions in Longformer models (#7562 ) * Output global_attentions in Longformer models * make style * small refactoring * fix tests * make fix-copies * add for tf as well * remove comments in test * make fix-copies * make style * add docs * make docstring pretty Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>	2020-11-05 21:10:43 +01:00
Sam Shleifer	7abc1d96d1	no warn (#8329 )	2020-11-05 11:42:24 -05:00

... 3 4 5 6 7 ...

6010 Commits All Branches Search

6010 Commits

All Branches