Commit Graph

1975 Commits

Author SHA1 Message Date
thomwolf 328a86d2af adding links to the templates in readme and contributing 2019-10-30 11:37:55 +01:00
thomwolf 7f4226f9e6 adding templates 2019-10-30 11:31:56 +01:00
altsoph 079bfb32fb Evaluation fixed. 2019-10-28 10:18:58 -04:00
altsoph 438f2730a0 Evaluation code fixed. 2019-10-28 10:18:58 -04:00
Matt Maybeno ae1d03fc51 Add roberta to doc 2019-10-24 14:32:48 -04:00
Matt Maybeno 4e5f88b74f Add Roberta to run_ner.py 2019-10-24 14:32:48 -04:00
Matt Maybeno b92d68421d Use roberta model and update doc strings 2019-10-24 14:32:48 -04:00
Matt Maybeno 66085a1321 RoBERTa token classification
[WIP] copy paste bert token classification for roberta
2019-10-24 14:32:48 -04:00
VictorSanh 5b6cafb11b [release] fix table weirdness 2019-10-23 10:35:16 -04:00
VictorSanh 8ad5c591cd [RELEASE] DistilRoBERTa 2019-10-23 10:29:47 -04:00
Julien Chaumond ef1b8b2ae5 [CTRL] warn if generation prompt does not start with a control code
see also https://github.com/salesforce/ctrl/pull/50
2019-10-22 21:30:32 +00:00
Julián Peller (dataista) e16d46843a Fix architectures count 2019-10-22 15:13:47 -04:00
Lysandre 1cfd974868 Option to benchmark only one of the two libraries 2019-10-22 13:32:23 -04:00
Lysandre 777faa8ae7 Fix #1597 2019-10-22 11:26:42 -04:00
Thomas Wolf b8c9ea0010
Merge pull request #1580 from pminervini/master
Gradient norm clipping should be done right before calling the optimiser
2019-10-22 13:59:20 +02:00
Pasquale Minervini abd7110e21 gradient norm clipping should be done right before calling the optimiser - fixing run_glue and run_ner as well 2019-10-21 19:56:52 +01:00
thomwolf 4d456542e9 Fix citation 2019-10-21 16:34:14 +02:00
Thomas Wolf 0e64fec1ab
Merge pull request #1568 from daemon/patch-1
Fix hanging when loading pretrained models
2019-10-21 14:31:57 +02:00
Pasquale Minervini 3775550c4b gradient norm clipping should be done right before calling the optimiser 2019-10-20 22:33:56 +01:00
Pasquale Minervini bf2c36a920
Merge pull request #1 from huggingface/master
update
2019-10-20 23:30:45 +02:00
Ralph Tang a2c8c8ef00
Fix hanging when loading pretrained models
- Fix hanging when loading pretrained models from the cache without having internet access. This is a widespread issue on supercomputers whose internal compute nodes are firewalled.
2019-10-19 16:19:20 -04:00
LysandreJik 82f6abd98a Benchmark section added to the documentation 2019-10-18 17:27:10 -04:00
LysandreJik 7dd29ed2f1 Benchmarks example script 2019-10-18 10:53:04 -04:00
Lysandre Debut 8efc0ec91a Add Benchmarks to issue templates 2019-10-18 10:45:44 -04:00
VictorSanh fd97761c5a soft launch distilroberta 2019-10-17 15:28:58 -04:00
leo-du ecd15667f3 fix repetition penalty 2019-10-17 14:47:14 -04:00
Lysandre Debut c544194611
Remove `special_tokens_mask` from inputs in README
Co-authored-by: Thomas Wolf @thomwolf
2019-10-16 11:05:13 -04:00
thomwolf be916cb3fb Merge branch 'master' of https://github.com/huggingface/transformers 2019-10-15 10:37:13 +02:00
thomwolf 5875aaf762 install tensorboard 2019-10-15 10:36:46 +02:00
Thomas Wolf 40f14ff545
Merge pull request #1513 from slayton58/amp_fp16_einsum
Force einsum to run in fp16
2019-10-15 10:25:00 +02:00
Thomas Wolf e703e4dfe1
Merge pull request #1509 from julian-pani/patch-3
remove leftover usage of DUMMY_INPUTS
2019-10-15 10:24:13 +02:00
thomwolf 898ce064f8 add tests on TF2.0 & PT checkpoint => model convertion functions 2019-10-15 10:04:19 +02:00
Thomas Wolf d147671c6c
Merge pull request #1508 from tlkh/master
Added performance enhancements (XLA, AMP) to examples
2019-10-15 09:57:18 +02:00
thomwolf 2c1d5564ad add readme information 2019-10-15 09:56:52 +02:00
Thomas Wolf 08bd8f9f39
Merge pull request #1505 from e-budur/master
Fixed the sample code in the title 'Quick tour'.
2019-10-15 09:50:36 +02:00
Thomas Wolf 8aa3b753bd
Merge pull request #1434 from bryant1410/patch-1
Remove unnecessary use of FusedLayerNorm in XLNet
2019-10-15 09:44:19 +02:00
Thomas Wolf 621e7a2529
Merge pull request #1275 from stecklin/ner-fine-tuning
Implement fine-tuning BERT on CoNLL-2003 named entity recognition task
2019-10-15 09:35:24 +02:00
thomwolf c55badcee0 Add NER finetuning details by @stefan-it in example readme 2019-10-15 09:33:52 +02:00
Julien Chaumond 788e632622 [ner] Honor args.overwrite_cache 2019-10-15 09:17:31 +02:00
thomwolf 0f9ebb0b43 add seqeval as requirement for examples 2019-10-15 09:17:31 +02:00
thomwolf 66adb71734 update to transformers 2019-10-15 09:17:31 +02:00
Marianne Stecklina 5ff9cd158a Add option to predict on test set 2019-10-15 09:17:31 +02:00
Marianne Stecklina 7f5367e0b1 Add cli argument for configuring labels 2019-10-15 09:17:31 +02:00
Marianne Stecklina e1d4179b64 Make file reading more robust 2019-10-15 09:17:31 +02:00
Marianne Stecklina 383ef96747 Implement fine-tuning BERT on CoNLL-2003 named entity recognition task 2019-10-15 09:17:31 +02:00
Marianne Stecklina 5adb39e757 Add option to predict on test set 2019-10-15 09:14:53 +02:00
Marianne Stecklina 99b189df6d Add cli argument for configuring labels 2019-10-15 09:14:53 +02:00
Marianne Stecklina 3e9420add1 Make file reading more robust 2019-10-15 09:14:53 +02:00
Marianne Stecklina cde42c4354 Implement fine-tuning BERT on CoNLL-2003 named entity recognition task 2019-10-15 09:14:53 +02:00
Thomas Wolf 80889a0226
Merge pull request #1512 from louismartin/fix-roberta-convert
Fix import error in script to convert faisreq roberta checkpoints
2019-10-14 17:40:32 +02:00