Commit Graph

1950 Commits

Author SHA1 Message Date
leo-du ecd15667f3 fix repetition penalty 2019-10-17 14:47:14 -04:00
Lysandre Debut c544194611
Remove `special_tokens_mask` from inputs in README
Co-authored-by: Thomas Wolf @thomwolf
2019-10-16 11:05:13 -04:00
thomwolf be916cb3fb Merge branch 'master' of https://github.com/huggingface/transformers 2019-10-15 10:37:13 +02:00
thomwolf 5875aaf762 install tensorboard 2019-10-15 10:36:46 +02:00
Thomas Wolf 40f14ff545
Merge pull request #1513 from slayton58/amp_fp16_einsum
Force einsum to run in fp16
2019-10-15 10:25:00 +02:00
Thomas Wolf e703e4dfe1
Merge pull request #1509 from julian-pani/patch-3
remove leftover usage of DUMMY_INPUTS
2019-10-15 10:24:13 +02:00
thomwolf 898ce064f8 add tests on TF2.0 & PT checkpoint => model convertion functions 2019-10-15 10:04:19 +02:00
Thomas Wolf d147671c6c
Merge pull request #1508 from tlkh/master
Added performance enhancements (XLA, AMP) to examples
2019-10-15 09:57:18 +02:00
thomwolf 2c1d5564ad add readme information 2019-10-15 09:56:52 +02:00
Thomas Wolf 08bd8f9f39
Merge pull request #1505 from e-budur/master
Fixed the sample code in the title 'Quick tour'.
2019-10-15 09:50:36 +02:00
Thomas Wolf 8aa3b753bd
Merge pull request #1434 from bryant1410/patch-1
Remove unnecessary use of FusedLayerNorm in XLNet
2019-10-15 09:44:19 +02:00
Thomas Wolf 621e7a2529
Merge pull request #1275 from stecklin/ner-fine-tuning
Implement fine-tuning BERT on CoNLL-2003 named entity recognition task
2019-10-15 09:35:24 +02:00
thomwolf c55badcee0 Add NER finetuning details by @stefan-it in example readme 2019-10-15 09:33:52 +02:00
Julien Chaumond 788e632622 [ner] Honor args.overwrite_cache 2019-10-15 09:17:31 +02:00
thomwolf 0f9ebb0b43 add seqeval as requirement for examples 2019-10-15 09:17:31 +02:00
thomwolf 66adb71734 update to transformers 2019-10-15 09:17:31 +02:00
Marianne Stecklina 5ff9cd158a Add option to predict on test set 2019-10-15 09:17:31 +02:00
Marianne Stecklina 7f5367e0b1 Add cli argument for configuring labels 2019-10-15 09:17:31 +02:00
Marianne Stecklina e1d4179b64 Make file reading more robust 2019-10-15 09:17:31 +02:00
Marianne Stecklina 383ef96747 Implement fine-tuning BERT on CoNLL-2003 named entity recognition task 2019-10-15 09:17:31 +02:00
Marianne Stecklina 5adb39e757 Add option to predict on test set 2019-10-15 09:14:53 +02:00
Marianne Stecklina 99b189df6d Add cli argument for configuring labels 2019-10-15 09:14:53 +02:00
Marianne Stecklina 3e9420add1 Make file reading more robust 2019-10-15 09:14:53 +02:00
Marianne Stecklina cde42c4354 Implement fine-tuning BERT on CoNLL-2003 named entity recognition task 2019-10-15 09:14:53 +02:00
Thomas Wolf 80889a0226
Merge pull request #1512 from louismartin/fix-roberta-convert
Fix import error in script to convert faisreq roberta checkpoints
2019-10-14 17:40:32 +02:00
Simon Layton 4e6a55751a Force einsum to fp16 2019-10-14 11:12:41 -04:00
Thomas Wolf f62f992cf7
Merge pull request #1502 from jeffxtang/master
the working example code to use BertForQuestionAnswering
2019-10-14 16:14:52 +02:00
Louis MARTIN 49cba6e543 Fix import error in script to convert faisreq roberta checkpoints 2019-10-14 01:38:57 -07:00
JulianPani 0993586758
remove usage of DUMMY_INPUTS
Hey @thomwolf  
This change da26bae61b (diff-8ddce309e88e8eb5b4d02228fd8881daL28-L29) removed the constant, but one usage of that constant remains in the code.
2019-10-14 02:09:53 +03:00
Timothy Liu 376e65a674 Added automatic mixed precision and XLA options to run_tf_glue.py 2019-10-13 13:19:06 +00:00
Timothy Liu 86f23a1944 Minor enhancements to run_tf_glue.py 2019-10-13 10:21:35 +00:00
Emrah Budur 5a8c6e771a Fixed the sample code in the title 'Quick tour'. 2019-10-12 14:17:17 +03:00
jeffxtang e76d71521c the working example code to use BertForQuestionAnswering and get an answer from a text and a question 2019-10-11 17:04:02 -07:00
VictorSanh d844db4005 Add citation bibtex 2019-10-11 16:55:42 -04:00
Lysandre a701c9b321 CTRL to tf automodels 2019-10-11 16:05:30 -04:00
Lysandre 3ddce1d74c Release: 2.1.1 2019-10-11 06:37:49 -04:00
Thomas Wolf 4428aefc63
Merge pull request #1488 from huggingface/pytorch-tpu
GLUE on TPU
2019-10-11 16:33:00 +02:00
Thomas Wolf 3b43b01872
Merge pull request #1482 from huggingface/tf2_integration_tests
Integration of TF 2.0 models with other Keras modules
2019-10-11 16:25:43 +02:00
thomwolf 4b8f3e8f32 adding citation 2019-10-11 16:18:16 +02:00
thomwolf 18a3cef7d5 no nans 2019-10-11 16:09:42 +02:00
thomwolf 1f5d9513d8 fix test 2019-10-11 15:55:01 +02:00
thomwolf 0f9fc4fbde adding option to desactivate past/memory outputs 2019-10-11 15:47:08 +02:00
Thomas Wolf 700331b5ec
Merge pull request #1492 from stefan-it/bert-german-dbmdz-models
Add new BERT models for German (cased and uncased)
2019-10-11 13:01:52 +02:00
Thomas Wolf 573dde9b44
Merge pull request #1405 from slayton58/xlnet_layer_reorder
Re-order XLNet attention head outputs for better perf
2019-10-11 12:10:58 +02:00
Stefan Schweter 5f25a5f367 model: add support for new German BERT models (cased and uncased) from @dbmdz 2019-10-11 10:20:33 +02:00
Luran He f382a8decd convert int to str before adding to a str 2019-10-10 19:20:39 -04:00
Lysandre 639f4b7190 Don't save/load when on TPU 2019-10-10 19:17:25 +00:00
Lysandre d4e7934ac3 GLUE on TPU 2019-10-10 19:03:06 +00:00
thomwolf 2a4fef837a move Circle-CI from TF2-rc0 to official TF2 2019-10-10 15:57:35 +02:00
thomwolf 751e246087 using tf.print in roberta 2019-10-10 15:47:20 +02:00