leo-du
|
ecd15667f3
|
fix repetition penalty
|
2019-10-17 14:47:14 -04:00 |
Lysandre Debut
|
c544194611
|
Remove `special_tokens_mask` from inputs in README
Co-authored-by: Thomas Wolf @thomwolf
|
2019-10-16 11:05:13 -04:00 |
thomwolf
|
be916cb3fb
|
Merge branch 'master' of https://github.com/huggingface/transformers
|
2019-10-15 10:37:13 +02:00 |
thomwolf
|
5875aaf762
|
install tensorboard
|
2019-10-15 10:36:46 +02:00 |
Thomas Wolf
|
40f14ff545
|
Merge pull request #1513 from slayton58/amp_fp16_einsum
Force einsum to run in fp16
|
2019-10-15 10:25:00 +02:00 |
Thomas Wolf
|
e703e4dfe1
|
Merge pull request #1509 from julian-pani/patch-3
remove leftover usage of DUMMY_INPUTS
|
2019-10-15 10:24:13 +02:00 |
thomwolf
|
898ce064f8
|
add tests on TF2.0 & PT checkpoint => model convertion functions
|
2019-10-15 10:04:19 +02:00 |
Thomas Wolf
|
d147671c6c
|
Merge pull request #1508 from tlkh/master
Added performance enhancements (XLA, AMP) to examples
|
2019-10-15 09:57:18 +02:00 |
thomwolf
|
2c1d5564ad
|
add readme information
|
2019-10-15 09:56:52 +02:00 |
Thomas Wolf
|
08bd8f9f39
|
Merge pull request #1505 from e-budur/master
Fixed the sample code in the title 'Quick tour'.
|
2019-10-15 09:50:36 +02:00 |
Thomas Wolf
|
8aa3b753bd
|
Merge pull request #1434 from bryant1410/patch-1
Remove unnecessary use of FusedLayerNorm in XLNet
|
2019-10-15 09:44:19 +02:00 |
Thomas Wolf
|
621e7a2529
|
Merge pull request #1275 from stecklin/ner-fine-tuning
Implement fine-tuning BERT on CoNLL-2003 named entity recognition task
|
2019-10-15 09:35:24 +02:00 |
thomwolf
|
c55badcee0
|
Add NER finetuning details by @stefan-it in example readme
|
2019-10-15 09:33:52 +02:00 |
Julien Chaumond
|
788e632622
|
[ner] Honor args.overwrite_cache
|
2019-10-15 09:17:31 +02:00 |
thomwolf
|
0f9ebb0b43
|
add seqeval as requirement for examples
|
2019-10-15 09:17:31 +02:00 |
thomwolf
|
66adb71734
|
update to transformers
|
2019-10-15 09:17:31 +02:00 |
Marianne Stecklina
|
5ff9cd158a
|
Add option to predict on test set
|
2019-10-15 09:17:31 +02:00 |
Marianne Stecklina
|
7f5367e0b1
|
Add cli argument for configuring labels
|
2019-10-15 09:17:31 +02:00 |
Marianne Stecklina
|
e1d4179b64
|
Make file reading more robust
|
2019-10-15 09:17:31 +02:00 |
Marianne Stecklina
|
383ef96747
|
Implement fine-tuning BERT on CoNLL-2003 named entity recognition task
|
2019-10-15 09:17:31 +02:00 |
Marianne Stecklina
|
5adb39e757
|
Add option to predict on test set
|
2019-10-15 09:14:53 +02:00 |
Marianne Stecklina
|
99b189df6d
|
Add cli argument for configuring labels
|
2019-10-15 09:14:53 +02:00 |
Marianne Stecklina
|
3e9420add1
|
Make file reading more robust
|
2019-10-15 09:14:53 +02:00 |
Marianne Stecklina
|
cde42c4354
|
Implement fine-tuning BERT on CoNLL-2003 named entity recognition task
|
2019-10-15 09:14:53 +02:00 |
Thomas Wolf
|
80889a0226
|
Merge pull request #1512 from louismartin/fix-roberta-convert
Fix import error in script to convert faisreq roberta checkpoints
|
2019-10-14 17:40:32 +02:00 |
Simon Layton
|
4e6a55751a
|
Force einsum to fp16
|
2019-10-14 11:12:41 -04:00 |
Thomas Wolf
|
f62f992cf7
|
Merge pull request #1502 from jeffxtang/master
the working example code to use BertForQuestionAnswering
|
2019-10-14 16:14:52 +02:00 |
Louis MARTIN
|
49cba6e543
|
Fix import error in script to convert faisreq roberta checkpoints
|
2019-10-14 01:38:57 -07:00 |
JulianPani
|
0993586758
|
remove usage of DUMMY_INPUTS
Hey @thomwolf
This change da26bae61b (diff-8ddce309e88e8eb5b4d02228fd8881daL28-L29) removed the constant, but one usage of that constant remains in the code.
|
2019-10-14 02:09:53 +03:00 |
Timothy Liu
|
376e65a674
|
Added automatic mixed precision and XLA options to run_tf_glue.py
|
2019-10-13 13:19:06 +00:00 |
Timothy Liu
|
86f23a1944
|
Minor enhancements to run_tf_glue.py
|
2019-10-13 10:21:35 +00:00 |
Emrah Budur
|
5a8c6e771a
|
Fixed the sample code in the title 'Quick tour'.
|
2019-10-12 14:17:17 +03:00 |
jeffxtang
|
e76d71521c
|
the working example code to use BertForQuestionAnswering and get an answer from a text and a question
|
2019-10-11 17:04:02 -07:00 |
VictorSanh
|
d844db4005
|
Add citation bibtex
|
2019-10-11 16:55:42 -04:00 |
Lysandre
|
a701c9b321
|
CTRL to tf automodels
|
2019-10-11 16:05:30 -04:00 |
Lysandre
|
3ddce1d74c
|
Release: 2.1.1
|
2019-10-11 06:37:49 -04:00 |
Thomas Wolf
|
4428aefc63
|
Merge pull request #1488 from huggingface/pytorch-tpu
GLUE on TPU
|
2019-10-11 16:33:00 +02:00 |
Thomas Wolf
|
3b43b01872
|
Merge pull request #1482 from huggingface/tf2_integration_tests
Integration of TF 2.0 models with other Keras modules
|
2019-10-11 16:25:43 +02:00 |
thomwolf
|
4b8f3e8f32
|
adding citation
|
2019-10-11 16:18:16 +02:00 |
thomwolf
|
18a3cef7d5
|
no nans
|
2019-10-11 16:09:42 +02:00 |
thomwolf
|
1f5d9513d8
|
fix test
|
2019-10-11 15:55:01 +02:00 |
thomwolf
|
0f9fc4fbde
|
adding option to desactivate past/memory outputs
|
2019-10-11 15:47:08 +02:00 |
Thomas Wolf
|
700331b5ec
|
Merge pull request #1492 from stefan-it/bert-german-dbmdz-models
Add new BERT models for German (cased and uncased)
|
2019-10-11 13:01:52 +02:00 |
Thomas Wolf
|
573dde9b44
|
Merge pull request #1405 from slayton58/xlnet_layer_reorder
Re-order XLNet attention head outputs for better perf
|
2019-10-11 12:10:58 +02:00 |
Stefan Schweter
|
5f25a5f367
|
model: add support for new German BERT models (cased and uncased) from @dbmdz
|
2019-10-11 10:20:33 +02:00 |
Luran He
|
f382a8decd
|
convert int to str before adding to a str
|
2019-10-10 19:20:39 -04:00 |
Lysandre
|
639f4b7190
|
Don't save/load when on TPU
|
2019-10-10 19:17:25 +00:00 |
Lysandre
|
d4e7934ac3
|
GLUE on TPU
|
2019-10-10 19:03:06 +00:00 |
thomwolf
|
2a4fef837a
|
move Circle-CI from TF2-rc0 to official TF2
|
2019-10-10 15:57:35 +02:00 |
thomwolf
|
751e246087
|
using tf.print in roberta
|
2019-10-10 15:47:20 +02:00 |