Thomas Wolf
|
6f4be31d0d
|
Merge pull request #40 from joelgrus/patch-1
update pip package name
|
2018-11-19 20:54:46 +01:00 |
Joel Grus
|
dd56cfd89a
|
update pip package name
|
2018-11-19 09:50:34 -08:00 |
Thomas Wolf
|
956c917344
|
fix typos in readme
|
2018-11-17 23:25:23 +01:00 |
thomwolf
|
27ee0fff3c
|
add no_cuda args in extract_features
|
2018-11-17 23:04:44 +01:00 |
thomwolf
|
aa50fd196f
|
remove unused arguments in example scripts
|
2018-11-17 23:01:05 +01:00 |
Thomas Wolf
|
7c91e51c26
|
update links in readme
|
2018-11-17 22:54:15 +01:00 |
Thomas Wolf
|
e113101702
|
fix typos in readme
|
2018-11-17 12:36:35 +01:00 |
Thomas Wolf
|
4132a028af
|
Merge pull request #29 from huggingface/first-release
First release
|
2018-11-17 12:21:48 +01:00 |
thomwolf
|
47a7d4ec14
|
update examples from master
|
2018-11-17 12:21:35 +01:00 |
thomwolf
|
c8cba67742
|
clean up readme and examples
|
2018-11-17 12:19:16 +01:00 |
thomwolf
|
757750d6f6
|
fix tests
|
2018-11-17 11:58:14 +01:00 |
thomwolf
|
a99b971738
|
bump up version minor
|
2018-11-17 10:43:39 +01:00 |
thomwolf
|
4e46affc34
|
updating examples
|
2018-11-17 10:30:54 +01:00 |
thomwolf
|
d0673c7dbd
|
fix links
|
2018-11-17 08:59:29 +01:00 |
thomwolf
|
68b937aa40
|
sub section overviews
|
2018-11-17 08:55:56 +01:00 |
thomwolf
|
c54d8b1847
|
fixing links in readme
|
2018-11-17 08:46:17 +01:00 |
thomwolf
|
f920eff8c3
|
update readme
|
2018-11-17 08:42:45 +01:00 |
thomwolf
|
886cb49792
|
updating readme and notebooks
|
2018-11-16 14:31:15 +01:00 |
thomwolf
|
fd647e8c87
|
comparison masked LM ok
|
2018-11-16 11:04:31 +01:00 |
thomwolf
|
02173a1a0a
|
fixing error in isnan test for optimizer_on_cpu & fp16
|
2018-11-15 21:49:12 +01:00 |
thomwolf
|
cba85a67b9
|
fix nan in optimizer_on_cpu
|
2018-11-15 21:47:41 +01:00 |
thomwolf
|
1de35b624b
|
preparing for first release
|
2018-11-15 20:56:10 +01:00 |
Thomas Wolf
|
8513741b57
|
Merge pull request #17 from lukovnikov/master
activation function in BERTIntermediate
|
2018-11-13 17:00:09 +01:00 |
lukovnikov
|
470076e419
|
Merge remote-tracking branch 'origin/master'
|
2018-11-13 16:49:26 +01:00 |
lukovnikov
|
9f3cd27187
|
clean up pr
|
2018-11-13 16:48:59 +01:00 |
Denis
|
3d4c7a6f5d
|
Delete __init__.py
|
2018-11-13 16:48:43 +01:00 |
lukovnikov
|
d64db6dfb9
|
clean up pr
|
2018-11-13 16:41:01 +01:00 |
lukovnikov
|
7ba83730c4
|
clean up pr
|
2018-11-13 16:31:20 +01:00 |
lukovnikov
|
fa0c5a2ea1
|
clean up pr
|
2018-11-13 16:24:53 +01:00 |
lukovnikov
|
f4d79f44c9
|
Merge remote-tracking branch 'upstream/master'
|
2018-11-13 16:22:23 +01:00 |
Thomas Wolf
|
5cd8d7ad27
|
Merge pull request #16 from donatasrep/master
Excluding AdamWeightDecayOptimizer internal variables from restoring
|
2018-11-13 16:19:28 +01:00 |
Donatas Repecka
|
20d07b3a7f
|
Excluding AdamWeightDecayOptimizer internal variables from restoring
|
2018-11-13 16:56:25 +02:00 |
Thomas Wolf
|
278fd28a32
|
added results for 16-bit fine-tuning in readme
|
2018-11-13 09:34:49 +01:00 |
thomwolf
|
d940eeda54
|
typo
|
2018-11-12 15:26:46 +01:00 |
thomwolf
|
1cf0a16c67
|
cleaning up readme
|
2018-11-12 15:24:47 +01:00 |
thomwolf
|
66b0090877
|
add fp16 training
|
2018-11-12 15:15:02 +01:00 |
Thomas Wolf
|
5dfd19060a
|
fix typo in readme
|
2018-11-12 12:39:57 +01:00 |
Thomas Wolf
|
fa1aa81f26
|
fix typo in readme bach examples
|
2018-11-12 08:37:43 +01:00 |
Thomas Wolf
|
6d6b916f48
|
update to BERT-large results
|
2018-11-11 17:00:49 +01:00 |
Thomas Wolf
|
c4bfc646f5
|
Add results of fine-tuning BERT-large on GPUs
|
2018-11-11 16:59:35 +01:00 |
Thomas Wolf
|
48930a4cff
|
Merge pull request #2 from elyase/patch-1
Port tokenization for the multilingual model
|
2018-11-10 22:27:45 +01:00 |
thomwolf
|
a81a1ef8e9
|
fixing learning rate schedule when using gradient_accumulation_steps
|
2018-11-10 16:11:14 +01:00 |
thomwolf
|
ea85cca8ab
|
adding optimize_on_cpu explanation in readme
|
2018-11-09 11:42:37 +01:00 |
thomwolf
|
5f04aa00ed
|
option to perform optimization and keep the optimizer averages on CPU
|
2018-11-09 11:28:14 +01:00 |
thomwolf
|
9e95cd8cd6
|
clean up optimizer from unused functions
|
2018-11-09 11:23:55 +01:00 |
thomwolf
|
34a1a01091
|
update code comment
|
2018-11-09 09:31:20 +01:00 |
thomwolf
|
34bdc8b54f
|
remove duplicate accumulate gradient step arguments
|
2018-11-09 09:19:45 +01:00 |
Thomas Wolf
|
0c24db9d5f
|
update results for SQuAD
|
2018-11-09 09:11:59 +01:00 |
thomwolf
|
2c5d993ba4
|
update readme - fix SQuAD model on multi-GPU
|
2018-11-08 21:22:22 +01:00 |
Gopal Krishna
|
4850ec5888
|
fixed small typos in the README.md (#8)
|
2018-11-08 15:00:02 -05:00 |