thomwolf
|
d940eeda54
|
typo
|
2018-11-12 15:26:46 +01:00 |
thomwolf
|
1cf0a16c67
|
cleaning up readme
|
2018-11-12 15:24:47 +01:00 |
thomwolf
|
66b0090877
|
add fp16 training
|
2018-11-12 15:15:02 +01:00 |
Thomas Wolf
|
5dfd19060a
|
fix typo in readme
|
2018-11-12 12:39:57 +01:00 |
Thomas Wolf
|
fa1aa81f26
|
fix typo in readme bach examples
|
2018-11-12 08:37:43 +01:00 |
Thomas Wolf
|
6d6b916f48
|
update to BERT-large results
|
2018-11-11 17:00:49 +01:00 |
Thomas Wolf
|
c4bfc646f5
|
Add results of fine-tuning BERT-large on GPUs
|
2018-11-11 16:59:35 +01:00 |
Thomas Wolf
|
48930a4cff
|
Merge pull request #2 from elyase/patch-1
Port tokenization for the multilingual model
|
2018-11-10 22:27:45 +01:00 |
thomwolf
|
a81a1ef8e9
|
fixing learning rate schedule when using gradient_accumulation_steps
|
2018-11-10 16:11:14 +01:00 |
thomwolf
|
ea85cca8ab
|
adding optimize_on_cpu explanation in readme
|
2018-11-09 11:42:37 +01:00 |
thomwolf
|
5f04aa00ed
|
option to perform optimization and keep the optimizer averages on CPU
|
2018-11-09 11:28:14 +01:00 |
thomwolf
|
9e95cd8cd6
|
clean up optimizer from unused functions
|
2018-11-09 11:23:55 +01:00 |
thomwolf
|
34a1a01091
|
update code comment
|
2018-11-09 09:31:20 +01:00 |
thomwolf
|
34bdc8b54f
|
remove duplicate accumulate gradient step arguments
|
2018-11-09 09:19:45 +01:00 |
Thomas Wolf
|
0c24db9d5f
|
update results for SQuAD
|
2018-11-09 09:11:59 +01:00 |
thomwolf
|
2c5d993ba4
|
update readme - fix SQuAD model on multi-GPU
|
2018-11-08 21:22:22 +01:00 |
Gopal Krishna
|
4850ec5888
|
fixed small typos in the README.md (#8)
|
2018-11-08 15:00:02 -05:00 |
Thomas Wolf
|
3bfbc21376
|
updating pytest command
|
2018-11-08 00:44:17 +01:00 |
Thomas Wolf
|
0ed7696191
|
Updated MRPC results
|
2018-11-08 00:39:42 +01:00 |
thomwolf
|
48d4a5317c
|
typo fix in output tuple
|
2018-11-07 23:51:12 +01:00 |
Thomas Wolf
|
d92a7f7721
|
Removing note on run_squad.py example
|
2018-11-07 23:37:55 +01:00 |
Thomas Wolf
|
5c0838d846
|
Merge pull request #7 from huggingface/develop
Develop
|
2018-11-07 23:36:46 +01:00 |
Thomas Wolf
|
efeb6b1a0d
|
Merge branch 'master' into develop
|
2018-11-07 23:35:42 +01:00 |
thomwolf
|
dbc318a4c6
|
cleaning up - speeding up a bit multi-gpu
|
2018-11-07 22:22:55 +01:00 |
thomwolf
|
6bb7510a50
|
fixing pre-processing bug - averaging loss for gradient accumulation - no_grad on evaluation
|
2018-11-07 22:12:41 +01:00 |
lukovnikov
|
bd91ae654f
|
moved bert to qelos-util
|
2018-11-06 18:21:44 +01:00 |
lukovnikov
|
4e52188433
|
bert weight loading from tf
|
2018-11-06 17:47:03 +01:00 |
thomwolf
|
a1126237a9
|
clean up logits extraction logic
|
2018-11-06 17:31:15 +01:00 |
thomwolf
|
2a97fe220b
|
fixing weights initialization in the model and out of span clamping
|
2018-11-06 17:26:33 +01:00 |
thomwolf
|
907d3569c1
|
cleaning up SQuAD notebook - more explanation - fixing error
|
2018-11-06 11:13:43 +01:00 |
Thomas Wolf
|
1a5bbd83dc
|
Updating run_squad information in readme
|
2018-11-06 08:53:01 +01:00 |
Thomas Wolf
|
79e1b95e75
|
fix link in readme
|
2018-11-06 08:38:02 +01:00 |
Knut Ole Sjøli
|
886f595c37
|
Fix typo in subheader (#4)
|
2018-11-05 18:34:18 -05:00 |
Yaser Martinez Palenzuela
|
4d124baf8f
|
Add test for Chinese tokenization
|
2018-11-05 23:04:29 +01:00 |
Thomas Wolf
|
59d4cc5f2b
|
typos
|
2018-11-05 22:47:24 +01:00 |
Yaser Martinez Palenzuela
|
0ce2f496dc
|
Port tokenization for the multilingual model
|
2018-11-05 22:34:12 +01:00 |
Thomas Wolf
|
d983eecdd3
|
more readme typo fixes
|
2018-11-05 21:29:04 +01:00 |
Thomas Wolf
|
8f91b4de91
|
more typo fixes
|
2018-11-05 21:24:14 +01:00 |
Thomas Wolf
|
7316b0d6d0
|
fix typo
|
2018-11-05 21:22:45 +01:00 |
Clement
|
d130cb5139
|
typos
|
2018-11-05 15:09:24 -05:00 |
Clement
|
2a8fee495b
|
typos
|
2018-11-05 15:04:06 -05:00 |
Clement
|
f968b11657
|
typo
|
2018-11-05 14:59:44 -05:00 |
thomwolf
|
88e793f31a
|
fix typos
|
2018-11-05 16:14:19 +01:00 |
thomwolf
|
3914eed505
|
update readme
|
2018-11-05 16:09:27 +01:00 |
thomwolf
|
bab5d13077
|
update optimizer documentation
|
2018-11-05 16:09:21 +01:00 |
thomwolf
|
7394eb47a5
|
update readme
|
2018-11-05 15:35:44 +01:00 |
thomwolf
|
e6646751ac
|
update notebooks
|
2018-11-05 15:02:50 +01:00 |
thomwolf
|
b705c9eff5
|
remove small script, moved notebooks to notebook folder
|
2018-11-05 14:55:08 +01:00 |
thomwolf
|
3a301d443b
|
update gitignore
|
2018-11-05 14:53:43 +01:00 |
thomwolf
|
711d3f9f2b
|
remove tensorflow_code
|
2018-11-05 14:53:03 +01:00 |