thomwolf
d6418c5ef3
tweaking the readme
2018-11-03 23:52:35 +01:00
thomwolf
3b70b270e0
update readme
2018-11-03 23:39:55 +01:00
thomwolf
eaa6db92f1
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2018-11-03 23:35:16 +01:00
thomwolf
f8276008df
update readme, file names, removing TF code, moving tests
2018-11-03 23:35:14 +01:00
Ubuntu
f18ae210e1
fix typo
2018-11-03 22:34:37 +00:00
VictorSanh
3c24e4bef1
Multi-Gpu loss - Cleaning
2018-11-03 18:03:17 -04:00
Tim Rault
5de1517d6b
WIP modeling_test_pytorch.py
2018-11-03 22:40:50 +01:00
VictorSanh
1ba5b58c20
fix typo
2018-11-03 17:10:23 -04:00
VictorSanh
5858e8e4dd
Fix both loss and eval metrics -> more coherence on the loss (eval vs train and tf vs pt)
2018-11-03 16:48:24 -04:00
VictorSanh
cd09cd5b40
Fix import on initalization
2018-11-03 15:38:30 -04:00
Tim Rault
ec66841afa
WIP
2018-11-03 19:12:20 +01:00
thomwolf
139873f6e3
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2018-11-03 19:06:17 +01:00
thomwolf
04287a4d68
special edition script
2018-11-03 19:06:15 +01:00
VictorSanh
a1af5247e1
Add seed in initialization
2018-11-03 14:00:36 -04:00
Ubuntu
4faeb38b51
Fix loss loss logging for multi-gpu compatibility
2018-11-03 17:52:51 +00:00
thomwolf
25f73add07
update optimizer run_squad
2018-11-03 17:56:34 +01:00
thomwolf
f514cbbf30
update run_squad with tqdm
2018-11-03 17:52:44 +01:00
thomwolf
cb76c1ddd3
add model.zero_grad()
2018-11-03 17:40:12 +01:00
thomwolf
a4086c5de5
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2018-11-03 17:38:17 +01:00
thomwolf
088ad45888
fixing optimization
2018-11-03 17:38:15 +01:00
VictorSanh
8bd6b235b7
typo on tokenization
2018-11-03 10:27:59 -04:00
VictorSanh
2c55568c40
`scatter_` and `scatter`
2018-11-03 10:27:38 -04:00
VictorSanh
a6efe1235f
Merge pull request #1 from huggingface/multi-gpu-support
...
Create DataParallel model if several GPUs
2018-11-03 10:10:34 -04:00
VictorSanh
5f432480c0
Create DataParallel model if several GPUs
2018-11-03 10:10:01 -04:00
VictorSanh
5889765a7c
Update README.md
2018-11-03 09:18:44 -04:00
VictorSanh
8c932e37f9
Update the comparison notebook
2018-11-03 09:08:05 -04:00
VictorSanh
391a4ec2f3
Small typo in `trange`
...
I seriously don't understand why they defined num_train_epochs as a float in the originial tf code.
I Will change it at the end to avoir merge conflicts for now.
2018-11-03 08:25:15 -04:00
VictorSanh
5676d6f799
Remove BERT pretraining files for now
2018-11-03 08:17:22 -04:00
Tim Rault
8ec457d370
Fix imports
2018-11-03 12:35:21 +01:00
thomwolf
852e4b3c00
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2018-11-03 12:23:04 +01:00
thomwolf
0d8d2285ba
fix optimization_test
2018-11-03 12:23:00 +01:00
Tim Rault
574e20a92e
Move modeling_test.py to /tensorflow_code
2018-11-03 12:22:06 +01:00
thomwolf
45efc9d807
removing f-string
2018-11-03 11:46:18 +01:00
thomwolf
4df6027453
clean up
2018-11-03 11:33:10 +01:00
thomwolf
e6d106a01b
comment on gelu function
2018-11-03 10:47:10 +01:00
thomwolf
01b1a05343
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2018-11-03 03:11:15 +01:00
thomwolf
8aa22af0c5
fixing model
2018-11-03 03:11:13 +01:00
VictorSanh
72ab10399f
Fix loss
...
Please review @thomwolf but i think this is equivqlent (and it mimics the loss computation of the original loss)
2018-11-02 18:06:21 -04:00
VictorSanh
25d5ca48e0
Fix scatter LopngTensor
2018-11-02 17:57:46 -04:00
VictorSanh
e6a710f684
device
2018-11-02 17:54:22 -04:00
VictorSanh
3ebf1a13c9
Fix loss computation for indexes bigger than max_seq_length.
2018-11-02 17:49:35 -04:00
Tim Rault
629bd006bf
Convert optimization_test.py to PyTorch
2018-11-02 17:50:17 +01:00
VictorSanh
38f740a1d5
Fix bug writing predictions in run_squad_pytorch
2018-11-02 11:29:45 -04:00
VictorSanh
ee29871f8d
Debug run_squad_pytorch
2018-11-02 11:07:32 -04:00
VictorSanh
101eabff90
Debug run_squad_pytorch
2018-11-02 10:44:08 -04:00
VictorSanh
bb0a510330
Print for debug run_squad
2018-11-02 10:16:07 -04:00
thomwolf
c84315ec35
model fixes + ipnb fixes
2018-11-02 15:11:16 +01:00
Tim Rault
3ff2ec5eb3
Move command-line argparse arguments into main() function
2018-11-02 14:42:05 +01:00
thomwolf
c9690e57f8
adding jupyter, updating extract features adding simple test file
2018-11-02 14:25:21 +01:00
VictorSanh
844b2f0e6f
Small update Readme
2018-11-02 08:57:15 -04:00