Commit Graph

15000 Commits

Author SHA1 Message Date
thomwolf d6418c5ef3 tweaking the readme 2018-11-03 23:52:35 +01:00
thomwolf 3b70b270e0 update readme 2018-11-03 23:39:55 +01:00
thomwolf eaa6db92f1 Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT 2018-11-03 23:35:16 +01:00
thomwolf f8276008df update readme, file names, removing TF code, moving tests 2018-11-03 23:35:14 +01:00
Ubuntu f18ae210e1 fix typo 2018-11-03 22:34:37 +00:00
VictorSanh 3c24e4bef1 Multi-Gpu loss - Cleaning 2018-11-03 18:03:17 -04:00
Tim Rault 5de1517d6b WIP modeling_test_pytorch.py 2018-11-03 22:40:50 +01:00
VictorSanh 1ba5b58c20 fix typo 2018-11-03 17:10:23 -04:00
VictorSanh 5858e8e4dd Fix both loss and eval metrics -> more coherence on the loss (eval vs train and tf vs pt) 2018-11-03 16:48:24 -04:00
VictorSanh cd09cd5b40 Fix import on initalization 2018-11-03 15:38:30 -04:00
Tim Rault ec66841afa WIP 2018-11-03 19:12:20 +01:00
thomwolf 139873f6e3 Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT 2018-11-03 19:06:17 +01:00
thomwolf 04287a4d68 special edition script 2018-11-03 19:06:15 +01:00
VictorSanh a1af5247e1 Add seed in initialization 2018-11-03 14:00:36 -04:00
Ubuntu 4faeb38b51 Fix loss loss logging for multi-gpu compatibility 2018-11-03 17:52:51 +00:00
thomwolf 25f73add07 update optimizer run_squad 2018-11-03 17:56:34 +01:00
thomwolf f514cbbf30 update run_squad with tqdm 2018-11-03 17:52:44 +01:00
thomwolf cb76c1ddd3 add model.zero_grad() 2018-11-03 17:40:12 +01:00
thomwolf a4086c5de5 Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT 2018-11-03 17:38:17 +01:00
thomwolf 088ad45888 fixing optimization 2018-11-03 17:38:15 +01:00
VictorSanh 8bd6b235b7 typo on tokenization 2018-11-03 10:27:59 -04:00
VictorSanh 2c55568c40 `scatter_` and `scatter` 2018-11-03 10:27:38 -04:00
VictorSanh a6efe1235f
Merge pull request #1 from huggingface/multi-gpu-support
Create DataParallel model if several GPUs
2018-11-03 10:10:34 -04:00
VictorSanh 5f432480c0 Create DataParallel model if several GPUs 2018-11-03 10:10:01 -04:00
VictorSanh 5889765a7c Update README.md 2018-11-03 09:18:44 -04:00
VictorSanh 8c932e37f9 Update the comparison notebook 2018-11-03 09:08:05 -04:00
VictorSanh 391a4ec2f3 Small typo in `trange`
I seriously don't understand why they defined num_train_epochs as a float in the originial tf code.
I Will change it at the end to avoir merge conflicts for now.
2018-11-03 08:25:15 -04:00
VictorSanh 5676d6f799 Remove BERT pretraining files for now 2018-11-03 08:17:22 -04:00
Tim Rault 8ec457d370 Fix imports 2018-11-03 12:35:21 +01:00
thomwolf 852e4b3c00 Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT 2018-11-03 12:23:04 +01:00
thomwolf 0d8d2285ba fix optimization_test 2018-11-03 12:23:00 +01:00
Tim Rault 574e20a92e Move modeling_test.py to /tensorflow_code 2018-11-03 12:22:06 +01:00
thomwolf 45efc9d807 removing f-string 2018-11-03 11:46:18 +01:00
thomwolf 4df6027453 clean up 2018-11-03 11:33:10 +01:00
thomwolf e6d106a01b comment on gelu function 2018-11-03 10:47:10 +01:00
thomwolf 01b1a05343 Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT 2018-11-03 03:11:15 +01:00
thomwolf 8aa22af0c5 fixing model 2018-11-03 03:11:13 +01:00
VictorSanh 72ab10399f Fix loss
Please review @thomwolf but i think this is equivqlent (and it mimics the loss computation of the original loss)
2018-11-02 18:06:21 -04:00
VictorSanh 25d5ca48e0 Fix scatter LopngTensor 2018-11-02 17:57:46 -04:00
VictorSanh e6a710f684 device 2018-11-02 17:54:22 -04:00
VictorSanh 3ebf1a13c9 Fix loss computation for indexes bigger than max_seq_length. 2018-11-02 17:49:35 -04:00
Tim Rault 629bd006bf Convert optimization_test.py to PyTorch 2018-11-02 17:50:17 +01:00
VictorSanh 38f740a1d5 Fix bug writing predictions in run_squad_pytorch 2018-11-02 11:29:45 -04:00
VictorSanh ee29871f8d Debug run_squad_pytorch 2018-11-02 11:07:32 -04:00
VictorSanh 101eabff90 Debug run_squad_pytorch 2018-11-02 10:44:08 -04:00
VictorSanh bb0a510330 Print for debug run_squad 2018-11-02 10:16:07 -04:00
thomwolf c84315ec35 model fixes + ipnb fixes 2018-11-02 15:11:16 +01:00
Tim Rault 3ff2ec5eb3 Move command-line argparse arguments into main() function 2018-11-02 14:42:05 +01:00
thomwolf c9690e57f8 adding jupyter, updating extract features adding simple test file 2018-11-02 14:25:21 +01:00
VictorSanh 844b2f0e6f Small update Readme 2018-11-02 08:57:15 -04:00