Commit Graph

118 Commits

Author SHA1 Message Date
thomwolf 45efc9d807 removing f-string 2018-11-03 11:46:18 +01:00
thomwolf 4df6027453 clean up 2018-11-03 11:33:10 +01:00
thomwolf e6d106a01b comment on gelu function 2018-11-03 10:47:10 +01:00
thomwolf 01b1a05343 Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT 2018-11-03 03:11:15 +01:00
thomwolf 8aa22af0c5 fixing model 2018-11-03 03:11:13 +01:00
VictorSanh 72ab10399f Fix loss
Please review @thomwolf but i think this is equivqlent (and it mimics the loss computation of the original loss)
2018-11-02 18:06:21 -04:00
VictorSanh 25d5ca48e0 Fix scatter LopngTensor 2018-11-02 17:57:46 -04:00
VictorSanh e6a710f684 device 2018-11-02 17:54:22 -04:00
VictorSanh 3ebf1a13c9 Fix loss computation for indexes bigger than max_seq_length. 2018-11-02 17:49:35 -04:00
Tim Rault 629bd006bf Convert optimization_test.py to PyTorch 2018-11-02 17:50:17 +01:00
VictorSanh 38f740a1d5 Fix bug writing predictions in run_squad_pytorch 2018-11-02 11:29:45 -04:00
VictorSanh ee29871f8d Debug run_squad_pytorch 2018-11-02 11:07:32 -04:00
VictorSanh 101eabff90 Debug run_squad_pytorch 2018-11-02 10:44:08 -04:00
VictorSanh bb0a510330 Print for debug run_squad 2018-11-02 10:16:07 -04:00
thomwolf c84315ec35 model fixes + ipnb fixes 2018-11-02 15:11:16 +01:00
Tim Rault 3ff2ec5eb3 Move command-line argparse arguments into main() function 2018-11-02 14:42:05 +01:00
thomwolf c9690e57f8 adding jupyter, updating extract features adding simple test file 2018-11-02 14:25:21 +01:00
VictorSanh 844b2f0e6f Small update Readme 2018-11-02 08:57:15 -04:00
Tim Rault 49006d0662 Bug fix type=bool -> action='store_true' in argparse 2018-11-02 10:04:41 +01:00
VictorSanh beb59080b3 Fix size compatibility for model.forward
Error was coming from "modeling_pytorch.py", line 484, in forward: start_loss = loss_fct(start_logits, start_positions) --> ValueError: Expected target size (12, 1), got torch.Size([12])
2018-11-02 04:36:42 -04:00
VictorSanh 8cbe7d6af8 FIX errors in loading eval Dataset in `run_squad_pytorch` 2018-11-02 04:09:53 -04:00
VictorSanh 833c3a7a25 FIX errors in loading Dataset in `run_squad_pytorch` 2018-11-02 04:00:00 -04:00
VictorSanh 72d69a4ef4 Update README 2018-11-02 03:37:39 -04:00
VictorSanh 62ac7e9a60 Fix small bug in `run_squad_pytorch.py` 2018-11-02 03:32:35 -04:00
VictorSanh 98b9771dfe Quick fix metrics evaluation on run_classif_pytorch 2018-11-02 03:02:06 -04:00
VictorSanh bf65d4dbb7 Begin Updating the README.md 2018-11-02 02:51:07 -04:00
VictorSanh b54de837c2 Quick fix on eval accuracy 2018-11-02 02:46:17 -04:00
VictorSanh 1d53f9cb72 Fix cuda compability for evaluation 2018-11-02 01:39:15 -04:00
VictorSanh 1d8511f8f2 FIX small bugs in `run_classifier_pytorch.py` 2018-11-02 01:12:52 -04:00
VictorSanh 936eb4c3ad FIX small bugs in `run_classifier_pytorch.py` 2018-11-02 01:11:25 -04:00
VictorSanh cc228089ef Small fix to ensure Python3 compativility. 2018-11-02 00:02:36 -04:00
thomwolf ebfffa0ab2 updated extract_features 2018-11-02 04:32:18 +01:00
thomwolf 9af479b3b9 conversion run_squad ok 2018-11-02 04:12:20 +01:00
thomwolf 8e81e5e6ff working on squad 2018-11-02 04:07:52 +01:00
thomwolf e61db0d1c0 run_squad WIP 2018-11-02 03:56:14 +01:00
thomwolf c0065af6cb implemented BertForQuestionAnswering 2018-11-02 03:04:34 +01:00
thomwolf 5383fca458 update name 2018-11-02 01:56:25 +01:00
thomwolf dee09a40b2 various fixes 2018-11-02 01:52:54 +01:00
thomwolf 2c731fd129 small tweaks 2018-11-02 01:38:22 +01:00
thomwolf 9343a2311b model training loop working – still have to check that everything is exactly same 2018-11-02 01:31:31 +01:00
thomwolf f690f0e167 run_classifier WIP + added classifier head and initialization to the model 2018-11-02 00:27:50 +01:00
thomwolf 4a0b59e980 run_classifier WIP 2018-11-01 21:05:04 +01:00
VictorSanh 7af7f8173b Fix oubli 2018-11-01 15:44:52 -04:00
thomwolf e1bb7904d8 optimization_pytorch from OpenAI with adapted HP - check if there is weights decay on bias 2018-11-01 20:11:14 +01:00
VictorSanh d3a8df6b9f typos in `input_fn_builder` 2018-11-01 14:17:55 -04:00
VictorSanh 836faed985 wip 2018-11-01 14:17:12 -04:00
thomwolf 960ef4df3b probably ok weights convertion script 2018-11-01 19:12:31 +01:00
thomwolf ab0e8932a8 convertion script WIP 2018-11-01 18:00:20 +01:00
Tim Rault 5581edb4f6 Merge remote-tracking branch 'origin/master' 2018-11-01 17:55:40 +01:00
Tim Rault cdd39d216e Remove TensorFlow dependence in `tokenization_test_pytorch.py` 2018-11-01 17:55:30 +01:00