Commit Graph

1798 Commits

Author SHA1 Message Date
thomwolf dcddf498c8 fix bert layernorm 2019-09-12 16:46:32 +02:00
thomwolf d3a3a0353c clean up cache after conversion 2019-09-12 16:42:52 +02:00
thomwolf a84adddd1b convert all models 2019-09-12 13:14:07 +02:00
VictorSanh 32e1332acf [distil] fix once for all general logger for scripts 2019-09-11 14:19:07 +00:00
Thomas Wolf b62abe87c9
Merge pull request #1249 from ziliwang/master
fixed: hard coding for max and min number will out of range in fp16, which will cause nan.
2019-09-11 15:53:28 +02:00
thomwolf 969d3ae95e XLMWithLMHead fixed - standardize conversion 2019-09-11 15:47:33 +02:00
thomwolf 646711e1e2 standardize scopes names - add conversion methods 2019-09-11 15:34:17 +02:00
thomwolf 4356f791a2 XLM passing tests 2019-09-11 11:49:54 +02:00
LysandreJik 11ac4b9555 [CI] Symbolic link for documentation 2019-09-11 10:13:44 +02:00
Zili Wang 8bdee1cb73 fixed: hard coding for max and min number will out of range in fp16, which will cause nan. 2019-09-11 15:41:53 +08:00
ziliwang 7424b2848f
Merge pull request #1 from huggingface/master
merege from original repo
2019-09-11 11:02:23 +08:00
VictorSanh 364920e216 fix small bug/typo 2019-09-10 21:45:01 +00:00
Thomas Wolf 23c23f5399
Merge pull request #1229 from SKRohit/master
changes in evaluate function in run_lm_finetuning.py
2019-09-10 22:16:45 +02:00
Thomas Wolf 99a54ac51c
Merge pull request #1233 from searchivarius/master
Fix to prevent crashing on assert len(tokens_b)>=1
2019-09-10 22:15:47 +02:00
Thomas Wolf 439b37b474
Merge pull request #1241 from mattolson93/patch-1
Fixing typo in gpt2 for doc site's class link
2019-09-10 22:14:18 +02:00
mattolson93 f2cf6ce4a9
Fixing typo in gpt2 for doc site's class link 2019-09-10 09:12:01 -07:00
thomwolf 465870c33f Xlnet working - also added simple question answering model for XLNet 2019-09-10 16:44:41 +02:00
thomwolf 16b6361792 xlnet paassing first test 2019-09-10 12:39:27 +02:00
thomwolf 32aabe8c33 WIP XLNet 2019-09-10 12:17:18 +02:00
Thomas Wolf 2c177a87eb
Merge pull request #1228 from huggingface/head-masking-test
Trying to fix the head masking test
2019-09-10 11:55:27 +02:00
thomwolf f851fb55ca fixing error message 2019-09-10 09:24:08 +02:00
searchivarius eab980fd68 Fix to prevent crashing on assert len(tokens_b)>=1 2019-09-09 19:58:08 -04:00
VictorSanh a95ced6260 [Distillation] save last chkpt as `pytorch_model.bin` 2019-09-09 19:53:35 +00:00
thomwolf 50c6bc4195 fix tf bert model 2019-09-09 17:46:01 +02:00
Rohit Kumar Singh 4b082bd4d8
Merge pull request #1 from SKRohit/SKRohit-patch-1
changes in return statement of evaluate function
2019-09-09 19:59:27 +05:30
Rohit Kumar Singh e5df36397b
changes in return statement of evaluate function
changed `results` to `result` and removed `results` dict defined previously
2019-09-09 19:55:57 +05:30
thomwolf 0537139b2b removing tf.function 2019-09-09 14:47:31 +02:00
Thomas Wolf 84d346b687
Merge pull request #1195 from huggingface/reorder_arguments
[2.0] Reodering arguments for torch jit #1010 and future TF2.0 compatibility
2019-09-09 15:42:51 +03:00
Thomas Wolf 3f05de6dde
Merge branch 'master' into reorder_arguments 2019-09-09 15:42:25 +03:00
thomwolf 33cb00f41a add GPT2 to init - fix weights loading - remove tf.function 2019-09-09 14:29:24 +02:00
thomwolf 78b2a53f10 debug file download in tests error 2019-09-09 13:38:10 +02:00
thomwolf 6b3438df21 fixing GPT2 double head model and updating the torch version tests 2019-09-09 12:48:36 +02:00
thomwolf e360037236 Merge branch 'tf2' of https://github.com/huggingface/pytorch-transformers into tf2 2019-09-09 11:08:49 +02:00
thomwolf b7175a2701 fixed imports in tests and gpt2 config test 2019-09-09 11:04:03 +02:00
Thomas Wolf 995e38b7af
Merge pull request #1214 from huggingface/new-examples
Better examples
2019-09-09 10:26:36 +03:00
thomwolf 3401980fc4 fix #1208 2019-09-09 10:22:12 +03:00
thomwolf 728637356c WIP GPT2 2019-09-09 10:18:55 +03:00
thomwolf 34f28b2a13 WIP GPT2 2019-09-08 15:02:06 +03:00
thomwolf ad88563bda WIP GPT-2 2019-09-08 15:02:06 +03:00
thomwolf 64d83c7ae0 WIP 2019-09-08 15:02:06 +03:00
thomwolf 01597e5b90 add tf auto models + tests 2019-09-08 15:02:06 +03:00
thomwolf f5c698b21a add weights tying, attention and hidden states output tests 2019-09-08 15:02:06 +03:00
thomwolf 6dc4b6f34c skip transfo-xl tokenizer tests with tf for now 2019-09-08 15:02:06 +03:00
thomwolf e30579f764 no pytest version checking 2019-09-08 15:02:06 +03:00
thomwolf 518307dfcd test suite independent of framework 2019-09-08 15:02:06 +03:00
thomwolf 9d0a11a68c update dependencies and circle-ci 2019-09-08 15:02:06 +03:00
thomwolf 24a20483f5 update conversion script names 2019-09-08 15:02:06 +03:00
thomwolf 6f152572cd add conversion script, rename conversion scripts 2019-09-08 15:02:06 +03:00
thomwolf a4704b1263 skipping tf tests if tf is not installed 2019-09-08 15:02:06 +03:00
thomwolf ad0ab9afe9 fix test when tf is not here 2019-09-08 15:02:06 +03:00