thomwolf
|
dcddf498c8
|
fix bert layernorm
|
2019-09-12 16:46:32 +02:00 |
thomwolf
|
d3a3a0353c
|
clean up cache after conversion
|
2019-09-12 16:42:52 +02:00 |
thomwolf
|
a84adddd1b
|
convert all models
|
2019-09-12 13:14:07 +02:00 |
VictorSanh
|
32e1332acf
|
[distil] fix once for all general logger for scripts
|
2019-09-11 14:19:07 +00:00 |
Thomas Wolf
|
b62abe87c9
|
Merge pull request #1249 from ziliwang/master
fixed: hard coding for max and min number will out of range in fp16, which will cause nan.
|
2019-09-11 15:53:28 +02:00 |
thomwolf
|
969d3ae95e
|
XLMWithLMHead fixed - standardize conversion
|
2019-09-11 15:47:33 +02:00 |
thomwolf
|
646711e1e2
|
standardize scopes names - add conversion methods
|
2019-09-11 15:34:17 +02:00 |
thomwolf
|
4356f791a2
|
XLM passing tests
|
2019-09-11 11:49:54 +02:00 |
LysandreJik
|
11ac4b9555
|
[CI] Symbolic link for documentation
|
2019-09-11 10:13:44 +02:00 |
Zili Wang
|
8bdee1cb73
|
fixed: hard coding for max and min number will out of range in fp16, which will cause nan.
|
2019-09-11 15:41:53 +08:00 |
ziliwang
|
7424b2848f
|
Merge pull request #1 from huggingface/master
merege from original repo
|
2019-09-11 11:02:23 +08:00 |
VictorSanh
|
364920e216
|
fix small bug/typo
|
2019-09-10 21:45:01 +00:00 |
Thomas Wolf
|
23c23f5399
|
Merge pull request #1229 from SKRohit/master
changes in evaluate function in run_lm_finetuning.py
|
2019-09-10 22:16:45 +02:00 |
Thomas Wolf
|
99a54ac51c
|
Merge pull request #1233 from searchivarius/master
Fix to prevent crashing on assert len(tokens_b)>=1
|
2019-09-10 22:15:47 +02:00 |
Thomas Wolf
|
439b37b474
|
Merge pull request #1241 from mattolson93/patch-1
Fixing typo in gpt2 for doc site's class link
|
2019-09-10 22:14:18 +02:00 |
mattolson93
|
f2cf6ce4a9
|
Fixing typo in gpt2 for doc site's class link
|
2019-09-10 09:12:01 -07:00 |
thomwolf
|
465870c33f
|
Xlnet working - also added simple question answering model for XLNet
|
2019-09-10 16:44:41 +02:00 |
thomwolf
|
16b6361792
|
xlnet paassing first test
|
2019-09-10 12:39:27 +02:00 |
thomwolf
|
32aabe8c33
|
WIP XLNet
|
2019-09-10 12:17:18 +02:00 |
Thomas Wolf
|
2c177a87eb
|
Merge pull request #1228 from huggingface/head-masking-test
Trying to fix the head masking test
|
2019-09-10 11:55:27 +02:00 |
thomwolf
|
f851fb55ca
|
fixing error message
|
2019-09-10 09:24:08 +02:00 |
searchivarius
|
eab980fd68
|
Fix to prevent crashing on assert len(tokens_b)>=1
|
2019-09-09 19:58:08 -04:00 |
VictorSanh
|
a95ced6260
|
[Distillation] save last chkpt as `pytorch_model.bin`
|
2019-09-09 19:53:35 +00:00 |
thomwolf
|
50c6bc4195
|
fix tf bert model
|
2019-09-09 17:46:01 +02:00 |
Rohit Kumar Singh
|
4b082bd4d8
|
Merge pull request #1 from SKRohit/SKRohit-patch-1
changes in return statement of evaluate function
|
2019-09-09 19:59:27 +05:30 |
Rohit Kumar Singh
|
e5df36397b
|
changes in return statement of evaluate function
changed `results` to `result` and removed `results` dict defined previously
|
2019-09-09 19:55:57 +05:30 |
thomwolf
|
0537139b2b
|
removing tf.function
|
2019-09-09 14:47:31 +02:00 |
Thomas Wolf
|
84d346b687
|
Merge pull request #1195 from huggingface/reorder_arguments
[2.0] Reodering arguments for torch jit #1010 and future TF2.0 compatibility
|
2019-09-09 15:42:51 +03:00 |
Thomas Wolf
|
3f05de6dde
|
Merge branch 'master' into reorder_arguments
|
2019-09-09 15:42:25 +03:00 |
thomwolf
|
33cb00f41a
|
add GPT2 to init - fix weights loading - remove tf.function
|
2019-09-09 14:29:24 +02:00 |
thomwolf
|
78b2a53f10
|
debug file download in tests error
|
2019-09-09 13:38:10 +02:00 |
thomwolf
|
6b3438df21
|
fixing GPT2 double head model and updating the torch version tests
|
2019-09-09 12:48:36 +02:00 |
thomwolf
|
e360037236
|
Merge branch 'tf2' of https://github.com/huggingface/pytorch-transformers into tf2
|
2019-09-09 11:08:49 +02:00 |
thomwolf
|
b7175a2701
|
fixed imports in tests and gpt2 config test
|
2019-09-09 11:04:03 +02:00 |
Thomas Wolf
|
995e38b7af
|
Merge pull request #1214 from huggingface/new-examples
Better examples
|
2019-09-09 10:26:36 +03:00 |
thomwolf
|
3401980fc4
|
fix #1208
|
2019-09-09 10:22:12 +03:00 |
thomwolf
|
728637356c
|
WIP GPT2
|
2019-09-09 10:18:55 +03:00 |
thomwolf
|
34f28b2a13
|
WIP GPT2
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
ad88563bda
|
WIP GPT-2
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
64d83c7ae0
|
WIP
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
01597e5b90
|
add tf auto models + tests
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
f5c698b21a
|
add weights tying, attention and hidden states output tests
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
6dc4b6f34c
|
skip transfo-xl tokenizer tests with tf for now
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
e30579f764
|
no pytest version checking
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
518307dfcd
|
test suite independent of framework
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
9d0a11a68c
|
update dependencies and circle-ci
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
24a20483f5
|
update conversion script names
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
6f152572cd
|
add conversion script, rename conversion scripts
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
a4704b1263
|
skipping tf tests if tf is not installed
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
ad0ab9afe9
|
fix test when tf is not here
|
2019-09-08 15:02:06 +03:00 |