thomwolf
|
a84adddd1b
|
convert all models
|
2019-09-12 13:14:07 +02:00 |
thomwolf
|
969d3ae95e
|
XLMWithLMHead fixed - standardize conversion
|
2019-09-11 15:47:33 +02:00 |
thomwolf
|
646711e1e2
|
standardize scopes names - add conversion methods
|
2019-09-11 15:34:17 +02:00 |
thomwolf
|
4356f791a2
|
XLM passing tests
|
2019-09-11 11:49:54 +02:00 |
thomwolf
|
465870c33f
|
Xlnet working - also added simple question answering model for XLNet
|
2019-09-10 16:44:41 +02:00 |
thomwolf
|
16b6361792
|
xlnet paassing first test
|
2019-09-10 12:39:27 +02:00 |
thomwolf
|
32aabe8c33
|
WIP XLNet
|
2019-09-10 12:17:18 +02:00 |
thomwolf
|
f851fb55ca
|
fixing error message
|
2019-09-10 09:24:08 +02:00 |
thomwolf
|
50c6bc4195
|
fix tf bert model
|
2019-09-09 17:46:01 +02:00 |
thomwolf
|
0537139b2b
|
removing tf.function
|
2019-09-09 14:47:31 +02:00 |
thomwolf
|
33cb00f41a
|
add GPT2 to init - fix weights loading - remove tf.function
|
2019-09-09 14:29:24 +02:00 |
thomwolf
|
78b2a53f10
|
debug file download in tests error
|
2019-09-09 13:38:10 +02:00 |
thomwolf
|
6b3438df21
|
fixing GPT2 double head model and updating the torch version tests
|
2019-09-09 12:48:36 +02:00 |
thomwolf
|
b7175a2701
|
fixed imports in tests and gpt2 config test
|
2019-09-09 11:04:03 +02:00 |
thomwolf
|
728637356c
|
WIP GPT2
|
2019-09-09 10:18:55 +03:00 |
thomwolf
|
34f28b2a13
|
WIP GPT2
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
ad88563bda
|
WIP GPT-2
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
64d83c7ae0
|
WIP
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
01597e5b90
|
add tf auto models + tests
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
f5c698b21a
|
add weights tying, attention and hidden states output tests
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
6dc4b6f34c
|
skip transfo-xl tokenizer tests with tf for now
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
518307dfcd
|
test suite independent of framework
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
24a20483f5
|
update conversion script names
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
6f152572cd
|
add conversion script, rename conversion scripts
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
a4704b1263
|
skipping tf tests if tf is not installed
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
ad0ab9afe9
|
fix test when tf is not here
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
59fe641b8b
|
also gathering file names in file_utils
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
d68a8fe462
|
add tf bert files
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
7ae642b72d
|
update conversion scripts
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
69bff89935
|
clean ups
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
1efb1f1660
|
split configuration and modeling files
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
1eb125fb95
|
be sure we have uint8
|
2019-09-08 15:02:06 +03:00 |
thomwolf
|
7fba47b7d9
|
WIP reordering
|
2019-09-04 22:39:23 +02:00 |
thomwolf
|
e25cba78cf
|
WIP reodering arguments for torchscript and TF
|
2019-09-04 22:39:23 +02:00 |
thomwolf
|
38b79b5a63
|
Fixing this TransformerXL bool issue
|
2019-09-04 22:36:30 +02:00 |
thomwolf
|
89fd3450a6
|
Release: 1.2.0
|
2019-09-04 13:32:18 +02:00 |
Shijie Wu
|
a15562e170
|
Fix reference of import when called for the second time
|
2019-09-03 18:27:29 -07:00 |
Thomas Wolf
|
0287d264e9
|
Merge pull request #1162 from huggingface/xlnet-bias
XLNet bias fix on resize embeddings (cf #1124)
|
2019-09-02 23:14:04 +02:00 |
LysandreJik
|
31d3373bc9
|
Appends space before special token
|
2019-09-01 21:07:00 -04:00 |
thomwolf
|
fede4ef45d
|
fixing #1133
|
2019-09-02 02:27:39 +02:00 |
Thomas Wolf
|
ff7368eb6b
|
Merge pull request #1077 from huggingface/pruning-save-and-load
Pruning changes so that deleted heads are kept on save/load
|
2019-09-01 09:42:15 +02:00 |
LysandreJik
|
6ae0bb5291
|
XLM 100 different URLs
|
2019-08-31 14:46:31 -04:00 |
LysandreJik
|
819b468f70
|
Fixed XLM model url
|
2019-08-31 14:40:51 -04:00 |
LysandreJik
|
ea86bef545
|
Check for None
|
2019-08-31 00:56:22 -04:00 |
LysandreJik
|
e0f867a9ba
|
XLNet bias fix on resize embeddings (cf #1124)
|
2019-08-31 00:50:59 -04:00 |
LysandreJik
|
11600edc6e
|
Rebase on master + DistilBERT head pruning patch
|
2019-08-31 00:37:41 -04:00 |
LysandreJik
|
b6992b7b47
|
Applied patch to OpenAI GPT, RoBERTa, TransfoL, XLM and XLNet
|
2019-08-31 00:33:50 -04:00 |
thomwolf
|
bdb4409ed8
|
updated pruning logic with sets - Bert and GPT-2
|
2019-08-31 00:33:50 -04:00 |
LysandreJik
|
0c8e823b03
|
Added patch to remaining models
|
2019-08-31 00:33:50 -04:00 |
LysandreJik
|
0cd283522a
|
Attempt to fix head index
|
2019-08-31 00:33:50 -04:00 |