Commit Graph

306 Commits

Author SHA1 Message Date
thomwolf a84adddd1b convert all models 2019-09-12 13:14:07 +02:00
thomwolf 969d3ae95e XLMWithLMHead fixed - standardize conversion 2019-09-11 15:47:33 +02:00
thomwolf 646711e1e2 standardize scopes names - add conversion methods 2019-09-11 15:34:17 +02:00
thomwolf 4356f791a2 XLM passing tests 2019-09-11 11:49:54 +02:00
thomwolf 465870c33f Xlnet working - also added simple question answering model for XLNet 2019-09-10 16:44:41 +02:00
thomwolf 16b6361792 xlnet paassing first test 2019-09-10 12:39:27 +02:00
thomwolf 32aabe8c33 WIP XLNet 2019-09-10 12:17:18 +02:00
thomwolf f851fb55ca fixing error message 2019-09-10 09:24:08 +02:00
thomwolf 50c6bc4195 fix tf bert model 2019-09-09 17:46:01 +02:00
thomwolf 0537139b2b removing tf.function 2019-09-09 14:47:31 +02:00
thomwolf 33cb00f41a add GPT2 to init - fix weights loading - remove tf.function 2019-09-09 14:29:24 +02:00
thomwolf 78b2a53f10 debug file download in tests error 2019-09-09 13:38:10 +02:00
thomwolf 6b3438df21 fixing GPT2 double head model and updating the torch version tests 2019-09-09 12:48:36 +02:00
thomwolf b7175a2701 fixed imports in tests and gpt2 config test 2019-09-09 11:04:03 +02:00
thomwolf 728637356c WIP GPT2 2019-09-09 10:18:55 +03:00
thomwolf 34f28b2a13 WIP GPT2 2019-09-08 15:02:06 +03:00
thomwolf ad88563bda WIP GPT-2 2019-09-08 15:02:06 +03:00
thomwolf 64d83c7ae0 WIP 2019-09-08 15:02:06 +03:00
thomwolf 01597e5b90 add tf auto models + tests 2019-09-08 15:02:06 +03:00
thomwolf f5c698b21a add weights tying, attention and hidden states output tests 2019-09-08 15:02:06 +03:00
thomwolf 6dc4b6f34c skip transfo-xl tokenizer tests with tf for now 2019-09-08 15:02:06 +03:00
thomwolf 518307dfcd test suite independent of framework 2019-09-08 15:02:06 +03:00
thomwolf 24a20483f5 update conversion script names 2019-09-08 15:02:06 +03:00
thomwolf 6f152572cd add conversion script, rename conversion scripts 2019-09-08 15:02:06 +03:00
thomwolf a4704b1263 skipping tf tests if tf is not installed 2019-09-08 15:02:06 +03:00
thomwolf ad0ab9afe9 fix test when tf is not here 2019-09-08 15:02:06 +03:00
thomwolf 59fe641b8b also gathering file names in file_utils 2019-09-08 15:02:06 +03:00
thomwolf d68a8fe462 add tf bert files 2019-09-08 15:02:06 +03:00
thomwolf 7ae642b72d update conversion scripts 2019-09-08 15:02:06 +03:00
thomwolf 69bff89935 clean ups 2019-09-08 15:02:06 +03:00
thomwolf 1efb1f1660 split configuration and modeling files 2019-09-08 15:02:06 +03:00
thomwolf 1eb125fb95 be sure we have uint8 2019-09-08 15:02:06 +03:00
thomwolf 7fba47b7d9 WIP reordering 2019-09-04 22:39:23 +02:00
thomwolf e25cba78cf WIP reodering arguments for torchscript and TF 2019-09-04 22:39:23 +02:00
thomwolf 38b79b5a63 Fixing this TransformerXL bool issue 2019-09-04 22:36:30 +02:00
thomwolf 89fd3450a6 Release: 1.2.0 2019-09-04 13:32:18 +02:00
Shijie Wu a15562e170 Fix reference of import when called for the second time 2019-09-03 18:27:29 -07:00
Thomas Wolf 0287d264e9 Merge pull request #1162 from huggingface/xlnet-bias: XLNet bias fix on resize embeddings (cf #1124) 2019-09-02 23:14:04 +02:00
LysandreJik 31d3373bc9 Appends space before special token 2019-09-01 21:07:00 -04:00
thomwolf fede4ef45d fixing #1133 2019-09-02 02:27:39 +02:00
Thomas Wolf ff7368eb6b Merge pull request #1077 from huggingface/pruning-save-and-load: Pruning changes so that deleted heads are kept on save/load 2019-09-01 09:42:15 +02:00
LysandreJik 6ae0bb5291 XLM 100 different URLs 2019-08-31 14:46:31 -04:00
LysandreJik 819b468f70 Fixed XLM model url 2019-08-31 14:40:51 -04:00
LysandreJik ea86bef545 Check for None 2019-08-31 00:56:22 -04:00
LysandreJik e0f867a9ba XLNet bias fix on resize embeddings (cf #1124) 2019-08-31 00:50:59 -04:00
LysandreJik 11600edc6e Rebase on master + DistilBERT head pruning patch 2019-08-31 00:37:41 -04:00
LysandreJik b6992b7b47 Applied patch to OpenAI GPT, RoBERTa, TransfoL, XLM and XLNet 2019-08-31 00:33:50 -04:00
thomwolf bdb4409ed8 updated pruning logic with sets - Bert and GPT-2 2019-08-31 00:33:50 -04:00
LysandreJik 0c8e823b03 Added patch to remaining models 2019-08-31 00:33:50 -04:00
LysandreJik 0cd283522a Attempt to fix head index 2019-08-31 00:33:50 -04:00