thomwolf
|
f12007e421
|
add head masking and pruning to openai GPT
|
2019-06-17 14:19:40 +02:00 |
thomwolf
|
8415a38b23
|
better error messages
|
2019-06-17 13:03:48 +02:00 |
Thomas Wolf
|
ff276fc00c
|
Merge branch 'master' into finish_torchhub_interfaces
|
2019-06-14 16:59:07 +02:00 |
Thomas Wolf
|
35e6baab37
|
Merge branch 'master' into attention
|
2019-06-14 16:41:56 +02:00 |
VictorSanh
|
8f97f6c57f
|
fix typo
cc @thomwolf
|
2019-06-01 17:29:07 -04:00 |
VictorSanh
|
0c5a4fe9c9
|
modify from_pretrained for OpenAIGPT
|
2019-05-31 00:27:18 -04:00 |
thomwolf
|
0efc4ab632
|
adding dropout to GPT-2 and embedding dropout to GPT
|
2019-05-08 10:41:35 +02:00 |
thomwolf
|
ce86336545
|
add predict_special_tokens option to GPT also
|
2019-05-07 16:47:22 +02:00 |
thomwolf
|
e211785ada
|
extract attention weights from GPT
|
2019-05-02 18:31:26 +02:00 |
thomwolf
|
c30139a013
|
add special tokens to gpt-2
|
2019-04-30 10:45:26 +02:00 |
Thomas Wolf
|
3d78e226e6
|
Merge pull request #489 from huggingface/tokenization_serialization
Better serialization for Tokenizers and Configuration classes - Also fix #466
|
2019-04-16 08:49:54 +02:00 |
thomwolf
|
df5d9c3551
|
load all models on cpu
|
2019-04-15 15:43:01 +02:00 |
thomwolf
|
60ea6c59d2
|
added best practices for serialization in README and examples
|
2019-04-15 15:00:33 +02:00 |
thomwolf
|
9761aa4845
|
add to_json_file method to configuration classes
|
2019-04-15 14:12:08 +02:00 |
thomwolf
|
fe2756ff41
|
update double head model
|
2019-04-15 10:04:05 +02:00 |
thomwolf
|
b509bf7655
|
updating loss computation
|
2019-04-12 12:12:33 +02:00 |
thomwolf
|
1d203a34c0
|
back to simple indexing
|
2019-04-11 23:51:03 +02:00 |
thomwolf
|
074c869bbe
|
fix OpenAIGPTMultipleChoiceHead
|
2019-04-11 20:53:50 +02:00 |
thomwolf
|
a05fad8dce
|
fix typo
|
2019-04-11 13:16:17 +02:00 |
thomwolf
|
4a82f4f856
|
update special token addition
|
2019-04-11 13:11:22 +02:00 |
thomwolf
|
991b8e65f4
|
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
|
2019-04-11 11:43:15 +02:00 |
thomwolf
|
e99b2014cc
|
fixes #471
|
2019-04-11 11:43:13 +02:00 |
Catalin Voss
|
01520d5412
|
Remove my unhelpful comments :)
|
2019-03-27 10:45:28 -07:00 |
Catalin Voss
|
fda2f62395
|
Fix test failures due to old torch issue with non-contiguous view
|
2019-03-24 14:37:13 -07:00 |
Catalin Voss
|
0dd796e359
|
Also fix loss function issue with the double head models
|
2019-03-24 14:35:55 -07:00 |
Catalin Voss
|
472857c47f
|
Fix typo syntax err (sorry, c/p from my repo)
|
2019-03-24 14:14:49 -07:00 |
Catalin Voss
|
2e6f5ffb96
|
Fix GPT language model loss here as well
|
2019-03-24 14:14:44 -07:00 |
thomwolf
|
e5f2d9122c
|
adding absolute imports to gpt2, openai and transfo-xl
|
2019-03-14 09:55:01 +01:00 |
Philipp Glock
|
6190e8ce4c
|
Fix: use dropout layer
|
2019-03-07 10:12:45 +01:00 |
thomwolf
|
5c85fc3977
|
fix typo - logger info
|
2019-03-06 10:05:21 +01:00 |
thomwolf
|
009ee86a19
|
fix tests - bump up version
|
2019-02-17 23:57:23 +01:00 |
thomwolf
|
1320e4ec0c
|
mc_token_mask => mc_token_ids
|
2019-02-09 16:58:53 +01:00 |
thomwolf
|
80607874c1
|
fix layer norm epsilon in OpenAI GPT
|
2019-02-08 21:49:05 +01:00 |
thomwolf
|
777459b471
|
run openai example running
|
2019-02-08 10:33:14 +01:00 |
thomwolf
|
edcb56fd96
|
more explicit variable name
|
2019-02-08 09:54:49 +01:00 |
thomwolf
|
9c3c24800b
|
split saved model in config & weights
|
2019-02-07 17:06:17 +01:00 |
thomwolf
|
448937c00d
|
python 2 compatibility
|
2019-02-06 00:07:46 +01:00 |
thomwolf
|
3a848111e6
|
update config, docstrings and readme to switch to seperated tokens and position embeddings
|
2019-01-29 11:00:11 +01:00 |
thomwolf
|
98c96fb1a7
|
splitting position and tokens embeddings in OpenAI GPT - updating tf imports - tests
|
2019-01-29 10:31:42 +01:00 |
thomwolf
|
5456d82311
|
more versatile model loading
|
2019-01-29 09:54:18 +01:00 |
thomwolf
|
bd3b3aee9c
|
update
|
2019-01-28 17:47:29 +01:00 |
thomwolf
|
b12616fd8e
|
updating code organization to fix imports
|
2019-01-28 17:03:39 +01:00 |
thomwolf
|
d77dd62ff8
|
directly load from TF checkpoints + code cleanup
|
2019-01-28 16:50:23 +01:00 |
thomwolf
|
3a9c88377f
|
adding Transformer XL
|
2019-01-15 12:59:38 +01:00 |
thomwolf
|
ab90d4cddd
|
adding docs and example for OpenAI GPT
|
2019-01-09 00:12:43 +01:00 |
thomwolf
|
dc5df92fa8
|
added LM head for OpenAI
|
2019-01-08 17:18:47 +01:00 |
thomwolf
|
3cf12b235a
|
added tests + fixed losses
|
2019-01-08 16:24:23 +01:00 |
thomwolf
|
eed51c5bdf
|
add OpenAI GPT
|
2019-01-08 12:26:58 +01:00 |
thomwolf
|
93f563b8a8
|
adding OpenAI GPT
|
2019-01-07 12:55:36 +01:00 |