LysandreJik
|
03de9686a7
|
Initial folder structure for the documentation. A draft of documentation change has been made in the BertModel class.
|
2019-07-05 17:11:13 -04:00 |
thomwolf
|
e75c3f70aa
|
standardizing tokenizers API and adding tests
|
2019-07-05 11:20:27 +02:00 |
thomwolf
|
c0239e09e6
|
first commit
|
2019-07-04 17:06:30 +02:00 |
thomwolf
|
cf86d23eff
|
parallelism in circlci
|
2019-07-04 17:02:21 +02:00 |
thomwolf
|
15b70338ba
|
adding squad model to xlnet and xlm
|
2019-07-04 16:50:42 +02:00 |
thomwolf
|
fbe04423b6
|
Common SequenceSummary class
|
2019-07-04 00:25:30 +02:00 |
thomwolf
|
c22545aa40
|
fix xlm torchscript
|
2019-07-03 23:03:57 +02:00 |
thomwolf
|
3b23a846b6
|
Merge branch 'xlnet' of https://github.com/huggingface/pytorch-pretrained-BERT into xlnet
|
2019-07-03 22:54:58 +02:00 |
thomwolf
|
8fa3a1f0d8
|
updating tests
|
2019-07-03 22:54:53 +02:00 |
thomwolf
|
c41f2bad69
|
WIP XLM + refactoring
|
2019-07-03 22:54:39 +02:00 |
Thomas Wolf
|
64ce4dbd86
|
Merge pull request #748 from huggingface/torchscript
Release 0.7 - Add Torchscript capabilities
|
2019-07-03 22:52:03 +02:00 |
LysandreJik
|
b43b130f35
|
TorchScript flag in config; Tied weights when not running TorchScript; tuple concatenation clean-up.
|
2019-07-03 16:21:17 -04:00 |
LysandreJik
|
4703148f0c
|
TransformerXL can't be exported to TorchScript because of control-flow. Exception added to tests.
|
2019-07-03 14:50:23 -04:00 |
LysandreJik
|
971c24687f
|
XLNET can be exported to TorchScript
|
2019-07-03 11:03:09 -04:00 |
LysandreJik
|
be54b16960
|
GPT can be exported to TorchScript
|
2019-07-02 18:09:45 -04:00 |
LysandreJik
|
d8e83de792
|
GPT2 can be exported to TorchScript
|
2019-07-02 18:01:09 -04:00 |
thomwolf
|
288be7b7ea
|
xlm
|
2019-07-02 23:42:31 +02:00 |
LysandreJik
|
e891bb43d5
|
BERT can be exported to TorchScript
|
2019-07-02 17:23:18 -04:00 |
LysandreJik
|
6ce1ee04fc
|
TorchScript testing with output_attentions and output_hidden_state
|
2019-07-02 17:22:59 -04:00 |
thomwolf
|
7ed5bf706f
|
add tests
|
2019-07-02 16:42:22 +02:00 |
thomwolf
|
708877958a
|
updating tests and models, adding weights initialization test
|
2019-07-02 16:35:29 +02:00 |
thomwolf
|
99ae5ab883
|
update config tests and circle-ci
|
2019-07-02 12:40:39 +02:00 |
thomwolf
|
1484d67de9
|
[LARGE] updating all tests and API
|
2019-07-02 12:13:17 +02:00 |
thomwolf
|
4f8b5f687c
|
add fix for serialization of tokenizer
|
2019-06-29 23:35:21 +02:00 |
thomwolf
|
d9184620f9
|
fix tests and new API
|
2019-06-29 23:10:40 +02:00 |
thomwolf
|
213981d8cb
|
updating bert API
|
2019-06-28 16:45:24 +02:00 |
thomwolf
|
2b56e98892
|
standardizing API across models - XLNetForSeqClass working
|
2019-06-28 16:35:09 +02:00 |
thomwolf
|
3a00674cbf
|
fix imports
|
2019-06-27 17:18:46 +02:00 |
thomwolf
|
d939d6fd02
|
fix hidden-state extraction
|
2019-06-27 09:39:44 +02:00 |
thomwolf
|
0c2ff34815
|
extracting double hidden-state from xlnet
|
2019-06-27 09:27:50 +02:00 |
thomwolf
|
3deea56c07
|
fixing loading fucntion
|
2019-06-26 13:41:12 +02:00 |
thomwolf
|
f56b8033f0
|
more versatile loading
|
2019-06-26 13:13:15 +02:00 |
thomwolf
|
4d47f4985d
|
slight refactoring, add abstract class for model loading
|
2019-06-26 12:52:44 +02:00 |
thomwolf
|
59cefd4f98
|
fix #726 - get_lr in examples
|
2019-06-26 11:28:27 +02:00 |
thomwolf
|
ddc2cc61a6
|
fix python2 tests
|
2019-06-26 11:17:42 +02:00 |
thomwolf
|
7e3070ae4f
|
add from_pretrained method to all configuration classes
|
2019-06-26 11:12:00 +02:00 |
thomwolf
|
93e9971c54
|
fix tests
|
2019-06-26 10:02:45 +02:00 |
thomwolf
|
092dacfd62
|
changing is_regression to unified API
|
2019-06-26 09:54:05 +02:00 |
thomwolf
|
e55d4c4ede
|
various updates to conversion, models and examples
|
2019-06-26 00:57:53 +02:00 |
thomwolf
|
603c513b35
|
update main conversion script and readme
|
2019-06-25 10:45:07 +02:00 |
thomwolf
|
7de1740490
|
add ability to restore fine-tuned TF mdoel
|
2019-06-25 10:27:58 +02:00 |
thomwolf
|
7334bf6c21
|
pad on left for xlnet
|
2019-06-24 15:05:11 +02:00 |
thomwolf
|
c888663f18
|
overwrite output directories if needed
|
2019-06-24 14:38:24 +02:00 |
thomwolf
|
62d78aa37e
|
updating GLUE utils for compatibility with XLNet
|
2019-06-24 14:36:11 +02:00 |
thomwolf
|
24ed0b9346
|
updating run_xlnet_classifier
|
2019-06-24 12:00:09 +02:00 |
thomwolf
|
f6081f2255
|
add xlnetforsequence classif and run_classifier example for xlnet
|
2019-06-24 10:01:07 +02:00 |
thomwolf
|
c946bb51a6
|
fix xlnet tokenizer and python2
|
2019-06-22 22:28:49 +02:00 |
thomwolf
|
181075635d
|
updating model loading and adding special tokens ids
|
2019-06-21 23:23:37 +02:00 |
thomwolf
|
ebd2cb8d74
|
update from_pretrained to load XLNetModel as well
|
2019-06-21 21:08:44 +02:00 |
thomwolf
|
483cbc36a9
|
test deviation with tf model: max ~1e-3 should be ok
|
2019-06-21 16:38:01 +02:00 |