Commit Graph

975 Commits

Author SHA1 Message Date
LysandreJik 03de9686a7 Initial folder structure for the documentation. A draft of documentation change has been made in the BertModel class. 2019-07-05 17:11:13 -04:00
thomwolf e75c3f70aa standardizing tokenizers API and adding tests 2019-07-05 11:20:27 +02:00
thomwolf c0239e09e6 first commit 2019-07-04 17:06:30 +02:00
thomwolf cf86d23eff parallelism in circleci 2019-07-04 17:02:21 +02:00
thomwolf 15b70338ba adding squad model to xlnet and xlm 2019-07-04 16:50:42 +02:00
thomwolf fbe04423b6 Common SequenceSummary class 2019-07-04 00:25:30 +02:00
thomwolf c22545aa40 fix xlm torchscript 2019-07-03 23:03:57 +02:00
thomwolf 3b23a846b6 Merge branch 'xlnet' of https://github.com/huggingface/pytorch-pretrained-BERT into xlnet 2019-07-03 22:54:58 +02:00
thomwolf 8fa3a1f0d8 updating tests 2019-07-03 22:54:53 +02:00
thomwolf c41f2bad69 WIP XLM + refactoring 2019-07-03 22:54:39 +02:00
Thomas Wolf 64ce4dbd86 Merge pull request #748 from huggingface/torchscript: Release 0.7 - Add Torchscript capabilities 2019-07-03 22:52:03 +02:00
LysandreJik b43b130f35 TorchScript flag in config; Tied weights when not running TorchScript; tuple concatenation clean-up. 2019-07-03 16:21:17 -04:00
LysandreJik 4703148f0c TransformerXL can't be exported to TorchScript because of control-flow. Exception added to tests. 2019-07-03 14:50:23 -04:00
LysandreJik 971c24687f XLNET can be exported to TorchScript 2019-07-03 11:03:09 -04:00
LysandreJik be54b16960 GPT can be exported to TorchScript 2019-07-02 18:09:45 -04:00
LysandreJik d8e83de792 GPT2 can be exported to TorchScript 2019-07-02 18:01:09 -04:00
thomwolf 288be7b7ea xlm 2019-07-02 23:42:31 +02:00
LysandreJik e891bb43d5 BERT can be exported to TorchScript 2019-07-02 17:23:18 -04:00
LysandreJik 6ce1ee04fc TorchScript testing with output_attentions and output_hidden_state 2019-07-02 17:22:59 -04:00
thomwolf 7ed5bf706f add tests 2019-07-02 16:42:22 +02:00
thomwolf 708877958a updating tests and models, adding weights initialization test 2019-07-02 16:35:29 +02:00
thomwolf 99ae5ab883 update config tests and circle-ci 2019-07-02 12:40:39 +02:00
thomwolf 1484d67de9 [LARGE] updating all tests and API 2019-07-02 12:13:17 +02:00
thomwolf 4f8b5f687c add fix for serialization of tokenizer 2019-06-29 23:35:21 +02:00
thomwolf d9184620f9 fix tests and new API 2019-06-29 23:10:40 +02:00
thomwolf 213981d8cb updating bert API 2019-06-28 16:45:24 +02:00
thomwolf 2b56e98892 standardizing API across models - XLNetForSeqClass working 2019-06-28 16:35:09 +02:00
thomwolf 3a00674cbf fix imports 2019-06-27 17:18:46 +02:00
thomwolf d939d6fd02 fix hidden-state extraction 2019-06-27 09:39:44 +02:00
thomwolf 0c2ff34815 extracting double hidden-state from xlnet 2019-06-27 09:27:50 +02:00
thomwolf 3deea56c07 fixing loading function 2019-06-26 13:41:12 +02:00
thomwolf f56b8033f0 more versatile loading 2019-06-26 13:13:15 +02:00
thomwolf 4d47f4985d slight refactoring, add abstract class for model loading 2019-06-26 12:52:44 +02:00
thomwolf 59cefd4f98 fix #726 - get_lr in examples 2019-06-26 11:28:27 +02:00
thomwolf ddc2cc61a6 fix python2 tests 2019-06-26 11:17:42 +02:00
thomwolf 7e3070ae4f add from_pretrained method to all configuration classes 2019-06-26 11:12:00 +02:00
thomwolf 93e9971c54 fix tests 2019-06-26 10:02:45 +02:00
thomwolf 092dacfd62 changing is_regression to unified API 2019-06-26 09:54:05 +02:00
thomwolf e55d4c4ede various updates to conversion, models and examples 2019-06-26 00:57:53 +02:00
thomwolf 603c513b35 update main conversion script and readme 2019-06-25 10:45:07 +02:00
thomwolf 7de1740490 add ability to restore fine-tuned TF model 2019-06-25 10:27:58 +02:00
thomwolf 7334bf6c21 pad on left for xlnet 2019-06-24 15:05:11 +02:00
thomwolf c888663f18 overwrite output directories if needed 2019-06-24 14:38:24 +02:00
thomwolf 62d78aa37e updating GLUE utils for compatibility with XLNet 2019-06-24 14:36:11 +02:00
thomwolf 24ed0b9346 updating run_xlnet_classifier 2019-06-24 12:00:09 +02:00
thomwolf f6081f2255 add xlnetforsequence classif and run_classifier example for xlnet 2019-06-24 10:01:07 +02:00
thomwolf c946bb51a6 fix xlnet tokenizer and python2 2019-06-22 22:28:49 +02:00
thomwolf 181075635d updating model loading and adding special tokens ids 2019-06-21 23:23:37 +02:00
thomwolf ebd2cb8d74 update from_pretrained to load XLNetModel as well 2019-06-21 21:08:44 +02:00
thomwolf 483cbc36a9 test deviation with tf model: max ~1e-3 should be ok 2019-06-21 16:38:01 +02:00