Commit Graph

640 Commits

Author SHA1 Message Date
thomwolf bc70779bf0 fixed GPT-2 tokenization on python 2 2019-04-17 10:56:15 +02:00
thomwolf 18a8a15f78 improving GPT2 tokenization and adding tests 2019-04-16 17:00:55 +02:00
thomwolf 9761aa4845 add to_json_file method to configuration classes 2019-04-15 14:12:08 +02:00
thomwolf e8568a3b17 fixing tests 2019-04-15 12:55:38 +02:00
thomwolf 870b734bfd added tokenizers serialization tests 2019-04-15 12:03:56 +02:00
lukovnikov 20686b78fc schedule fix 2019-04-03 18:13:52 +02:00
lukovnikov 1b4ce76c38 schedule fix 2019-04-03 17:40:12 +02:00
lukovnikov 23bd2eebf5 schedule fix 2019-04-03 17:10:34 +02:00
lukovnikov 91a073f804 schedule fix 2019-04-03 17:10:08 +02:00
lukovnikov b64cc63a77 optimization schedule test update 2019-04-03 16:42:40 +02:00
lukovnikov d164867d90 - updated docs for optimization 2019-04-03 16:13:51 +02:00
lukovnikov 262a9992d7 class weights 2019-03-18 18:29:12 +01:00
thomwolf 2dd8f524f5 removing test for long sequences error following #337 2019-03-06 10:10:41 +01:00
thomwolf 009ee86a19 fix tests - bump up version 2019-02-17 23:57:23 +01:00
thomwolf ffd623823d adding gpt2 2019-02-17 23:38:51 +01:00
thomwolf 884ca81d87 transposing the inputs of Transformer-XL to have a unified interface 2019-02-11 13:19:59 +01:00
thomwolf 0a9860daa7 tests pass on python 2 and 3 2019-02-11 10:47:52 +01:00
thomwolf 2071a9b86e fix python 2.7 imports 2019-02-11 10:35:36 +01:00
thomwolf b514a60c36 added tests for OpenAI GPT and Transformer-XL tokenizers 2019-02-11 10:17:16 +01:00
thomwolf 9bdcba53fd fix tests 2019-02-09 17:07:12 +01:00
thomwolf 1320e4ec0c mc_token_mask => mc_token_ids 2019-02-09 16:58:53 +01:00
thomwolf 2df41663f1 added test 2019-02-07 17:05:49 +01:00
thomwolf ba9e4eb354 fix unicode in tokenization tests 2019-02-06 00:28:00 +01:00
thomwolf 448937c00d python 2 compatibility 2019-02-06 00:07:46 +01:00
thomwolf 98c96fb1a7 splitting position and tokens embeddings in OpenAI GPT - updating tf imports - tests 2019-01-29 10:31:42 +01:00
thomwolf a45a9cc0e1 update tests 2019-01-28 17:16:02 +01:00
thomwolf dc5df92fa8 added LM head for OpenAI 2019-01-08 17:18:47 +01:00
thomwolf 3cf12b235a added tests + fixed losses 2019-01-08 16:24:23 +01:00
Patrick Lewis 78cf7b4ab4 added code to raise value error for bert tokenizer for covert_tokens_to_indices 2018-12-18 14:41:30 +00:00
thomwolf 0f544625f4 fix swag example for work with apex 2018-12-13 13:35:59 +01:00
thomwolf 52c53f39d0 clean up apex integration 2018-12-13 13:02:17 +01:00
thomwolf 85fff78c2d compatibility PT 1.0 and 0.4.1 2018-12-13 12:48:13 +01:00
Deyu Fu c8ea286048 change to apex for better fp16 and multi-gpu support 2018-12-11 17:13:58 -08:00
thomwolf 7f7c41b0c1 tests for all model classes with and without labels 2018-11-30 22:54:33 +01:00
thomwolf 757750d6f6 fix tests 2018-11-17 11:58:14 +01:00
thomwolf 1de35b624b preparing for first release 2018-11-15 20:56:10 +01:00
Yaser Martinez Palenzuela 4d124baf8f
Add test for Chinese tokenization 2018-11-05 23:04:29 +01:00
thomwolf 3d291dea4a clean up tests 2018-11-04 21:27:19 +01:00
thomwolf 87da161c2a finishing model test 2018-11-04 21:27:10 +01:00
thomwolf f8276008df update readme, file names, removing TF code, moving tests 2018-11-03 23:35:14 +01:00