Commit Graph

1008 Commits

Author SHA1 Message Date
thomwolf 34d706a0e1 pruning in bertology 2019-06-19 15:25:49 +02:00
thomwolf dc8e0019b7 updating examples 2019-06-19 13:23:20 +02:00
thomwolf 68ab9599ce small fix and updates to readme 2019-06-19 09:38:38 +02:00
thomwolf f7e2ac01ea update barrier 2019-06-18 22:43:35 +02:00
thomwolf 4d8c4337ae test barrier in distrib training 2019-06-18 22:41:28 +02:00
thomwolf 3359955622 updating run_classif 2019-06-18 22:23:10 +02:00
thomwolf 29b7b30eaa updating evaluation on a single gpu 2019-06-18 22:20:21 +02:00
thomwolf 7d2001aa44 overwrite_output_dir 2019-06-18 22:13:30 +02:00
thomwolf 16a1f338c4 fixing 2019-06-18 17:06:31 +02:00
thomwolf 92e0ad5aba no numpy 2019-06-18 17:00:52 +02:00
thomwolf 4e6edc3274 hop 2019-06-18 16:57:15 +02:00
thomwolf f55b60b9ee fixing again 2019-06-18 16:56:52 +02:00
thomwolf 8bd9118294 quick fix 2019-06-18 16:54:41 +02:00
thomwolf 3e847449ad fix out_label_ids 2019-06-18 16:53:31 +02:00
thomwolf aad3a54e9c fix paths 2019-06-18 16:48:04 +02:00
thomwolf 40dbda6871 updating classification example 2019-06-18 16:45:52 +02:00
thomwolf 7388c83b60 update run_classifier for distributed eval 2019-06-18 16:32:49 +02:00
thomwolf 9727723243 fix pickle 2019-06-18 16:02:42 +02:00
thomwolf 9710b68dbc fix pickles 2019-06-18 16:01:15 +02:00
thomwolf 15ebd67d4e cache in run_classifier + various fixes to the examples 2019-06-18 15:58:22 +02:00
thomwolf e6e5f19257 fix 2019-06-18 14:45:14 +02:00
thomwolf a432b3d466 distributed traing t_total 2019-06-18 14:39:09 +02:00
thomwolf c5407f343f split squad example in two 2019-06-18 14:29:03 +02:00
thomwolf 335f57baf8 only on main process 2019-06-18 14:03:46 +02:00
thomwolf 326944d627 add tensorboard to run_squad 2019-06-18 14:02:42 +02:00
thomwolf d82e5deeb1 set find_unused_parameters=True in DDP 2019-06-18 12:13:14 +02:00
thomwolf a59abedfb5 DDP update 2019-06-18 12:06:26 +02:00
thomwolf 2ef5e0de87 switch to pytorch DistributedDataParallel 2019-06-18 12:03:13 +02:00
thomwolf 9ce37af99b oups 2019-06-18 11:47:54 +02:00
thomwolf a40955f071 no need to duplicate models anymore 2019-06-18 11:46:14 +02:00
Thomas Wolf 3763f8944d
Merge pull request #696 from huggingface/split_config_weights
Split config weights
2019-06-18 11:42:57 +02:00
thomwolf f964753090 explanation on the current location of the caching folder 2019-06-18 11:36:28 +02:00
thomwolf 868de8d1d7 updating weights loading 2019-06-18 10:58:20 +02:00
thomwolf 64e0adda81 better error message 2019-06-18 10:51:31 +02:00
thomwolf 382e2d1e50 spliting config and weight files for bert also 2019-06-18 10:37:16 +02:00
Thomas Wolf a6f2511811
Merge pull request #694 from huggingface/release_0.6.3
Release 0.6.3
2019-06-17 16:27:25 +02:00
thomwolf 4447f270b2 updating hub 2019-06-17 16:21:28 +02:00
thomwolf 33d3db5c43 updating head masking, readme and docstrings 2019-06-17 15:51:28 +02:00
thomwolf 965f172de6 output all hidden layers states in GPT/GPT-2 2019-06-17 14:34:12 +02:00
thomwolf f12007e421 add head masking and pruning to openai GPT 2019-06-17 14:19:40 +02:00
thomwolf b860e47cf5 add head masking and pruning to gpt-2 2019-06-17 14:12:10 +02:00
thomwolf 7220d47a1c adding head pruning and tests 2019-06-17 13:20:45 +02:00
thomwolf 8415a38b23 better error messages 2019-06-17 13:03:48 +02:00
thomwolf 96c4d3d988 add head masking tests 2019-06-17 12:17:26 +02:00
thomwolf 34858ae1d9 adding bert whole words, bertgerman and gpt-2 medium models, head masking 2019-06-17 11:02:39 +02:00
Thomas Wolf 80684f6f86
Merge pull request #690 from shashwath94/projadpsftmax_fix
Transformer XL ProjectedAdaptiveLogSoftmax output fix
2019-06-15 23:14:10 +02:00
Thomas Wolf 9e363703d6
Merge pull request #688 from deepset-ai/german_bert
Add German Bert model to code, update readme
2019-06-15 23:13:41 +02:00
Thomas Wolf cc6cd430f7
Merge pull request #691 from vanche/master
import class "GPT2MultipleChoiceHead"
2019-06-15 23:12:55 +02:00
vanche 8289646d4e
import class "GPT2MultipleChoiceHead" 2019-06-15 22:19:30 +09:00
Shashwath H A 5076a5daa7 Fix proj adp softmax output return when n_clusters=0 2019-06-14 22:03:21 -04:00