transformers

Commit Graph

Author	SHA1	Message	Date
thomwolf	d940eeda54	typo	2018-11-12 15:26:46 +01:00
thomwolf	1cf0a16c67	cleaning up readme	2018-11-12 15:24:47 +01:00
thomwolf	66b0090877	add fp16 training	2018-11-12 15:15:02 +01:00
Thomas Wolf	5dfd19060a	fix typo in readme	2018-11-12 12:39:57 +01:00
Thomas Wolf	fa1aa81f26	fix typo in readme bach examples	2018-11-12 08:37:43 +01:00
Thomas Wolf	6d6b916f48	update to BERT-large results	2018-11-11 17:00:49 +01:00
Thomas Wolf	c4bfc646f5	Add results of fine-tuning BERT-large on GPUs	2018-11-11 16:59:35 +01:00
Thomas Wolf	48930a4cff	Merge pull request #2 from elyase/patch-1 Port tokenization for the multilingual model	2018-11-10 22:27:45 +01:00
thomwolf	a81a1ef8e9	fixing learning rate schedule when using gradient_accumulation_steps	2018-11-10 16:11:14 +01:00
thomwolf	ea85cca8ab	adding optimize_on_cpu explanation in readme	2018-11-09 11:42:37 +01:00
thomwolf	5f04aa00ed	option to perform optimization and keep the optimizer averages on CPU	2018-11-09 11:28:14 +01:00
thomwolf	9e95cd8cd6	clean up optimizer from unused functions	2018-11-09 11:23:55 +01:00
thomwolf	34a1a01091	update code comment	2018-11-09 09:31:20 +01:00
thomwolf	34bdc8b54f	remove duplicate accumulate gradient step arguments	2018-11-09 09:19:45 +01:00
Thomas Wolf	0c24db9d5f	update results for SQuAD	2018-11-09 09:11:59 +01:00
thomwolf	2c5d993ba4	update readme - fix SQuAD model on multi-GPU	2018-11-08 21:22:22 +01:00
Gopal Krishna	4850ec5888	fixed small typos in the README.md (#8 )	2018-11-08 15:00:02 -05:00
Thomas Wolf	3bfbc21376	updating pytest command	2018-11-08 00:44:17 +01:00
Thomas Wolf	0ed7696191	Updated MRPC results	2018-11-08 00:39:42 +01:00
thomwolf	48d4a5317c	typo fix in output tuple	2018-11-07 23:51:12 +01:00
Thomas Wolf	d92a7f7721	Removing note on run_squad.py example	2018-11-07 23:37:55 +01:00
Thomas Wolf	5c0838d846	Merge pull request #7 from huggingface/develop Develop	2018-11-07 23:36:46 +01:00
Thomas Wolf	efeb6b1a0d	Merge branch 'master' into develop	2018-11-07 23:35:42 +01:00
thomwolf	dbc318a4c6	cleaning up - speeding up a bit multi-gpu	2018-11-07 22:22:55 +01:00
thomwolf	6bb7510a50	fixing pre-processing bug - averaging loss for gradient accumulation - no_grad on evaluation	2018-11-07 22:12:41 +01:00
lukovnikov	bd91ae654f	moved bert to qelos-util	2018-11-06 18:21:44 +01:00
lukovnikov	4e52188433	bert weight loading from tf	2018-11-06 17:47:03 +01:00
thomwolf	a1126237a9	clean up logits extraction logic	2018-11-06 17:31:15 +01:00
thomwolf	2a97fe220b	fixing weights initialization in the model and out of span clamping	2018-11-06 17:26:33 +01:00
thomwolf	907d3569c1	cleaning up SQuAD notebook - more explanation - fixing error	2018-11-06 11:13:43 +01:00
Thomas Wolf	1a5bbd83dc	Updating run_squad information in readme	2018-11-06 08:53:01 +01:00
Thomas Wolf	79e1b95e75	fix link in readme	2018-11-06 08:38:02 +01:00
Knut Ole Sjøli	886f595c37	Fix typo in subheader (#4 )	2018-11-05 18:34:18 -05:00
Yaser Martinez Palenzuela	4d124baf8f	Add test for Chinese tokenization	2018-11-05 23:04:29 +01:00
Thomas Wolf	59d4cc5f2b	typos	2018-11-05 22:47:24 +01:00
Yaser Martinez Palenzuela	0ce2f496dc	Port tokenization for the multilingual model	2018-11-05 22:34:12 +01:00
Thomas Wolf	d983eecdd3	more readme typo fixes	2018-11-05 21:29:04 +01:00
Thomas Wolf	8f91b4de91	more typo fixes	2018-11-05 21:24:14 +01:00
Thomas Wolf	7316b0d6d0	fix typo	2018-11-05 21:22:45 +01:00
Clement	d130cb5139	typos	2018-11-05 15:09:24 -05:00
Clement	2a8fee495b	typos	2018-11-05 15:04:06 -05:00
Clement	f968b11657	typo	2018-11-05 14:59:44 -05:00
thomwolf	88e793f31a	fix typos	2018-11-05 16:14:19 +01:00
thomwolf	3914eed505	update readme	2018-11-05 16:09:27 +01:00
thomwolf	bab5d13077	update optimizer documentation	2018-11-05 16:09:21 +01:00
thomwolf	7394eb47a5	update readme	2018-11-05 15:35:44 +01:00
thomwolf	e6646751ac	update notebooks	2018-11-05 15:02:50 +01:00
thomwolf	b705c9eff5	remove small script, moved notebooks to notebook folder	2018-11-05 14:55:08 +01:00
thomwolf	3a301d443b	update gitignore	2018-11-05 14:53:43 +01:00
thomwolf	711d3f9f2b	remove tensorflow_code	2018-11-05 14:53:03 +01:00

... 251 252 253 254 255 ...

12781 Commits All Branches Search

12781 Commits

All Branches