transformers

Commit Graph

Author	SHA1	Message	Date
thomwolf	d6418c5ef3	tweaking the readme	2018-11-03 23:52:35 +01:00
thomwolf	3b70b270e0	update readme	2018-11-03 23:39:55 +01:00
thomwolf	eaa6db92f1	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT	2018-11-03 23:35:16 +01:00
thomwolf	f8276008df	update readme, file names, removing TF code, moving tests	2018-11-03 23:35:14 +01:00
Ubuntu	f18ae210e1	fix typo	2018-11-03 22:34:37 +00:00
VictorSanh	3c24e4bef1	Multi-Gpu loss - Cleaning	2018-11-03 18:03:17 -04:00
Tim Rault	5de1517d6b	WIP modeling_test_pytorch.py	2018-11-03 22:40:50 +01:00
VictorSanh	1ba5b58c20	fix typo	2018-11-03 17:10:23 -04:00
VictorSanh	5858e8e4dd	Fix both loss and eval metrics -> more coherence on the loss (eval vs train and tf vs pt)	2018-11-03 16:48:24 -04:00
VictorSanh	cd09cd5b40	Fix import on initialization	2018-11-03 15:38:30 -04:00
Tim Rault	ec66841afa	WIP	2018-11-03 19:12:20 +01:00
thomwolf	139873f6e3	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT	2018-11-03 19:06:17 +01:00
thomwolf	04287a4d68	special edition script	2018-11-03 19:06:15 +01:00
VictorSanh	a1af5247e1	Add seed in initialization	2018-11-03 14:00:36 -04:00
Ubuntu	4faeb38b51	Fix loss logging for multi-gpu compatibility	2018-11-03 17:52:51 +00:00
thomwolf	25f73add07	update optimizer run_squad	2018-11-03 17:56:34 +01:00
thomwolf	f514cbbf30	update run_squad with tqdm	2018-11-03 17:52:44 +01:00
thomwolf	cb76c1ddd3	add model.zero_grad()	2018-11-03 17:40:12 +01:00
thomwolf	a4086c5de5	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT	2018-11-03 17:38:17 +01:00
thomwolf	088ad45888	fixing optimization	2018-11-03 17:38:15 +01:00
VictorSanh	8bd6b235b7	typo on tokenization	2018-11-03 10:27:59 -04:00
VictorSanh	2c55568c40	`scatter_` and `scatter`	2018-11-03 10:27:38 -04:00
VictorSanh	a6efe1235f	Merge pull request #1 from huggingface/multi-gpu-support: Create DataParallel model if several GPUs	2018-11-03 10:10:34 -04:00
VictorSanh	5f432480c0	Create DataParallel model if several GPUs	2018-11-03 10:10:01 -04:00
VictorSanh	5889765a7c	Update README.md	2018-11-03 09:18:44 -04:00
VictorSanh	8c932e37f9	Update the comparison notebook	2018-11-03 09:08:05 -04:00
VictorSanh	391a4ec2f3	Small typo in `trange`. I seriously don't understand why they defined num_train_epochs as a float in the original tf code. I will change it at the end to avoid merge conflicts for now.	2018-11-03 08:25:15 -04:00
VictorSanh	5676d6f799	Remove BERT pretraining files for now	2018-11-03 08:17:22 -04:00
Tim Rault	8ec457d370	Fix imports	2018-11-03 12:35:21 +01:00
thomwolf	852e4b3c00	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT	2018-11-03 12:23:04 +01:00
thomwolf	0d8d2285ba	fix optimization_test	2018-11-03 12:23:00 +01:00
Tim Rault	574e20a92e	Move modeling_test.py to /tensorflow_code	2018-11-03 12:22:06 +01:00
thomwolf	45efc9d807	removing f-string	2018-11-03 11:46:18 +01:00
thomwolf	4df6027453	clean up	2018-11-03 11:33:10 +01:00
thomwolf	e6d106a01b	comment on gelu function	2018-11-03 10:47:10 +01:00
thomwolf	01b1a05343	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT	2018-11-03 03:11:15 +01:00
thomwolf	8aa22af0c5	fixing model	2018-11-03 03:11:13 +01:00
VictorSanh	72ab10399f	Fix loss. Please review @thomwolf, but I think this is equivalent (and it mimics the loss computation of the original loss)	2018-11-02 18:06:21 -04:00
VictorSanh	25d5ca48e0	Fix scatter LongTensor	2018-11-02 17:57:46 -04:00
VictorSanh	e6a710f684	device	2018-11-02 17:54:22 -04:00
VictorSanh	3ebf1a13c9	Fix loss computation for indexes bigger than max_seq_length.	2018-11-02 17:49:35 -04:00
Tim Rault	629bd006bf	Convert optimization_test.py to PyTorch	2018-11-02 17:50:17 +01:00
VictorSanh	38f740a1d5	Fix bug writing predictions in run_squad_pytorch	2018-11-02 11:29:45 -04:00
VictorSanh	ee29871f8d	Debug run_squad_pytorch	2018-11-02 11:07:32 -04:00
VictorSanh	101eabff90	Debug run_squad_pytorch	2018-11-02 10:44:08 -04:00
VictorSanh	bb0a510330	Print for debug run_squad	2018-11-02 10:16:07 -04:00
thomwolf	c84315ec35	model fixes + ipnb fixes	2018-11-02 15:11:16 +01:00
Tim Rault	3ff2ec5eb3	Move command-line argparse arguments into main() function	2018-11-02 14:42:05 +01:00
thomwolf	c9690e57f8	adding jupyter, updating extract features adding simple test file	2018-11-02 14:25:21 +01:00
VictorSanh	844b2f0e6f	Small update Readme	2018-11-02 08:57:15 -04:00

