Sam Shleifer | 09a2f40684 | Seq2SeqDataset uses linecache to save memory by @Pradhy729 (#5792). Co-authored-by: Pradhy729 <49659913+Pradhy729@users.noreply.github.com> | 2020-07-18 13:57:33 -04:00
Sam Shleifer | dad5e12e54 | [seq2seq] distillation.py accepts trainer arguments (#5865) | 2020-07-18 07:43:57 -04:00
Sam Shleifer | ba2400189b | [seq2seq] MAX_LEN env var for MT commands (#5837) | 2020-07-17 22:51:31 -04:00
Nathan Raw | 529850ae7b | Lightning Updates for v0.8.5 (#5798). Co-authored-by: Sam Shleifer <sshleifer@gmail.com> | 2020-07-17 22:43:06 -04:00
Sam Shleifer | 353b8f1e7a | Add mbart-large-cc25, support translation finetuning (#5129); improve unittests for finetuning, especially w.r.t testing frozen parameters; fix freeze_embeds for T5; add streamlit setup.cfg | 2020-07-07 13:23:01 -04:00
Sam Shleifer | 27a7fe7a8d | examples/seq2seq: never override $WANDB_PROJECT (#5407) | 2020-06-30 15:29:13 -04:00
Kevin Canwen Xu | 331d8d2936 | Upload DistilBART artwork (#5394) | 2020-06-30 18:11:11 +08:00
Sam Shleifer | a316a6aaa8 | [seq2seq docs] Move evaluation down, fix typo (#5365) | 2020-06-29 10:36:04 -04:00
Sam Shleifer | 393b8dc09a | examples/seq2seq/run_eval.py fixes and docs (#5322) | 2020-06-26 19:20:43 -04:00
Sam Shleifer | 5543b30aa6 | [pl_examples] default warmup steps=0 (#5316) | 2020-06-26 15:03:41 -04:00
Sam Shleifer | e008d520bb | [examples/seq2seq] more README improvements (#5274) | 2020-06-25 10:13:01 -04:00
Sam Shleifer | 40457bcebb | examples/seq2seq supports translation (#5202) | 2020-06-24 23:58:11 -04:00