transformers

Commit Graph

Author	SHA1	Message	Date
Sam Shleifer	5ab21b072f	[s2s] Test hub configs in self-scheduled CI (#6809 )	2020-08-28 17:05:52 -04:00
Sam Shleifer	3cac867fac	t5 model should make decoder_attention_mask (#6800 )	2020-08-28 15:22:33 -04:00
Sam Shleifer	20f7786453	Fix style (#6803 )	2020-08-28 15:02:25 -04:00
Sam Shleifer	9336086ab5	prepare_seq2seq_batch makes labels/ decoder_input_ids made later. (#6654 ) * broken test * batch parity * tests pass * boom boom * boom boom * split out bart tokenizer tests * fix tests * boom boom * Fixed dataset bug * Fix marian * Undo extra * Get marian working * Fix t5 tok tests * Test passing * Cleanup * better assert msg * require torch * Fix mbart tests * undo extra decoder_attn_mask change * Fix import * pegasus tokenizer can ignore src_lang kwargs * unused kwarg test cov * boom boom * add todo for pegasus issue * cover one word translation edge case * Cleanup * doc	2020-08-28 11:15:17 -04:00
RafaelWO	cb276b41de	Transformer-XL: Improved tokenization with sacremoses (#6322 ) * Improved tokenization with sacremoses * The TransfoXLTokenizer is now using sacremoses for tokenization * Added tokenization of comma-separated and floating point numbers. * Removed prepare_for_tokenization() from tokenization_transfo_xl.py because punctuation is handled by sacremoses * Added corresponding tests * Removed test comapring TransfoXLTokenizer and TransfoXLTokenizerFast * Added deprecation warning to TransfoXLTokenizerFast * isort change Co-authored-by: Teven <teven.lescao@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-08-28 09:56:17 -04:00
Ahmed Elnaggar	930153e7d2	Add ProtBert model card (#6764 )	2020-08-28 12:12:28 +08:00
Stas Bekman	743d131d76	[style] set the minimal required version for `black` (#6784 ) `make style` with `black` < 20.8b1 is a no go (in case some other package forced a lower version) - so make it explicit to avoid confusion	2020-08-28 11:38:09 +08:00
Sam Shleifer	fb78a90d6a	PL: --adafactor option (#6776 )	2020-08-27 22:19:46 -04:00
Stas Bekman	92ac2fa7d1	[transformers-cli] fix logger getter (#6777 )	2020-08-27 20:01:17 -04:00
Lysandre	42fddacd1c	Format	2020-08-27 18:31:51 +02:00
Stas Bekman	70fccc5cf3	new Makefile target: docs (#6510 ) * [doc] multiple corrections to "Summary of the tasks" * add a new "docs" target to validate docs and document it * fix mixup	2020-08-27 12:25:16 -04:00
Stas Bekman	dbfe34f2f5	[test schedulers] adjust to test the first step's reading (#6429 ) * [test schedulers] small improvement * cleanup	2020-08-27 12:23:28 -04:00
Stas Bekman	e6b811f0a7	[testing] replace hardcoded paths to allow running tests from anywhere (#6523 ) * [testing] replace hardcoded paths to allow running tests from anywhere * fix the merge conflict	2020-08-27 12:22:18 -04:00
Sam Shleifer	9d1b4db2aa	add nlp install (#6767 )	2020-08-27 11:08:14 -04:00
Tom Grek	c225e872ed	Fix it to work with BART (#6756 )	2020-08-27 09:04:50 -04:00
Lysandre	0d2c111a0c	Format	2020-08-27 14:56:47 +02:00
Julien Plu	6f289dc97a	Fix the TF Trainer gradient accumulation and the TF NER example (#6713 ) * Align TF NER example over the PT one * Fix Dataset call * Fix gradient accumulation training * Apply style * Address Sylvain's comments * Address Sylvain's comments * Apply style	2020-08-27 08:45:34 -04:00
Lysandre Debut	41aa2b4ef1	Adafactor docs (#6765 )	2020-08-27 05:16:50 -04:00
Nikolai Yakovenko	971d1802d0	Add AdaFactor optimizer from fairseq (#6722 ) * AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to ADAM. * update PR fixes, add basic test * bug -- incorrect params in test * bugfix -- import Adafactor into test * bugfix -- removed accidental T5 include * resetting T5 to master * bugfix -- include Adafactor in __init__ * longer loop for adafactor test * remove double error class declare * lint * black * isort * Update src/transformers/optimization.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * single docstring * Cleanup docstring Co-authored-by: Nikolai Y <nikolai.yakovenko@point72.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-08-27 04:58:13 -04:00
Sam Shleifer	4bd7be9a42	s2s distillation uses AutoModelForSeqToSeqLM (#6761 )	2020-08-26 23:25:11 -04:00
Ahmed Elnaggar	05e7150a53	create ProtBert-BFD model card. (#6724 )	2020-08-27 02:19:19 +02:00
Sam Shleifer	61518e2df3	[s2s] run_eval.py QOL improvements and cleanup(#6746 )	2020-08-26 18:59:20 -04:00
Igli Manaj	434936f34a	Model Card for Multilingual Passage Reranking BERT (#6755 )	2020-08-26 18:00:27 -04:00
Joe Davison	10a34501f1	add __init__.py to utils (#6754 )	2020-08-26 23:51:10 +02:00
Ali Safaya	61b9ed8074	Model card for kuisailab/albert-large-arabic (#6730 ) * Create README.md * Update README.md	2020-08-26 17:27:56 -04:00
Ali Safaya	8e0d51e4f2	Model card for kuisailab/albert-xlarge-arabic (#6731 ) * Create README.md * Update README.md	2020-08-26 17:27:42 -04:00
Ali Safaya	70c96a10e9	Model card for kuisailab/albert-base-arabic (#6729 ) * Create README.md * Update README.md	2020-08-26 17:27:34 -04:00
Sagor Sarker	cc4ba79f68	added model card for codeswitch-spaeng-sentiment-analysis-lince (#6727 ) * added model card for codeswitch-spaeng-sentiment-analysis-lince model also update other model card * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * Update README.md	2020-08-26 17:26:32 -04:00
Tanmay Thakur	e10fb9cbe6	Create model card for lordtt13/COVID-SciBERT (#6718 )	2020-08-26 17:22:25 -04:00
Adam Montgomerie	baeba53e88	Adding model cards for 5 models (#6703 ) * Added model cards for 4 models Added model cards for: - roberta-base-bulgarian - roberta-base-bulgarian-pos - roberta-small-bulgarian - roberta-small-bulgarian-pos * fixed link text * Update README.md * Create README.md * removed trailing bracket * Add language metadata Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-08-26 17:20:55 -04:00
Julien Chaumond	3242e4d942	[model_cards] Fix tiny typos	2020-08-26 23:16:06 +02:00
Joe Davison	99407f9d1e	add xlm-roberta-large-xnli model card (#6723 ) * add xlm-roberta-large-xnli model card * update pt example * typo	2020-08-26 16:05:59 -04:00
Patrick von Platen	858b7d5873	[TF Longformer] Improve Speed for TF Longformer (#6447 ) * add tf graph compile tests * fix conflict * remove more tf transpose statements * fix conflicts * fix comment typos * move function to class function * fix black * fix black * make style	2020-08-26 14:55:41 -04:00
Lysandre	a75c64d80c	Black 20 release	2020-08-26 17:20:22 +02:00
Lysandre	e78c110338	isort 5	2020-08-26 17:13:49 +02:00
Julien Plu	02e8cd5584	Fix optimizer (#6717 )	2020-08-26 11:12:44 -04:00
Lysandre Debut	77abd1e79f	Centralize logging (#6434 ) * Logging * Style * hf_logging > utils.logging * Address @thomwolf's comments * Update test * Update src/transformers/benchmark/benchmark_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Revert bad change Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-26 11:10:36 -04:00
Jay Yip	461ae86812	Fix tf boolean mask in graph mode (#6741 )	2020-08-26 05:15:35 -04:00
Patrick von Platen	925f34bbbd	Add "tie_word_embeddings" config param (#6692 ) * add tie_word_embeddings * correct word embeddings in modeling utils * make style * make config param only relevant for torch * make style * correct typo * delete deprecated arg in transo-xl	2020-08-26 04:58:21 -04:00
Patrick von Platen	fa8ee8e855	fix torchscript docs (#6740 )	2020-08-26 04:51:56 -04:00
Sylvain Gugger	64c7c2bc15	Install nlp for github actions test (#6728 )	2020-08-25 14:58:38 -04:00
Sam Shleifer	624495706c	T5Tokenizer adds EOS token if not already added (#5866 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-25 14:56:08 -04:00
Sam Shleifer	e11d923bfc	Fix pegasus-xsum integration test (#6726 )	2020-08-25 14:06:28 -04:00
Tomo Lazovich	7e6397a7d8	[squad] make examples and dataset accessible from SquadDataset object (#6710 ) * [squad] make examples and dataset accessible from SquadDataset object * [squad] add support for legacy cache files	2020-08-25 13:32:56 -04:00
Funtowicz Morgan	ac9702c284	Fix ONNX test_quantize unittest (#6716 )	2020-08-25 13:24:40 -04:00
Zane Lim	074340339a	Create README.md (#6721 ) add model card for singbert large	2020-08-26 00:11:24 +08:00
Patrick von Platen	d17cce2270	add missing keys (#6719 )	2020-08-25 11:38:51 -04:00
Arnav Sharma	a25c9fc8e1	Selected typo fix (#6687 )	2020-08-25 15:39:02 +02:00
Funtowicz Morgan	625318f525	tensor.nonzero() is deprecated in PyTorch 1.6 (#6715 ) Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2020-08-25 08:12:54 -04:00
Sylvain Gugger	124c3d6adc	Add tokenizer to Trainer (#6689 )	2020-08-25 07:47:09 -04:00

5250 Commits, All Branches