Sam Shleifer
5ab21b072f
[s2s] Test hub configs in self-scheduled CI ( #6809 )
2020-08-28 17:05:52 -04:00
Sam Shleifer
3cac867fac
t5 model should make decoder_attention_mask ( #6800 )
2020-08-28 15:22:33 -04:00
Sam Shleifer
20f7786453
Fix style ( #6803 )
2020-08-28 15:02:25 -04:00
Sam Shleifer
9336086ab5
prepare_seq2seq_batch makes labels/ decoder_input_ids made later. ( #6654 )
...
* broken test
* batch parity
* tests pass
* boom boom
* boom boom
* split out bart tokenizer tests
* fix tests
* boom boom
* Fixed dataset bug
* Fix marian
* Undo extra
* Get marian working
* Fix t5 tok tests
* Test passing
* Cleanup
* better assert msg
* require torch
* Fix mbart tests
* undo extra decoder_attn_mask change
* Fix import
* pegasus tokenizer can ignore src_lang kwargs
* unused kwarg test cov
* boom boom
* add todo for pegasus issue
* cover one word translation edge case
* Cleanup
* doc
2020-08-28 11:15:17 -04:00
RafaelWO
cb276b41de
Transformer-XL: Improved tokenization with sacremoses ( #6322 )
...
* Improved tokenization with sacremoses
* The TransfoXLTokenizer is now using sacremoses for tokenization
* Added tokenization of comma-separated and floating point numbers.
* Removed prepare_for_tokenization() from tokenization_transfo_xl.py because punctuation is handled by sacremoses
* Added corresponding tests
* Removed test comapring TransfoXLTokenizer and TransfoXLTokenizerFast
* Added deprecation warning to TransfoXLTokenizerFast
* isort change
Co-authored-by: Teven <teven.lescao@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-08-28 09:56:17 -04:00
Ahmed Elnaggar
930153e7d2
Add ProtBert model card ( #6764 )
2020-08-28 12:12:28 +08:00
Stas Bekman
743d131d76
[style] set the minimal required version for `black` ( #6784 )
...
`make style` with `black` < 20.8b1 is a no go (in case some other package forced a lower version) - so make it explicit to avoid confusion
2020-08-28 11:38:09 +08:00
Sam Shleifer
fb78a90d6a
PL: --adafactor option ( #6776 )
2020-08-27 22:19:46 -04:00
Stas Bekman
92ac2fa7d1
[transformers-cli] fix logger getter ( #6777 )
2020-08-27 20:01:17 -04:00
Lysandre
42fddacd1c
Format
2020-08-27 18:31:51 +02:00
Stas Bekman
70fccc5cf3
new Makefile target: docs ( #6510 )
...
* [doc] multiple corrections to "Summary of the tasks"
* add a new "docs" target to validate docs and document it
* fix mixup
2020-08-27 12:25:16 -04:00
Stas Bekman
dbfe34f2f5
[test schedulers] adjust to test the first step's reading ( #6429 )
...
* [test schedulers] small improvement
* cleanup
2020-08-27 12:23:28 -04:00
Stas Bekman
e6b811f0a7
[testing] replace hardcoded paths to allow running tests from anywhere ( #6523 )
...
* [testing] replace hardcoded paths to allow running tests from anywhere
* fix the merge conflict
2020-08-27 12:22:18 -04:00
Sam Shleifer
9d1b4db2aa
add nlp install ( #6767 )
2020-08-27 11:08:14 -04:00
Tom Grek
c225e872ed
Fix it to work with BART ( #6756 )
2020-08-27 09:04:50 -04:00
Lysandre
0d2c111a0c
Format
2020-08-27 14:56:47 +02:00
Julien Plu
6f289dc97a
Fix the TF Trainer gradient accumulation and the TF NER example ( #6713 )
...
* Align TF NER example over the PT one
* Fix Dataset call
* Fix gradient accumulation training
* Apply style
* Address Sylvain's comments
* Address Sylvain's comments
* Apply style
2020-08-27 08:45:34 -04:00
Lysandre Debut
41aa2b4ef1
Adafactor docs ( #6765 )
2020-08-27 05:16:50 -04:00
Nikolai Yakovenko
971d1802d0
Add AdaFactor optimizer from fairseq ( #6722 )
...
* AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to ADAM.
* update PR fixes, add basic test
* bug -- incorrect params in test
* bugfix -- import Adafactor into test
* bugfix -- removed accidental T5 include
* resetting T5 to master
* bugfix -- include Adafactor in __init__
* longer loop for adafactor test
* remove double error class declare
* lint
* black
* isort
* Update src/transformers/optimization.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
* single docstring
* Cleanup docstring
Co-authored-by: Nikolai Y <nikolai.yakovenko@point72.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-08-27 04:58:13 -04:00
Sam Shleifer
4bd7be9a42
s2s distillation uses AutoModelForSeqToSeqLM ( #6761 )
2020-08-26 23:25:11 -04:00
Ahmed Elnaggar
05e7150a53
create ProtBert-BFD model card. ( #6724 )
2020-08-27 02:19:19 +02:00
Sam Shleifer
61518e2df3
[s2s] run_eval.py QOL improvements and cleanup( #6746 )
2020-08-26 18:59:20 -04:00
Igli Manaj
434936f34a
Model Card for Multilingual Passage Reranking BERT ( #6755 )
2020-08-26 18:00:27 -04:00
Joe Davison
10a34501f1
add __init__.py to utils ( #6754 )
2020-08-26 23:51:10 +02:00
Ali Safaya
61b9ed8074
Model card for kuisailab/albert-large-arabic ( #6730 )
...
* Create README.md
* Update README.md
2020-08-26 17:27:56 -04:00
Ali Safaya
8e0d51e4f2
Model card for kuisailab/albert-xlarge-arabic ( #6731 )
...
* Create README.md
* Update README.md
2020-08-26 17:27:42 -04:00
Ali Safaya
70c96a10e9
Model card for kuisailab/albert-base-arabic ( #6729 )
...
* Create README.md
* Update README.md
2020-08-26 17:27:34 -04:00
Sagor Sarker
cc4ba79f68
added model card for codeswitch-spaeng-sentiment-analysis-lince ( #6727 )
...
* added model card for codeswitch-spaeng-sentiment-analysis-lince model also update other model card
* fixed typo
* fixed typo
* fixed typo
* fixed typo
* fixed typo
* fixed typo
* fixed typo
* Update README.md
2020-08-26 17:26:32 -04:00
Tanmay Thakur
e10fb9cbe6
Create model card for lordtt13/COVID-SciBERT ( #6718 )
2020-08-26 17:22:25 -04:00
Adam Montgomerie
baeba53e88
Adding model cards for 5 models ( #6703 )
...
* Added model cards for 4 models
Added model cards for:
- roberta-base-bulgarian
- roberta-base-bulgarian-pos
- roberta-small-bulgarian
- roberta-small-bulgarian-pos
* fixed link text
* Update README.md
* Create README.md
* removed trailing bracket
* Add language metadata
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-08-26 17:20:55 -04:00
Julien Chaumond
3242e4d942
[model_cards] Fix tiny typos
2020-08-26 23:16:06 +02:00
Joe Davison
99407f9d1e
add xlm-roberta-large-xnli model card ( #6723 )
...
* add xlm-roberta-large-xnli model card
* update pt example
* typo
2020-08-26 16:05:59 -04:00
Patrick von Platen
858b7d5873
[TF Longformer] Improve Speed for TF Longformer ( #6447 )
...
* add tf graph compile tests
* fix conflict
* remove more tf transpose statements
* fix conflicts
* fix comment typos
* move function to class function
* fix black
* fix black
* make style
2020-08-26 14:55:41 -04:00
Lysandre
a75c64d80c
Black 20 release
2020-08-26 17:20:22 +02:00
Lysandre
e78c110338
isort 5
2020-08-26 17:13:49 +02:00
Julien Plu
02e8cd5584
Fix optimizer ( #6717 )
2020-08-26 11:12:44 -04:00
Lysandre Debut
77abd1e79f
Centralize logging ( #6434 )
...
* Logging
* Style
* hf_logging > utils.logging
* Address @thomwolf's comments
* Update test
* Update src/transformers/benchmark/benchmark_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Revert bad change
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-26 11:10:36 -04:00
Jay Yip
461ae86812
Fix tf boolean mask in graph mode ( #6741 )
2020-08-26 05:15:35 -04:00
Patrick von Platen
925f34bbbd
Add "tie_word_embeddings" config param ( #6692 )
...
* add tie_word_embeddings
* correct word embeddings in modeling utils
* make style
* make config param only relevant for torch
* make style
* correct typo
* delete deprecated arg in transo-xl
2020-08-26 04:58:21 -04:00
Patrick von Platen
fa8ee8e855
fix torchscript docs ( #6740 )
2020-08-26 04:51:56 -04:00
Sylvain Gugger
64c7c2bc15
Install nlp for github actions test ( #6728 )
2020-08-25 14:58:38 -04:00
Sam Shleifer
624495706c
T5Tokenizer adds EOS token if not already added ( #5866 )
...
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-25 14:56:08 -04:00
Sam Shleifer
e11d923bfc
Fix pegasus-xsum integration test ( #6726 )
2020-08-25 14:06:28 -04:00
Tomo Lazovich
7e6397a7d8
[squad] make examples and dataset accessible from SquadDataset object ( #6710 )
...
* [squad] make examples and dataset accessible from SquadDataset object
* [squad] add support for legacy cache files
2020-08-25 13:32:56 -04:00
Funtowicz Morgan
ac9702c284
Fix ONNX test_quantize unittest ( #6716 )
2020-08-25 13:24:40 -04:00
Zane Lim
074340339a
Create README.md ( #6721 )
...
add model card for singbert large
2020-08-26 00:11:24 +08:00
Patrick von Platen
d17cce2270
add missing keys ( #6719 )
2020-08-25 11:38:51 -04:00
Arnav Sharma
a25c9fc8e1
Selected typo fix ( #6687 )
2020-08-25 15:39:02 +02:00
Funtowicz Morgan
625318f525
tensor.nonzero() is deprecated in PyTorch 1.6 ( #6715 )
...
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
2020-08-25 08:12:54 -04:00
Sylvain Gugger
124c3d6adc
Add tokenizer to Trainer ( #6689 )
2020-08-25 07:47:09 -04:00