Patrick von Platen
d5db6c37d4
[Seq2Seq Templates] Fix check_repo.py templates file ( #9277 )
...
* add enc dec pt model to check repo
* fix indent
2020-12-23 11:40:20 +01:00
Xu Song
4bafc43b0e
Fix param error ( #9273 )
...
TypeError: forward() got an unexpected keyword argument 'token_type_ids'
2020-12-23 11:34:57 +01:00
Xu Song
58e8a7611f
Fix gpt2 document ( #9272 )
2020-12-23 11:34:15 +01:00
Patrick von Platen
cbe63949d7
Model Templates for Seq2Seq ( #9251 )
...
* adapt cookie cutter
* fix copy past statement
* delete copy statements for now
* remove unused import from template
* make doc rst
* correct config docstring
* correct training
* correct inputs processing tf enc dec
* make style
* adapt templates
* clean tabs
* correct tensor -> Tensor naming
* correct indent
* correct templates
* fix the test
* break lines to avoid > 119
* Apply suggestions from code review
2020-12-22 23:41:20 +01:00
Sylvain Gugger
e6c1f1cad8
Revert renaming in finetune_trainer ( #9262 )
2020-12-22 15:42:34 -05:00
Sylvain Gugger
ab17758874
Add speed metrics to all example scripts + template ( #9260 )
2020-12-22 14:02:26 -05:00
Julien Chaumond
5b5f7dd09c
[hf_api] Fix incorrect typing
2020-12-22 19:52:47 +01:00
Julien Plu
1558d191e6
Fix TF BART for saved model creation ( #9252 )
...
* Fix TF BART for saved model creation
* Apply style
* Update src/transformers/models/bart/modeling_tf_bart.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/bart/modeling_tf_bart.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Rework the fix
* Fix condition
* Apply style
* Fix condition
* Fix shape_list
* Apply Patrick's solution
* Apply Patrick's solution
* Rebase
* make tests pass
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
2020-12-22 18:07:04 +01:00
Manuel Romero
37d6fb5d04
Fix link to bertabs/README.md ( #9255 )
2020-12-22 11:41:23 -05:00
Manuel Romero
189c1b91a6
Fix link to old language modeling script ( #9254 )
2020-12-22 11:40:47 -05:00
Sylvain Gugger
490b39e614
Seq2seq trainer ( #9241 )
...
* Add label smoothing in Trainer
* Add options for scheduler and Adafactor in Trainer
* Put Seq2SeqTrainer in the main lib
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Address review comments and adapt scripts
* Documentation
* Move test not using script to tests folder
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-12-22 11:33:44 -05:00
Sylvain Gugger
1fc7119181
Fix script that check objects are documented ( #9259 )
2020-12-22 11:12:58 -05:00
Patrick von Platen
e9d77ccd5a
[EncoderDecoder] Make tests more aggressive ( #9256 )
...
* add tests
* make style and fix bart bug
* fix bart past key value edge case
* correct tf bart test
* fix gpt2 tf
* fix t5 test
2020-12-22 17:00:04 +01:00
Sylvain Gugger
ec07da65e2
Update the README of the text classification example ( #9237 )
...
* Update the README of the text classification example
* Update examples/README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Adapt comment from review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-12-21 15:23:40 -05:00
Teven
4eef5889ac
Adding performer fine-tuning research exampke ( #9239 )
...
* added run_mlm_performer.py research example
* make styke
* make styke
* Added a README !
2020-12-21 21:19:41 +01:00
Patrick von Platen
9a12b9696f
[MPNet] Add slow to fast tokenizer converter ( #9233 )
...
* add converter
* delet unnecessary comments
2020-12-21 15:41:34 +01:00
Suraj Patil
f4432b7e01
add base model classes to bart subclassed models ( #9230 )
...
* add base model classes to bart subclassed models
* add doc
2020-12-21 19:56:46 +05:30
TobiasNorlund
08abdabda1
Fixed beam search generation for GPT2 and T5 ( #9219 )
2020-12-21 08:05:23 -05:00
Julien Plu
161a6461db
Fix TF template ( #9234 )
2020-12-21 13:52:16 +01:00
Julien Plu
5a8a4eb187
Improve BERT-like models performance with better self attention ( #9124 )
...
* Improve BERT-like models attention layers
* Apply style
* Put back error raising instead of assert
* Update template
* Fix copies
* Apply raising valueerror in MPNet
* Restore the copy check for the Intermediate layer in Longformer
* Update longformer
2020-12-21 13:10:15 +01:00
Patrick von Platen
6b034309ca
fix warning ( #9231 )
2020-12-21 10:41:34 +01:00
Amog Kamsetty
a4b21cdd20
[RAG] Add Ray implementation for distributed retrieval ( #9197 )
...
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* uncomment
* uncomment
* wip
* updates
* add docstring
* updates
* fix arg
* fixes
* add unit tests
* update readme
* update readme
* update finetune script
* update test
* add test
* add ray to test dependencies
* separate ray and ray tune
* formatting
* shutdown ray at end of test
* fix tests
* formatting
* formatting
* even more formatting
* address comments
* formatting
* add files
* Update examples/research_projects/rag/test_distributed_retriever.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* address comments
* addressing comments
Co-authored-by: Ubuntu <ubuntu@ip-172-31-21-208.us-west-2.compute.internal>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-12-21 10:39:30 +01:00
Stas Bekman
f38c4ad302
better logging and help ( #9203 )
2020-12-20 10:28:28 -08:00
sandip
e0e255be1f
Added TF TransfoXL Sequence Classification ( #9169 )
...
* TF Transfoxl seq classification
* Update test_modeling_tf_transfo_xl.py
Added num_labels to config level
* TF Transfoxl seq classification
* Update test_modeling_tf_transfo_xl.py
Added num_labels to config level
* code refactor
* code refactor
* code refator
2020-12-19 14:44:04 +01:00
Stas Bekman
6b850b671d
[run_glue] add speed metrics ( #9198 )
...
* add speed metrics
* suggestions
2020-12-18 17:09:30 -08:00
Stas Bekman
3ff5e8955a
[t5 doc] typos ( #9199 )
...
* [t5 doc] typos
a few run away backticks
@sgugger
* style
2020-12-18 16:03:26 -08:00
Aleksey Tikhonov
291974c65c
GPT-model attention heads pruning example ( #9189 )
...
* Pruning for GPT attn heads
* The code formatted according to the transformers requirements
* Update run_prune_gpt.py
* Update run_prune_gpt.py
2020-12-18 16:32:10 -05:00
Sylvain Gugger
1198ba8fba
Add timing inside Trainer ( #9196 )
...
* Add timing inside Trainer
* Fix tests
* Add n_objs for train
* Sort logs
2020-12-18 15:10:39 -05:00
Sylvain Gugger
9a25c5bd3a
Add new run_swag example ( #9175 )
...
* Add new run_swag example
* Add check
* Add sample
* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Very important change to make Lysandre happy
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-12-18 14:19:24 -05:00
Sylvain Gugger
3e56e2ce04
Fix typo
2020-12-18 10:11:07 -05:00
Manuel Romero
077a5dce32
Fix link to old SQUAD fine-tuning script ( #9181 )
2020-12-18 09:12:10 -05:00
Stas Bekman
84d5879eaf
[setup] correct transformers version format ( #9176 )
...
setuptools has a pretty fixed expectation of version numbers.
This PR fixes the dev version number and adds a comment with correct formats for the future editors
This fix removes this warning on `make fixup|style|etc` or any other time `setup.py` is being run.
```
setuptools/dist.py:452: UserWarning: Normalizing '4.2.0dev0' to '4.2.0.dev0'
warnings.warn(tmpl.format(**locals()))
```
and the alternative:
```
/setuptools/dist.py:452: UserWarning: Normalizing '4.0.0-rc-1' to '4.0.0rc1
```
Fixes : #8749
@LysandreJik, @sgugger
2020-12-18 08:55:55 -05:00
Wissam Antoun
fd7b6a5274
fixed JSON error in run_qa with fp16 ( #9186 )
2020-12-18 07:53:23 -05:00
Manuel Romero
66a14a2f6f
Fix link to old NER fine-tuning script ( #9182 )
2020-12-17 19:50:01 -05:00
Stas Bekman
f06d0fadc9
[trainer] apex fixes and tests ( #9180 )
2020-12-17 16:49:11 -08:00
sandip
467e9158b4
Added TF CTRL Sequence Classification ( #9151 )
...
* Added TF CTRL Sequence Classification
* code refactor
2020-12-17 18:10:57 -05:00
Stas Bekman
63841c559b
add tests for the new sharded ddp fairscale integration ( #9177 )
2020-12-17 14:24:03 -08:00
Lysandre
bf713cdec7
setup.py development version
2020-12-17 11:29:31 -05:00
Lysandre
bd40345d3e
v4.1.1 docs
2020-12-17 11:28:38 -05:00
Lysandre
bfa4ccf77d
Release: v4.1.1
2020-12-17 11:25:49 -05:00
Lysandre
e0790cca78
Fix TAPAS doc
2020-12-17 11:25:05 -05:00
Sylvain Gugger
6d2e864db7
Put all models in the constants ( #9170 )
...
* Put all models in the constants
* Add Google AI mention in the main README
2020-12-17 11:23:21 -05:00
Lysandre
f83d9c8da7
v4.1.0 docs
2020-12-17 10:16:07 -05:00
Lysandre
f5438ab8a2
Release: v4.1.0
2020-12-17 10:04:55 -05:00
Lysandre
ac2c7e398f
Remove erroneous character
2020-12-17 09:47:19 -05:00
Sylvain Gugger
77d6941e64
Fix gradient clipping for Sharded DDP ( #9168 )
...
* Fix gradient clipping for Sharded DDP
* Fix typos in comments
2020-12-17 09:44:24 -05:00
Lysandre Debut
1aca3d6afa
Add disclaimer to TAPAS rst file ( #9167 )
...
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
2020-12-17 09:34:06 -05:00
Lysandre
dc9f245442
Torch scatter with torch 1.7.0
2020-12-16 13:48:57 -05:00
Sylvain Gugger
9a67185344
Experimental support for fairscale ShardedDDP ( #9139 )
...
* Experimental stupport for fairscale ShardedDDP
* Add import error if fairscale not available
* Address review comments
* Fix seq2seq trainer
2020-12-16 13:47:48 -05:00
Lysandre Debut
1c1a2ffbff
TableQuestionAnsweringPipeline ( #9145 )
...
* AutoModelForTableQuestionAnswering
* TableQuestionAnsweringPipeline
* Apply suggestions from Patrick's code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Sylvain and Patrick comments
* Better PyTorch/TF error message
* Add integration tests
* Argument Handler naming
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
* Fix docs to appease the documentation gods
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-12-16 12:31:50 -05:00