Julien Chaumond
d4c2cb402d
Kill model archive maps ( #4636 )
...
* Kill model archive maps
* Fixup
* Also kill model_archive_map for MaskedBertPreTrainedModel
* Unhook config_archive_map
* Tokenizers: align with model id changes
* make style && make quality
* Fix CI
2020-06-02 09:39:33 -04:00
Patrick von Platen
47a551d17b
[pipeline] Tokenizer should not add special tokens for text generation ( #4686 )
...
* allow to not add special tokens
* remove print
2020-06-02 11:03:46 +02:00
Funtowicz Morgan
f6d5046af1
Override get_vocab for fast tokenizer. ( #4717 )
2020-06-02 11:02:27 +02:00
Lysandre Debut
88762a2f8c
Specify PyTorch versions for examples ( #4710 )
2020-06-02 04:29:28 -04:00
Lorenzo Ampil
d3ef14f931
Add community notebook for sentiment span extraction ( #4700 )
2020-06-02 09:59:53 +02:00
Sylvain Gugger
7677936316
Make docstring match args ( #4711 )
2020-06-01 15:22:51 -04:00
Lysandre
6449c494d0
close #4685
2020-06-01 12:57:52 -04:00
Julien Chaumond
ec8717d5d8
[config] Ensure that id2label always takes precedence over num_labels
2020-06-01 16:54:55 +02:00
Julien Chaumond
751a1e0890
[config] Ensure that id2label always takes precedence over num_labels
...
Fixes bug reported in https://github.com/huggingface/transformers/issues/4669
See #3967 for context
2020-06-01 16:25:56 +02:00
Rens
ec62b7d953
Fix onnx export input names order ( #4641 )
...
* pass on tokenizer to pipeline
* order input names when convert to onnx
* update style
* remove unused imports
* make ordered inputs list needs to be mutable
* add test custom bert model
* remove unused imports
2020-06-01 16:12:48 +02:00
Victor SANH
bf760c80b5
finish README
2020-06-01 09:23:31 -04:00
Victor SANH
9d7d9b3ae0
weird import
2020-06-01 09:23:31 -04:00
Victor SANH
2a3c88a659
Update examples/movement-pruning/README.md
...
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-06-01 09:23:31 -04:00
Victor SANH
4ac462bfb8
Update examples/movement-pruning/README.md
...
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-06-01 09:23:31 -04:00
Victor SANH
35fa0bbca0
clarify README
2020-06-01 09:23:31 -04:00
Victor SANH
cc746a5020
flake8 compliance
2020-06-01 09:23:31 -04:00
Victor SANH
b11386e158
less prints in saving prunebert
2020-06-01 09:23:31 -04:00
Victor SANH
8b5d4003ab
complete README
2020-06-01 09:23:31 -04:00
Victor SANH
5c8e5b3709
commplying with isort
2020-06-01 09:23:31 -04:00
Victor SANH
db2a3b2e01
space
2020-06-01 09:23:31 -04:00
Victor SANH
5f8f2d849a
add floppy bert model notebok
2020-06-01 09:23:31 -04:00
Victor SANH
b41948f5cd
add requirements
2020-06-01 09:23:31 -04:00
Victor SANH
fb8f4277b2
add scripts
2020-06-01 09:23:31 -04:00
Victor SANH
d489a6d3d5
add masked_run_*
2020-06-01 09:23:31 -04:00
Victor SANH
e4c07faf0a
add sparsity modules
2020-06-01 09:23:31 -04:00
Mehrdad Farahani
667003e447
Create README.md ( #4665 )
2020-06-01 08:29:09 -04:00
Mehrdad Farahani
ed23f5909e
HooshvareLab readme parsbert-armananer ( #4666 )
...
Readme for HooshvareLab/bert-base-parsbert-armananer-uncased
2020-06-01 08:28:43 -04:00
Mehrdad Farahani
3750b9b0b0
HooshvareLab readme parsbert-peymaner ( #4667 )
...
Readme for HooshvareLab/bert-base-parsbert-peymaner-uncased
2020-06-01 08:28:25 -04:00
Mehrdad Farahani
036c2c6b02
Update HooshvareLab/bert-base-parsbert-uncased ( #4687 )
...
mBERT results added regarding NER datasets!
2020-06-01 08:27:00 -04:00
Manuel Romero
74872c19d3
Create README.md ( #4684 )
2020-06-01 05:45:54 -04:00
Patrick von Platen
0866669e75
[EncoderDecoder] Fix initialization and save/load bug ( #4680 )
...
* fix bug
* add more tests
2020-05-30 01:25:19 +02:00
Patrick von Platen
6f82aea66b
Include `nlp` notebook for model evaluation ( #4676 )
2020-05-29 19:38:56 +02:00
Wei Fang
33b7532e69
Fix longformer attention mask type casting when using apex ( #4574 )
...
* Fix longformer attention mask casting when using apex
* remove extra type casting
2020-05-29 18:13:30 +02:00
Patrick von Platen
56ee2560be
[Longformer] Better handling of global attention mask vs local attention mask ( #4672 )
...
* better api
* improve automatic setting of global attention mask
* fix longformer bug
* fix global attention mask in test
* fix global attn mask flatten
* fix slow tests
* update docstring
* update docs and make more robust
* improve attention mask
2020-05-29 17:58:42 +02:00
Simon Böhm
e2230ba77b
Fix BERT example code for NSP and Multiple Choice ( #3953 )
...
Change the example code to use encode_plus since the token_type_id
wasn't being correctly set.
2020-05-29 11:55:55 -04:00
Zhangyx
3a5d1ea2a5
Fix two bugs: 1. Index of test data of SST-2. 2. Label index of MNLI data. ( #4546 )
2020-05-29 11:12:24 -04:00
Patrick von Platen
9c17256447
[Longformer] Multiple choice for longformer ( #4645 )
...
* add multiple choice for longformer
* add models to docs
* adapt docstring
* add test to longformer
* add longformer for mc in init and modeling auto
* fix tests
2020-05-29 13:46:08 +02:00
Iz Beltagy
91487cbb8e
[Longformer] fix model name in examples ( #4653 )
...
* fix longformer model names in examples
* a better name for the notebook
2020-05-29 13:12:35 +02:00
flozi00
b5015a2a0f
gpt2 typo ( #4629 )
...
* gpt2 typo
* Add files via upload
2020-05-28 16:44:43 -04:00
Iz Beltagy
fe5cb1a1c8
Adding community notebook ( #4642 )
...
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-05-28 22:35:15 +02:00
Suraj Patil
aecaaf73a4
[Community notebooks] add longformer-for-qa notebook ( #4652 )
2020-05-28 22:27:22 +02:00
Anthony MOI
5e737018e1
Fix add_special_tokens on fast tokenizers ( #4531 )
2020-05-28 10:54:45 -04:00
Suraj Patil
e444648a30
LongformerForTokenClassification ( #4638 )
2020-05-28 12:48:18 +02:00
Lavanya Shukla
3cc2c2a150
add 2 colab notebooks ( #4505 )
...
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-05-28 11:18:16 +02:00
Iz Beltagy
ef03ae874f
[Longformer] more models + model cards ( #4628 )
...
* adding freeze roberta models
* model cards
* lint
2020-05-28 11:11:05 +02:00
Patrick von Platen
96f57c9ccb
[Benchmark] Memory benchmark utils ( #4198 )
...
* improve memory benchmarking
* correct typo
* fix current memory
* check torch memory allocated
* better pytorch function
* add total cached gpu memory
* add total gpu required
* improve torch gpu usage
* update memory usage
* finalize memory tracing
* save intermediate benchmark class
* fix conflict
* improve benchmark
* improve benchmark
* finalize
* make style
* improve benchmarking
* correct typo
* make train function more flexible
* fix csv save
* better repr of bytes
* better print
* fix __repr__ bug
* finish plot script
* rename plot file
* delete csv and small improvements
* fix in plot
* fix in plot
* correct usage of timeit
* remove redundant line
* remove redundant line
* fix bug
* add hf parser tests
* add versioning and platform info
* make style
* add gpu information
* ensure backward compatibility
* finish adding all tests
* Update src/transformers/benchmark/benchmark_args.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/benchmark/benchmark_args_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* delete csv files
* fix isort ordering
* add out of memory handling
* add better train memory handling
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-05-27 23:22:16 +02:00
Suraj Patil
ec4cdfdd05
LongformerForSequenceClassification ( #4580 )
...
* LongformerForSequenceClassification
* better naming x=>hidden_states, fix typo in doc
* Update src/transformers/modeling_longformer.py
* Update src/transformers/modeling_longformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-05-27 22:30:00 +02:00
Suraj Patil
4402879ee4
[Model Card] model card for longformer-base-4096-finetuned-squadv1 ( #4625 )
2020-05-27 18:48:03 +02:00
Lysandre Debut
6a17688021
per_device instead of per_gpu/error thrown when argument unknown ( #4618 )
...
* per_device instead of per_gpu/error thrown when argument unknown
* [docs] Restore examples.md symlink
* Correct absolute links so that symlink to the doc works correctly
* Update src/transformers/hf_argparser.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
* Warning + reorder
* Docs
* Style
* not for squad
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-05-27 11:36:55 -04:00
Mehrdad Farahani
1381b6d01d
README for HooshvareLab ( #4610 )
...
HooshvareLab/bert-base-parsbert-uncased
2020-05-27 11:25:36 -04:00