Arthur
253f9a3f97
[`GPTNeoX`] Faster rotary embedding for GPTNeoX (based on llama changes) ( #25830 )
...
* Faster rotary embedding for GPTNeoX
* there might be un-necessary moves from device
* fixup
* fix dtype issue
* add copied from statements
* fix copies
* oupsy
* add copied from Llama for scaled ones as well
* fixup
* fix
* fix copies
2023-10-05 10:05:39 +02:00
Arthur
b4e66d7a67
[ `NougatProcessor`] Fix the default channel ( #26608 )
...
fix
2023-10-05 09:38:08 +02:00
Yeyang
43bfd093e1
add zh translation for installation ( #26084 )
...
* translate installation to zh
* fix translation typo
2023-10-04 09:39:02 -07:00
Sanchit Gandhi
2d8ee9817c
[Wav2Vec2] Fix tokenizer set lang ( #26349 )
...
* fix wav2vec2 doctest
* suggestion
* fix
* final fix
* revert since we need AddedTokens
2023-10-04 17:12:09 +01:00
Galland
f9ab07f920
Update mistral.md to update 404 link ( #26590 )
2023-10-04 17:48:11 +02:00
Arthur
c037b2e340
skip flaky hub tests ( #26594 )
...
skip flaky
2023-10-04 17:47:55 +02:00
Soyoung Yoon
ca7912d191
Fix encoder->decoder typo bug in convert_t5x_checkpoint_to_pytorch.py ( #26587 )
...
Fix bug in convert_t5x_checkpoint_to_pytorch.py
2023-10-04 17:34:32 +02:00
Matt
8b03615b7b
Fix embarrassing typo in the doc chat template! ( #26596 )
2023-10-04 16:28:53 +01:00
dg845
9deb18ca1a
Add # Copied from statements to audio feature extractors that use the floats_list function ( #26581 )
...
Add # Copied from statements to audio feature extractors that use the floats_list function.
2023-10-04 17:09:48 +02:00
Sanchit Gandhi
0a49f909bc
[Mistral] Update config docstring ( #26593 )
...
* fix copies
* fix missing docstring
* make style
* oops
2023-10-04 16:02:34 +01:00
Phuc Van Phan
6015f91a5a
refactor: change default block_size ( #26229 )
...
* refactor: change default block_size
* fix: return tf to origin
* fix: change files to origin
* rebase
* rebase
* rebase
* rebase
* rebase
* rebase
* rebase
* rebase
* refactor: add min block_size to files
* reformat: add min block_size for run_clm tf
2023-10-04 15:31:38 +01:00
Matt
8b46c5bcfc
Add add_generation_prompt argument to apply_chat_template ( #26573 )
...
* Add add_generation_prompt argument to apply_chat_template
* Add add_generation_prompt argument to apply_chat_template and update default templates
* Fix typo
* Add generation prompts section to chat templating guide
* Add generation prompts section to chat templating guide
* Minor style fix
2023-10-04 15:15:29 +01:00
Sylvain Gugger
03af4c42a6
Docstring check ( #26052 )
...
* Fix number of minimal calls to the Hub with peft integration
* Alternate design
* And this way?
* Revert
* Nits to fix
* Add util
* Print when changes are made
* Add list to ignore
* Add more rules
* Manual fixes
* deal with kwargs
* deal with enum defaults
* avoid many digits for floats
* Manual fixes
* Fix regex
* Fix regex
* Auto fix
* Style
* Apply script
* Add ignored list
* Add check that templates are filled
* Adding to CI checks
* Add back semi-fix
* Ignore more objects
* More auto-fixes
* Ignore missing objects
* Remove temp semi-fix
* Fixes
* Update src/transformers/models/pvt/configuration_pvt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update utils/check_docstrings.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/utils/quantization_config.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Deal with float defaults
* Fix small defaults
* Address review comment
* Treat
* Post-rebase cleanup
* Address review comment
* Update src/transformers/models/deprecated/mctct/configuration_mctct.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
* Address review comment
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
2023-10-04 15:13:37 +02:00
Bharat Ramanathan
122b2657f8
feat: add trainer label to wandb run upon initialization ( #26466 )
2023-10-04 14:57:41 +02:00
statelesshz
4fdf47cd3c
Extend Trainer to enable Ascend NPU to use the fused Adamw optimizer when training ( #26194 )
2023-10-04 14:57:11 +02:00
dependabot[bot]
fc296f419e
Bump pillow from 9.3.0 to 10.0.1 in /examples/research_projects/decision_transformer ( #26580 )
...
Bump pillow in /examples/research_projects/decision_transformer
Bumps [pillow](https://github.com/python-pillow/Pillow ) from 9.3.0 to 10.0.1.
- [Release notes](https://github.com/python-pillow/Pillow/releases )
- [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst )
- [Commits](https://github.com/python-pillow/Pillow/compare/9.3.0...10.0.1 )
---
updated-dependencies:
- dependency-name: pillow
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-04 11:52:46 +02:00
김준재_T3056
2f3ea08a07
docs: feat: add clip notebook resources from OSSCA community ( #26505 )
2023-10-03 11:20:22 -07:00
Lysandre Debut
5c66378cea
[Tokenizers] Skip tests temporarily ( #26574 )
...
* Skip tests temporarily
* style
* Add additional test
2023-10-03 19:43:42 +02:00
Jungnerd
2c7b26f508
🌐 [i18n-KO] Translated `semantic_segmentation.md` to Korean ( #26515 )
...
* docs: ko: semantic_segmentation.md
* feat: manual draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* fix: resolve suggestions
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* fix: edit the title
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-10-03 10:25:50 -07:00
Sanchit Gandhi
57f44dc428
[Whisper] Allow basic text normalization ( #26149 )
...
* [Whisper] Allow basic text normalization
* up
* style copies
2023-10-03 17:57:16 +01:00
Lysandre
bd6205919a
v4.35.0.dev0
2023-10-03 16:54:37 +02:00
Arthur
c26b2a29e5
[`Nougat`] from transformers import * ( #26562 )
...
* remove unprotected import to PIL
* cleanup
---------
Co-authored-by: Lysandre <lysandre@huggingface.co>
2023-10-03 16:32:12 +02:00
Younes Belkada
2aef9a9601
[`PEFT`] Final fixes ( #26559 )
...
* fix issues with PEFT
* logger warning futurewarning issues
* fixup
* adapt from suggestions
* oops
* rm test
2023-10-03 14:53:09 +02:00
Younes Belkada
ae9a344cce
[`Mistral`] Add Flash Attention-2 support for `mistral` ( #26464 )
...
* add FA-2 support for mistral
* fixup
* add sliding windows
* fixing few nits
* v1 slicing cache - logits do not match
* add comment
* fix bugs
* more mem efficient
* add warning once
* add warning once
* oops
* fixup
* more comments
* copy
* add safety checker
* fixup
* Update src/transformers/models/mistral/modeling_mistral.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* copied from
* up
* raise when padding side is right
* fixup
* add doc + few minor changes
* fixup
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-10-03 13:44:46 +02:00
Arthur
1a2e966cfe
Nit-added-tokens ( #26538 )
...
* fix stripping
* nits
* fix another test
* styling
* fix?
* update
* revert bad merge
* found the bug
* YES SIR
* is that change really required?
* make fast even faster
* re order functions
2023-10-03 12:23:46 +02:00
Srijan Sahay Srivastava
245da7ed38
[Doctest] Add `configuration_encoder_decoder.py` ( #26519 )
...
* [Doctest] Add configuration_encoder_decoder.py
Added configuration_encoder_decoder.py to utils/documentation_tests.txt for doctest
* Revert "[Doctest] Add configuration_encoder_decoder.py"
This reverts commit bd653535a4.
* [Doctest] Add configuration_encoder_decoder.py
add configuration_encoder_decoder.py to utils/documentation_tests.txt
* [Doctest] Add configuration_encoder_decoder.py
add configuration_encoder_decoder.py to utils/documentation_tests.txt
* [Doctest] Add configuration_encoder_decoder.py
add configuration_encoder_decoder.py to utils/documentation_tests.txt
* changed as per request
* fixed line 46
2023-10-03 11:21:24 +02:00
Funtowicz Morgan
3632fb3c25
[AMD] Add initial version for run_tests_multi_gpu ( #26346 )
...
* Add initial version for run_tests_multi_gpu
* Trigger change in BERT
* fix typo setup -> setup_gpu
* Add tag mi210
* Enable multi-gpu jobs
* One more
* Use dynamic device allocation
* Attempt to fix syntax for docker create
* fix script path
* fix
* temp machine type
* fix label
* Enable multi-gpu tests
* Rename multi-amd-gpu to multi-gpu
* Let's not be lazy dude
* Update rocm-smi output
* Add gpu_flavour in the matrix
* Fix typos
* merge single/multi dispatch into the matrix
* Format.
* Revert BERT's change
---------
Co-authored-by: Guillaume LEGENDRE <glegendre01@gmail.com>
2023-10-03 11:13:45 +02:00
Sanchit Gandhi
768aa3d9cd
[Wav2Vec2 and Co] Update init tests for PT 2.1 ( #26494 )
2023-10-03 10:52:34 +02:00
Nathan Cahill
b5ca8fcd20
Add tokenizer kwargs to fill mask pipeline. ( #26234 )
...
* add tokenizer kwarg inputs
* Adding tokenizer_kwargs to _sanitize_parameters
* Add truncation=True example to tests
* Update test_pipelines_fill_mask.py
* Update test_pipelines_fill_mask.py
* make fix-copies and make style
* Update fill_mask.py
Replace single tick with double
* make fix-copies
* Style
---------
Co-authored-by: Lysandre <lysandre@huggingface.co>
2023-10-03 10:25:10 +02:00
Patrick von Platen
df6a855e7b
[RFC, Logging] Change warning to info ( #26545 )
...
[Logging] Change warning to info
2023-10-03 08:55:39 +02:00
dependabot[bot]
cf345d5f38
Bump urllib3 from 1.26.9 to 1.26.17 in /examples/research_projects/decision_transformer ( #26554 )
...
Bump urllib3 in /examples/research_projects/decision_transformer
Bumps [urllib3](https://github.com/urllib3/urllib3 ) from 1.26.9 to 1.26.17.
- [Release notes](https://github.com/urllib3/urllib3/releases )
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst )
- [Commits](https://github.com/urllib3/urllib3/compare/1.26.9...1.26.17 )
---
updated-dependencies:
- dependency-name: urllib3
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-03 08:55:12 +02:00
dependabot[bot]
6de6fdd06d
Bump urllib3 from 1.26.5 to 1.26.17 in /examples/research_projects/visual_bert ( #26552 )
...
Bump urllib3 in /examples/research_projects/visual_bert
Bumps [urllib3](https://github.com/urllib3/urllib3 ) from 1.26.5 to 1.26.17.
- [Release notes](https://github.com/urllib3/urllib3/releases )
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst )
- [Commits](https://github.com/urllib3/urllib3/compare/1.26.5...1.26.17 )
---
updated-dependencies:
- dependency-name: urllib3
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-03 08:55:01 +02:00
dependabot[bot]
e092b4ad68
Bump urllib3 from 1.26.5 to 1.26.17 in /examples/research_projects/lxmert ( #26551 )
...
Bump urllib3 in /examples/research_projects/lxmert
Bumps [urllib3](https://github.com/urllib3/urllib3 ) from 1.26.5 to 1.26.17.
- [Release notes](https://github.com/urllib3/urllib3/releases )
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst )
- [Commits](https://github.com/urllib3/urllib3/compare/1.26.5...1.26.17 )
---
updated-dependencies:
- dependency-name: urllib3
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-03 08:54:50 +02:00
Florian Zimmermeister
9ed538f2e6
[i18n-DE] contribute chapter ( #26481 )
...
* start working on next chapter
* finish testing
* Update docs/source/de/testing.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/de/testing.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/de/testing.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-10-02 09:56:40 -07:00
Wonhyeong Seo
1470f731b6
🌐 [i18n-KO] Translated `tokenizer_summary.md` to Korean ( #26243 )
...
* docs: ko: tokenizer_summary.md
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Juntae <79131091+sronger@users.noreply.github.com>
Co-Authored-By: Injin Paek <71638597+eenzeenee@users.noreply.github.com>
* update review
* fix: resolve suggestions
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
---------
Co-authored-by: HanNayeoniee <nayeon2.han@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Juntae <79131091+sronger@users.noreply.github.com>
Co-authored-by: Injin Paek <71638597+eenzeenee@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
2023-10-02 09:55:33 -07:00
Arthur
c20d90d577
add build_inputs_with_special_tokens to LlamaFast ( #26297 )
...
* add build_inputs_with_special_tokens to LlamaFast
* fixup
* Update src/transformers/models/llama/tokenization_llama_fast.py
2023-10-02 18:30:44 +02:00
Arthur
bab3331906
Code-llama-nit ( #26300 )
...
* fix encoding when the fill token is None
* add tests and edge cases
* fixup
* Update tests/models/code_llama/test_tokenization_code_llama.py
2023-10-02 18:29:27 +02:00
Adithya Hegde Kota
4b4c6aabfb
[Doctest] Add configuration_roformer.py ( #26530 )
...
* [Doctest] Add configuration_roformer.py
* [Doctest] Add configuration_roformer.py
* [Doctest] Add configuration_roformer.py
* [Doctest] Add configuration_roformer.py
* Removed documentation_test.txt
* Removed configuration_roformer.py
* Update not_doctested.txt
2023-10-02 17:19:13 +02:00
Arthur
e4dad4fe32
Remove-warns ( #26483 )
...
* fix stripping
* remove some warnings and update some warnings
* revert changes for other PR
2023-10-02 16:52:00 +02:00
Younes Belkada
1b8decb04c
[`PEFT`] Protect `adapter_kwargs` check ( #26537 )
...
Update modeling_utils.py
2023-10-02 14:59:24 +02:00
Arthur
63864e057f
Fix model integration ci ( #26322 )
...
* fix wav2vec2
* nit
* stash
* one more file to update
* fix byt5
* vocab size is 256, don't change that!
* use other revision
* test persimon in smaller size
* style
* tests
* nits
* update add tokens from pretrained
* test tokenization
* nits
* potential fnet fix?
* more nits
* nits
* correct test
* assert close
* update
* ouch
* fix it
* some more nits
* FINALLY
* use `adept` checkpoints
* more adept checkpoints
* that was involved!
2023-10-02 13:55:46 +02:00
Younes Belkada
6824461f2a
[`core`/ `auto` ] Fix bnb test with code revision + bug with code revision ( #26431 )
...
* fix bnb test with code revision
* fix test
* Apply suggestions from code review
* Update src/transformers/models/auto/auto_factory.py
* Update src/transformers/models/auto/auto_factory.py
* Update src/transformers/models/auto/auto_factory.py
2023-10-02 11:35:07 +02:00
Younes Belkada
24178c2461
[`PEFT`] Pass token when calling `find_adapter_config` ( #26488 )
...
* try
* nit
* nits
2023-10-02 11:23:03 +02:00
HelgeS
7d6627d0d9
Fix broken link to video classification task ( #26487 )
2023-10-02 11:19:11 +02:00
marcmk6
6d02ca4bb9
Fix issue of canine forward requiring input_ids anyway ( #26290 )
...
* fix issue of canine forward requiring input_ids anyway
`forward` required `input_ids` to derive other variables in all cases. Change this to use whichever of `input_ids` and `inputs_embeds` is given.
* fix canine forward
The current `forward` requires (the shape of) `input_ids` to derive other variables, regardless of whether `input_ids` or `inputs_embeds` is provided. Change this to use whichever one is given instead of always using `input_ids`.
* fix format
* fix format
2023-10-02 11:06:40 +02:00
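The idea behind the canine fix above can be sketched in a few lines. This is only an illustration, not the code from the commit: the helper name `resolve_input_shape` is hypothetical, and `SimpleNamespace` objects stand in for tensors so the example runs without any deep-learning library. It assumes `inputs_embeds` has shape `(batch, seq_len, hidden)`.

```python
from types import SimpleNamespace


def resolve_input_shape(input_ids=None, inputs_embeds=None):
    """Return (batch_size, seq_length) from whichever input was provided.

    Hypothetical helper: mirrors the idea of the fix, not the actual
    implementation in the model's forward method.
    """
    if input_ids is not None and inputs_embeds is not None:
        raise ValueError("Specify either input_ids or inputs_embeds, not both")
    if input_ids is not None:
        # input_ids is assumed to be (batch, seq_len).
        return tuple(input_ids.shape)
    if inputs_embeds is not None:
        # inputs_embeds is assumed to be (batch, seq_len, hidden);
        # drop the trailing hidden dimension.
        return tuple(inputs_embeds.shape[:-1])
    raise ValueError("Specify either input_ids or inputs_embeds")


# Stand-ins for tensors: only the .shape attribute is used.
ids = SimpleNamespace(shape=(2, 16))
embeds = SimpleNamespace(shape=(2, 16, 768))

shape_from_ids = resolve_input_shape(input_ids=ids)
shape_from_embeds = resolve_input_shape(inputs_embeds=embeds)
```

Both calls yield the same `(batch_size, seq_length)` pair, so downstream code no longer depends on `input_ids` being present.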
Jan Philipp Harries
7d77d7f79c
Fix requests connection error during modelcard creation ( #26518 )
...
fix requests connection error
Co-authored-by: Jan Philipp Harries <jphme@users.noreply.github.com>
2023-10-02 10:52:51 +02:00
Florian Seiler
ca0379b8c8
Fix num_heads in _upad_input ( #26490 )
...
* Fix num_heads in _upad_input
The variable num_key_value_heads was mistakenly named num_heads, which led to the query_layer being reshaped with the wrong attention head count. (Using the correct variable self.num_heads instead of num_heads would have been enough, but I renamed num_heads to num_key_value_heads for clarity.)
* fixed copies using make fix-copies and ran make fixup
---------
Co-authored-by: fseiler <f.seiler@jerocom.de>
2023-10-02 10:10:19 +02:00
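Why the head count matters in the fix above can be shown with shape arithmetic alone. The sketch below is an illustration under assumptions, not the code from the commit: the helper names are hypothetical, shapes are plain tuples rather than tensors, and the sizes (32 query heads, 8 key/value heads, head dimension 128) are made-up values for a model whose key/value head count differs from its query head count.

```python
from math import prod


def flattened_shape(shape, heads):
    """Target shape after merging batch and sequence dims: (batch * seq, heads, head_dim)."""
    batch, seq_len, _, head_dim = shape
    return (batch * seq_len, heads, head_dim)


def can_reshape(src, dst):
    """A reshape is only valid when the total element count is unchanged."""
    return prod(src) == prod(dst)


# Illustrative sizes (assumptions, not taken from the commit).
num_heads, num_key_value_heads, head_dim = 32, 8, 128
query = (2, 10, num_heads, head_dim)           # (batch, seq, heads, head_dim)
key = (2, 10, num_key_value_heads, head_dim)

# Reshaping the query with the key/value head count does not preserve size.
wrong = flattened_shape(query, num_key_value_heads)
# Reshaping it with the query head count does.
right = flattened_shape(query, num_heads)

query_ok_with_kv_heads = can_reshape(query, wrong)
query_ok_with_q_heads = can_reshape(query, right)
key_ok_with_kv_heads = can_reshape(key, flattened_shape(key, num_key_value_heads))
```

When both head counts are equal the mix-up is harmless, which is likely why such a bug can go unnoticed until the two counts differ.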
Lysandre Debut
67239f7360
Revert falcon exception ( #26472 )
...
* Revert "Falcon: fix revision propagation (#26006)"
This reverts commit 118c676ef3.
* Revert "Put Falcon back (#25960)"
This reverts commit 22a69f1d7d.
2023-10-02 09:13:19 +02:00
Sanchit Gandhi
0b192de1f3
[ASR Pipe] Improve docs and error messages ( #26476 )
...
* improve docs/errors
* why whisper
* Update docs/source/en/pipeline_tutorial.md
Co-authored-by: Lysandre Debut <hi@lysand.re>
* specify pt only
---------
Co-authored-by: Lysandre Debut <hi@lysand.re>
2023-09-29 18:32:37 +01:00
Sanchit Gandhi
68e85fc822
[Flax Examples] Seq2Seq ASR Fine-Tuning Script ( #21764 )
...
* from seq2seq speech
* [Flax] Example script for speech seq2seq
* tests and fixes
* make style
* fix: label padding tokens
* fix: label padding tokens over list
* update ln names for Whisper
* try datasets iter loader
* create readme and append results
* style
* make style
* adjust lr
* use pt dataloader
* make fast
* pin gen max len
* finish
* add pt to requirements for test
* fix pt -> torch
* add accelerate
2023-09-29 16:42:58 +01:00