transformers/src/transformers
fxmarty 49204c1d37
Better SDPA unmasking implementation (#29318)
* better unmask imple
* comment
* typo
* bug report pytorch
* cleanup
* fix import
* add back example
* retrigger ci
* come on
2024-02-28 16:36:47 +01:00
benchmark [tests] enable benchmark unit tests on XPU (#29284) 2024-02-27 09:44:48 +00:00
commands Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
data Remove fast tokenization warning in Data Collators (#28213) 2024-01-02 18:32:23 +00:00
generation Fix a few typos in `GenerationMixin`'s docstring (#29277) 2024-02-27 18:15:43 +00:00
integrations fix(mlflow): check mlflow version to use the synchronous flag (#29195) 2024-02-23 09:19:51 +01:00
kernels DeformableDetrModel support fp16 (#29013) 2024-02-15 12:31:09 +00:00
models Better SDPA unmasking implementation (#29318) 2024-02-28 16:36:47 +01:00
onnx Byebye torch 1.10 (#28207) 2024-01-11 16:18:27 +01:00
pipelines Token level timestamps for long-form generation in Whisper (#29148) 2024-02-27 18:15:26 +00:00
quantizers Add training version check for AQLM quantizer. (#29142) 2024-02-21 17:09:36 +01:00
sagemaker [Refactor] Relative imports wherever we can (#21880) 2023-03-02 09:45:42 +01:00
tools Ability to override clean_code_for_run (#28783) 2024-02-05 03:48:41 +01:00
utils Starcoder2 model - bis (#29215) 2024-02-28 01:24:34 +01:00
__init__.py Starcoder2 model - bis (#29215) 2024-02-28 01:24:34 +01:00
activations.py SiLU activation wrapper for safe importing (#28509) 2024-01-15 19:36:59 +00:00
activations_tf.py Add tf_keras imports to prepare for Keras 3 (#28588) 2024-01-30 17:26:36 +00:00
audio_utils.py Porting the torchaudio kaldi fbank implementation to audio_utils (#26182) 2023-09-21 17:52:47 +02:00
cache_utils.py [ `gemma`] Adds support for Gemma 💎 (#29167) 2024-02-21 14:21:28 +01:00
configuration_utils.py Fix `attn_implementation` documentation (#29295) 2024-02-27 10:43:01 +01:00
convert_graph_to_onnx.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
convert_pytorch_checkpoint_to_tf2.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
convert_slow_tokenizer.py [ `gemma`] Adds support for Gemma 💎 (#29167) 2024-02-21 14:21:28 +01:00
convert_slow_tokenizers_checkpoints_to_fast.py Black preview (#17217) 2022-05-12 16:25:55 -04:00
convert_tf_hub_seq_to_seq_bert_to_pytorch.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
debug_utils.py Black preview (#17217) 2022-05-12 16:25:55 -04:00
deepspeed.py 🚨🚨🚨 [`Refactor`] Move third-party related utility files into `integrations/` folder 🚨🚨🚨 (#25599) 2023-08-25 17:13:34 +02:00
dependency_versions_check.py ⚠️ Time to say goodbye to py37 (#24091) 2023-06-28 07:22:39 +02:00
dependency_versions_table.py unpin torch (#28892) 2024-02-06 17:21:05 +01:00
dynamic_module_utils.py simplify get_class_in_module and fix for paths containing a dot (#29262) 2024-02-28 03:10:36 +01:00
feature_extraction_sequence_utils.py Fix wav2vec2 is_batched check to include 2-D numpy arrays (#23223) 2023-05-22 12:57:45 -04:00
feature_extraction_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
file_utils.py Add FastSpeech2Conformer (#23439) 2024-01-03 18:01:06 +00:00
generation_flax_utils.py Generate: deprecate old src imports (#28607) 2024-01-27 15:54:19 +00:00
generation_tf_utils.py Generate: deprecate old src imports (#28607) 2024-01-27 15:54:19 +00:00
generation_utils.py Generate: deprecate old src imports (#28607) 2024-01-27 15:54:19 +00:00
hf_argparser.py Docs: add `kwargs` type to fix formatting (#24733) 2023-07-11 16:21:29 +01:00
hyperparameter_search.py [integration] Update Ray Tune integration for Ray 2.7 (#26499) 2023-12-09 11:04:13 +01:00
image_processing_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
image_transforms.py Normalize floating point cast (#27249) 2023-11-10 15:35:27 +00:00
image_utils.py Raise unused kwargs image processor (#29063) 2024-02-20 16:20:20 +01:00
keras_callbacks.py Add tf_keras imports to prepare for Keras 3 (#28588) 2024-01-30 17:26:36 +00:00
modelcard.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
modeling_attn_mask_utils.py Better SDPA unmasking implementation (#29318) 2024-02-28 16:36:47 +01:00
modeling_flax_outputs.py Resnet flax (#21472) 2023-03-24 19:45:57 +00:00
modeling_flax_pytorch_utils.py Flax mistral (#26943) 2024-01-31 14:19:02 +01:00
modeling_flax_utils.py [ `gemma`] Adds support for Gemma 💎 (#29167) 2024-02-21 14:21:28 +01:00
modeling_outputs.py Improved type hinting for all attention parameters (#28479) 2024-01-24 16:47:34 +00:00
modeling_tf_outputs.py Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
modeling_tf_pytorch_utils.py Add tf_keras imports to prepare for Keras 3 (#28588) 2024-01-30 17:26:36 +00:00
modeling_tf_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
modeling_utils.py Fix `attn_implementation` documentation (#29295) 2024-02-27 10:43:01 +01:00
optimization.py Added passing parameters to "reduce_lr_on_plateau" scheduler (#27860) 2023-12-08 14:06:10 +01:00
optimization_tf.py Fix Keras scheduler import so it works for older versions of Keras (#28895) 2024-02-07 12:28:24 +00:00
processing_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
pytorch_utils.py Add serialization logic to pytree types (#27871) 2024-01-29 10:41:20 +01:00
safetensors_conversion.py Make using safetensors files automated. (#27571) 2023-12-01 15:51:10 +01:00
testing_utils.py [`require_read_token`] fix typo (#29345) 2024-02-28 10:13:57 +01:00
tf_utils.py Wrap Keras methods to support BatchEncoding (#28734) 2024-01-31 13:18:42 +00:00
time_series_utils.py [Time-Series] Autoformer model (#21891) 2023-05-30 10:23:32 +02:00
tokenization_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
tokenization_utils_base.py Add chat support to text generation pipeline (#28945) 2024-02-16 16:41:01 +00:00
tokenization_utils_fast.py [`Tokenizer`] Fix slow and fast serialization (#26570) 2023-10-18 16:30:53 +02:00
trainer.py Add feature extraction mapping for automatic metadata update (#28944) 2024-02-26 10:35:37 +00:00
trainer_callback.py Change progress logging to once across all nodes (#28373) 2024-01-12 15:01:21 -05:00
trainer_pt_utils.py Introduce AcceleratorConfig dataclass (#28664) 2024-02-14 10:18:09 -05:00
trainer_seq2seq.py Have seq2seq just use gather (#27025) 2023-11-14 14:54:44 -05:00
trainer_utils.py Add compatibility with skip_memory_metrics for mps device (#29264) 2024-02-27 09:58:43 -05:00
training_args.py FEAT [`Trainer` / `bnb`]: Add RMSProp from `bitsandbytes` to HF `Trainer` (#29082) 2024-02-20 02:43:02 +01:00
training_args_seq2seq.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
training_args_tf.py Add tf_keras imports to prepare for Keras 3 (#28588) 2024-01-30 17:26:36 +00:00