transformers


João David 22ad1ea70c	Add TF swiftformer (#23342)	2024-04-23 15:08:23 +02:00

* Duplicate swiftformer
* Convert SwiftFormerPatchEmbedding
* Convert SwiftFormerEmbeddings
* Convert TFSwiftFormerMlp
* Convert TFSwiftFormerConvEncoder
* Convert TFSwiftFormerLocalRepresentation
* convert TFSwiftFormerEncoderBlock
* Convert SwiftFormerStage
* Convert SwiftFormerEncoder
* Add TFSWiftFormerPreTrainedModel
* Convert SwiftFormerForImageClassification
* Add kwargs and start drop path
* Fix syntax
* Change Model class name
* Add TFSwiftFormer to __init__
* Duplicate test_modeling_swiftformer
* First test conversions
* Change require_torch to require_tf
* Add exports to swiftformer __init__
* Add TFSwiftFormerModel wrapper
* Fix __init__ and run black
* Remove docstring from MainLayer, fix padding
* Use keras.layers.Activation on keras.Sequential
* Fix swiftformer exports
* Fix activation layer from config
* Remove post_inits
* Use tf.keras.layers.ZeroPadding2D
* Convert torch normalize
* Change tf test input shape
* Fix softmax and reduce_sum
* Convert expand_dims and repeat
* Add missing reshape and tranpose
* Simplify TFSwiftFormerEncoderBlock.call
* Fix mismatch in patch embeddings
* Fix expected output shape to match channels last
* Fix swiftformer typo
* Disable test_onnx
* Fix TFSwiftFormerForImageClassification call
* Add unpack inputs
* Convert flatten(2).mean(-1)
* Change vision dummy inputs (to be reviewed)
* Change test_forward_signature to use .call
* Fix @unpack_inputs
* Set return_tensors="tf" and rename class
* Rename wrongly named patch_embeddings layer
* Add serving_output and change dummy_input shape
* Make dimensions BCHW and transpose inside embedding layer
* Change SwiftFormerEncoderBlock
* Fix ruff problems
* Add image size to swiftformer config
* Change tranpose to MainLayer and use -1 for reshape
* Remove serving_outputs and dummy_inputs
* Remove test_initialization test from tf model
* Make Sequential component a separate layer
* Fix layers' names
* Tranpose encoder outputs
* Fix tests and check if hidden states is not None
* Fix TFSwiftFormerForImageClassification
* Run make fixup
* Run make fix-copies
* Update modeling_tf_auto
* Update docs
* Fix modeling auto mapping
* Update modelint_tf_swiftformer docs
* Fill image_size doc and type
* Add reduction=None to loss computation
* Update docs
* make style
* Debug: Delete the tip to see if that changes anything
* Re-add tip
* Remove add_code_sample_docstrings
* Remove unused import
* Get the debug to actually tell us the problem it has with the docs
* Try a substitution to match the PyTorch file?
* Add swiftformer to ignore list
* Add build() methods
* Update copyright year Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Remove FIXME comment
* Remove from_pt
* Update copyright year Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Rename one-letter variables
* Remove FIXMEs related to momentum
* Remove old TODO comment
* Remove outstanding FIXME comments
* Get dropout rate from config
* Add specific dropout config for MLP
* Add convencoder dropout to config
* Pass config to SwiftFormerDropPath layer
* Fix drop_path variable name and add Adapted from comment
* Run ruff
* Removed copied from comment
* Run fix copies
* Change drop_path to identity to match pt
* Cleanup build() methods and move to new keras imports
* Update docs/source/en/model_doc/swiftformer.md Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
* Raise error if drop_path_rate > 0.0
* Apply suggestions from code review Replace (self.dim), with self.dim, Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
* Remove drop_path function
* Add training to TFSwiftFormerEncoder
* Set self.built = True last Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Should have been added to previous commit Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Change default_feature_extractor to default_image_processor Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Import Keras from modeling_tf_utils
* Remove relative import
* Run ruff --fix
* Move import keras to tf_available
* Add copied from comment to test_forward_signature
* Reduce batch size and num_labels
* Extract loss logic to hf_compute_loss
* Run ruff format

---------

Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

benchmark	[Test refactor 1/5] Per-folder tests reorganization (#15725)	2022-02-23 15:46:28 -05:00
bettertransformer	Fixed malapropism error (#26660)	2023-10-09 11:04:57 +02:00
deepspeed	🚨🚨🚨Deprecate `evaluation_strategy` to `eval_strategy`🚨🚨🚨 (#30190)	2024-04-23 15:08:17 +02:00
extended	🚨🚨🚨Deprecate `evaluation_strategy` to `eval_strategy`🚨🚨🚨 (#30190)	2024-04-23 15:08:17 +02:00
fixtures	Implementation of SuperPoint and AutoModelForKeypointDetection (#28966)	2024-03-19 14:43:02 +00:00
fsdp	🚨🚨🚨Deprecate `evaluation_strategy` to `eval_strategy`🚨🚨🚨 (#30190)	2024-04-23 15:08:17 +02:00
generation	Do not remove half seq length in generation tests (#30016)	2024-04-23 15:08:22 +02:00
models	Add TF swiftformer (#23342)	2024-04-23 15:08:23 +02:00
optimization	Make schedulers picklable by making lr_lambda fns global (#21768)	2023-03-02 12:08:43 -05:00
peft_integration	FIX [`CI`]: Fix failing tests for peft integration (#29330)	2024-02-29 03:56:16 +01:00
pipelines	Fix all torch pipeline failures except one (#30290)	2024-04-23 15:08:13 +02:00
quantization	FIX / AWQ: Fix failing exllama test (#30288)	2024-04-23 15:08:10 +02:00
repo_utils	Allow `# Ignore copy` (#27328)	2023-12-07 10:00:08 +01:00
sagemaker	Update all references to canonical models (#29001)	2024-02-16 08:16:58 +01:00
tokenization	Remove static pretrained maps from the library's internals (#29112)	2024-03-25 10:33:38 +01:00
tools	Add support for for loops in python interpreter (#24429)	2023-06-26 09:58:14 -04:00
trainer	Update unwrap from accelerate (#29933)	2024-04-23 15:08:22 +02:00
utils	Fix test transposing image with EXIF Orientation tag (#30319)	2024-04-23 15:08:17 +02:00
__init__.py	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
test_backbone_common.py	Align backbone stage selection with out_indices & out_features (#27606)	2023-12-20 18:33:17 +00:00
test_cache_utils.py	Generate: add tests for caches with `pad_to_multiple_of` (#29462)	2024-03-06 10:57:04 +00:00
test_configuration_common.py	[`PretrainedConfig`] Improve messaging (#27438)	2023-11-15 14:10:39 +01:00
test_configuration_utils.py	[tests] remove deprecated tests for model loading (#29450)	2024-03-15 14:18:41 +00:00
test_feature_extraction_common.py	Split common test from core tests (#24284)	2023-06-15 07:30:24 -04:00
test_feature_extraction_utils.py	[tests] remove deprecated tests for model loading (#29450)	2024-03-15 14:18:41 +00:00
test_image_processing_common.py	Raise unused kwargs image processor (#29063)	2024-02-20 16:20:20 +01:00
test_image_processing_utils.py	[tests] remove deprecated tests for model loading (#29450)	2024-03-15 14:18:41 +00:00
test_image_transforms.py	Normalize floating point cast (#27249)	2023-11-10 15:35:27 +00:00
test_modeling_common.py	Enable multi-device for some models (#30207)	2024-04-23 15:08:20 +02:00
test_modeling_flax_common.py	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241)	2024-04-15 09:36:06 +01:00
test_modeling_flax_utils.py	Enable safetensors conversion from PyTorch to other frameworks without the torch requirement (#27599)	2024-01-23 10:28:23 +01:00
test_modeling_tf_common.py	Add tf_keras imports to prepare for Keras 3 (#28588)	2024-01-30 17:26:36 +00:00
test_modeling_tf_utils.py	Cast bfloat16 to float32 for Numpy conversions (#29755)	2024-03-21 14:04:11 +00:00
test_modeling_utils.py	Fix config + attn_implementation in AutoModelForCausalLM.from_pretrained (#30299)	2024-04-23 15:08:23 +02:00
test_pipeline_mixin.py	Image Feature Extraction pipeline (#28216)	2024-02-05 14:50:07 +00:00
test_processing_common.py	Don't save `processor_config.json` if a processor has no extra attribute (#28584)	2024-01-19 09:59:14 +00:00
test_sequence_feature_extraction_common.py	Fix typo (#25966)	2023-09-05 10:12:25 +02:00
test_tokenization_common.py	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241)	2024-04-15 09:36:06 +01:00
test_tokenization_utils.py	[tests] remove deprecated tests for model loading (#29450)	2024-03-15 14:18:41 +00:00