transformers

History

Arthur fb1c62e973 [`Add Mamba`] Adds support for the `Mamba` models (#28094 ) * initial-commit * start cleaning * small nits * small nits * current updates * add kernels * small refactoring little step * add comments * styling * nit * nits * Style * Small changes * Push dummy mambda simple slow * nit * Use original names * Use original names and remove norm * Updates for inference params * Style nd updates * nits * Match logits * Add a test * Add expected generated text * nits doc, imports and styling * style * oups * dont install kernels, invite users to install the required kernels * let use use the original packages * styling * nits * fix some copieds * update doc * fix-copies * styling done * nits * fix import check * run but wrong cuda ress * mamba CUDA works :) * fix the fast path * config naming nits * conversion script is not required at this stage * finish fixing the fast path: generation make sense now! * nit * Let's start working on the CIs * style * better style * more nits * test nit * quick fix for now * nits * nit * nit * nit * nits * update test rest * fixup * update test * nit * some fixes * nits * update test values * fix styling * nit * support peft * integrations tests require torchg * also add slow markers * styling * chose forward wisely * nits * update tests * fix gradient checkpointing * fixup * nit * fix doc * check copies * fix the docstring * fix some more tests * style * fix beam search * add init schene * update * nit * fix * fixup the doc * fix the doc * fixup * tentative update but slow is no longer good * nit * should we always use float32? * nits * revert wrong changes * res in float32 * cleanup * skip fmt for now * update generation values * update test values running original model * fixup * update tests + rename inference_params to cache_params + make sure training does not use cache_params * small nits * more nits * fix final CIs * style * nit doc * I hope final doc nits * nit * 🫠 * final touch! * fix torch import * Apply suggestions from code review Co-authored-by: Lysandre Debut <hi@lysand.re> * Apply suggestions from code review * fix fix and fix * fix base model prefix! * nit * Update src/transformers/models/mamba/__init__.py * Update docs/source/en/model_doc/mamba.md Co-authored-by: Lysandre Debut <hi@lysand.re> * nit --------- Co-authored-by: Lysandre Debut <hi@lysand.re>		2024-03-05 20:01:06 +09:00
..
asr.md	Add new meta w2v2-conformer BERT-like model (#28165 )	2024-01-18 13:37:34 +00:00
audio_classification.md	Add new meta w2v2-conformer BERT-like model (#28165 )	2024-01-18 13:37:34 +00:00
document_question_answering.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
idefics.md	[Docs] Fix broken links and syntax issues (#28918 )	2024-02-08 14:13:35 -08:00
image_captioning.md	[Docs] Fix backticks in inline code and documentation links (#28875 )	2024-02-06 11:15:44 -08:00
image_classification.md	Add SiglipForImageClassification and CLIPForImageClassification (#28952 )	2024-02-14 08:41:31 +01:00
image_feature_extraction.md	Image Feature Extraction docs (#28973 )	2024-02-27 09:39:58 +00:00
image_to_image.md	Image-to-Image Task Guide (#26595 )	2023-10-16 15:12:03 +02:00
knowledge_distillation_for_image_classification.md	fixed typos (issue 27919) (#27920 )	2023-12-11 18:44:23 -05:00
language_modeling.md	[`Add Mamba`] Adds support for the `Mamba` models (#28094 )	2024-03-05 20:01:06 +09:00
mask_generation.md	Mask Generation Task Guide (#28897 )	2024-02-14 18:29:49 +00:00
masked_language_modeling.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
monocular_depth_estimation.md	Add Depth Anything (#28654 )	2024-01-25 09:34:50 +01:00
multiple_choice.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
object_detection.md	Fixing visualization code for object detection to support both types of bounding box. (#27842 )	2023-12-22 13:24:40 +00:00
prompting.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
question_answering.md	fix the post-processing link (#29091 )	2024-02-19 10:15:58 +00:00
semantic_segmentation.md	Fix indentation error - semantic_segmentation.md (#28117 )	2023-12-18 12:47:54 -05:00
sequence_classification.md	Starcoder2 model - bis (#29215 )	2024-02-28 01:24:34 +01:00
summarization.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
text-to-speech.md	Add FastSpeech2Conformer (#23439 )	2024-01-03 18:01:06 +00:00
token_classification.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
translation.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
video_classification.md	[Docs] Add language identifiers to fenced code blocks (#28955 )	2024-02-12 10:48:31 -08:00
visual_question_answering.md	VQA task guide (#25244 )	2023-08-09 08:29:06 -04:00
zero_shot_image_classification.md	[docs] Fix model reference in zero shot image classification example (#26206 )	2023-09-19 00:45:12 +02:00
zero_shot_object_detection.md	[Docs] Update README and default pipelines (#28864 )	2024-02-12 10:21:36 +01:00