transformers

History

Abhi Venigalla 005b957fb8 Add DBRX Model (#29921 ) * wip * fix __init__.py * add docs * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * address comments 1 * work on make fixup * pass configs down * add sdpa attention * remove DbrxBlock * add to configuration_auto * docstring now passes formatting test * fix style * update READMEs * add dbrx to modeling_auto * make fix-copies generated this * add DBRX_PRETRAINED_CONFIG_ARCHIVE_MAP * config docstring passes formatting test * rename moe_loss_weight to router_aux_loss_coef * add to flash-attn documentation * fix model-path in tests * Explicitly make `"suli"` the default `ffn_act_fn` Co-authored-by: Wing Lian <wing.lian@gmail.com> * default to using router_aux_loss_coef over ffn_config[moe_loss_weight] * fix _flash_attn_uses_top_left_mask and is_causal * fix tests path * don't use token type IDs * follow Llama and remove token_type_ids from test * init ConfigTester differently so tests pass * remove multiple choice test * remove question + answer test * remove sequence classification test * remove token classification test * copy Llama tests and remove token_type_ids from test inputs * do not test pruning or headmasking; style code * add _tied_weights_keys parameter to pass test * add type hints * fix type check * update config tester * remove masked_lm test * remove encoder tests * initialize DbrxModelTester with correct params * style * torch_dtype does not rely on torch * run make fixup, fix-copies * use https://huggingface.co/v2ray/dbrx-base-fixed/blob/main/modeling_dbrx.py * add copyright info * fix imports and DbrxRotaryEmbedding * update DbrxModel docstring * use copies * change model path in docstring * use config in DbrxFFN * fix flashattention2, sdpaattention * input config to DbrXAttention, DbrxNormAttentionNorm * more fixes * fix * fix again! * add informative comment * fix ruff? * remove print statement + style * change doc-test * fix doc-test * fix docstring * delete commented out text * make defaults match dbrx-instruct * replace `router_aux_loss_coef` with `moe_loss_weight` * is_decoder=True * remove is_decoder from configtester * implement sdpa properly * make is_decoder pass tests * start on the GenerationTesterMixin tests * add dbrx to sdpa documentation * skip weight typing test * style * initialize smaller model Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Add DBRX to toctree * skip test_new_cache_format * make config defaults smaller again * add pad_token_id * remove pad_token_id from config * Remove all references to DBRX_PRETRAINED_CONFIG_ARCHIVE_MAP * Update src/transformers/models/dbrx/__init__.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/dbrx/modeling_dbrx.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update docs/source/en/model_doc/dbrx.md Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update src/transformers/models/dbrx/configuration_dbrx.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update docs/source/en/model_doc/dbrx.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fix typo * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * update docs, fix configuration_auto.py * address pr comments * remove is_decoder flag * slice * fix requires grad * remove grad * disconnect differently * remove grad * enable grads * patch * detach expert * nissan al ghaib * Update modeling_dbrx.py * Update src/transformers/models/dbrx/modeling_dbrx.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * replace "Gemma" with "Dbrx" * remove # type: ignore * don't hardcode vocab_size * remove ToDo * Re-add removed idefics2 line * Update test to use tiny-random! * Remove TODO * Remove one more case of loading the entire dbrx-instruct in the tests * Update src/transformers/models/dbrx/modeling_dbrx.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * address some comments * small model * add dbrx to tokenization_auto * More docstrings with add_start_docstrings * Dbrx for now * add PipelineTesterMixin * Update src/transformers/models/dbrx/configuration_dbrx.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * remove flash-attn2 import error * fix docstring Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add useage example * put on one line Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * fix ffn_act_fn Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * change "dbrx" to "DBRX" for display purposes. * fix __init__.py? * fix __init__.py * fix README * return the aux_loss * remove extra spaces * fix configuration_auto.py * fix format in tokenization_auto * remove new line * add more useage examples --------- Co-authored-by: Abhi Venigalla <abhi.venigalla@databricks.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Eitan Turok <eitan.turok@databricks.com> Co-authored-by: Eitan Turok <150733043+eitanturok@users.noreply.github.com> Co-authored-by: Wing Lian <wing.lian@gmail.com> Co-authored-by: Eitan Turok <eitanturok@gmail.com> Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: Matt <rocketknight1@gmail.com> Co-authored-by: Your Name <you@example.com> Co-authored-by: Mihir Patel <mihir.v.patel7@gmail.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>		2024-04-18 15:18:52 +02:00
..
albert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
align	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
altclip	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
audio_spectrogram_transformer	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
auto	Fix auto tests (#30067 )	2024-04-05 17:49:46 +02:00
autoformer	Add tests for batching support (#29297 )	2024-03-12 17:46:19 +00:00
bark	Fix FA2 tests (#29909 )	2024-04-01 07:51:00 +00:00
bart	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
barthez	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
bartpho	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
beit	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
bert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
bert_generation	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
bert_japanese	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
bertweet	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
big_bird	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
bigbird_pegasus	Generate: left-padding test, revisited (#29515 )	2024-03-08 10:06:46 +00:00
biogpt	[`generate`] fix breaking change for patch (#29976 )	2024-04-02 09:51:45 +02:00
bit	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
blenderbot	Generate: left-padding test, revisited (#29515 )	2024-03-08 10:06:46 +00:00
blenderbot_small	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
blip	BLIP - fix pt-tf equivalence test (#30258 )	2024-04-16 17:46:53 +01:00
blip_2	BLIP - fix pt-tf equivalence test (#30258 )	2024-04-16 17:46:53 +01:00
bloom	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
bridgetower	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
bros	[tests] add the missing `require_torch_multi_gpu` flag (#30250 )	2024-04-15 16:30:52 +01:00
byt5	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
camembert	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
canine	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
chinese_clip	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
clap	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
clip	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
clipseg	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
clvp	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
code_llama	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
codegen	Add token type ids to CodeGenTokenizer (#29265 )	2024-04-17 12:19:18 +02:00
cohere	Fix `torch.fx` symbolic tracing for LLama (#30047 )	2024-04-05 15:14:09 +02:00
conditional_detr	Add tests for batching support (#29297 )	2024-03-12 17:46:19 +00:00
convbert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
convnext	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
convnextv2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
cpm	Fix PipelineTests skip conditions (#22320 )	2023-03-22 20:02:24 +01:00
cpmant	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
ctrl	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
cvt	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
data2vec	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
dbrx	Add DBRX Model (#29921 )	2024-04-18 15:18:52 +02:00
deberta	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
deberta_v2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
decision_transformer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
deformable_detr	DeformableDETR support bfloat16 (#29232 )	2024-03-04 14:18:09 +00:00
deit	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
depth_anything	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
deta	[`YOLOS`] Fix - return padded annotations (#29300 )	2024-03-01 09:42:13 +00:00
detr	Add tests for batching support (#29297 )	2024-03-12 17:46:19 +00:00
dinat	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
dinov2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
distilbert	Fix FA2 tests (#29909 )	2024-04-01 07:51:00 +00:00
dit	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
donut	Fix donut token2json multiline (#30300 )	2024-04-18 09:30:40 +01:00
dpr	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
dpt	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
efficientformer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
efficientnet	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
electra	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
encodec	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
encoder_decoder	Generate: missing generation config eos token setting in encoder-decoder tests (#29146 )	2024-02-20 16:17:51 +00:00
ernie	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
ernie_m	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
esm	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
falcon	Fix falcon with SDPA, alibi but no passed mask (#30123 )	2024-04-08 22:25:07 +08:00
fastspeech2_conformer	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
flaubert	Fix 29807 sinusoidal positional encodings in Flaubert, Informer and XLM (#29904 )	2024-04-02 10:27:26 +02:00
flava	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
fnet	Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915 )	2024-03-28 09:53:31 +00:00
focalnet	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
fsmt	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
funnel	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
fuyu	Update tiny model summary file (#27388 )	2023-11-23 21:00:39 +01:00
gemma	Fix slow tests for important models to be compatible with A10 runners (#29905 )	2024-04-09 13:28:54 +02:00
git	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
glpn	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
gpt2	Adding Flash Attention 2 Support for GPT2 (#29226 )	2024-03-28 09:31:24 +00:00
gpt_bigcode	CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266 )	2023-08-02 20:22:36 +02:00
gpt_neo	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
gpt_neox	RoPE models: add numerical sanity-check test for RoPE scaling (#29808 )	2024-03-28 11:25:50 +00:00
gpt_neox_japanese	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
gpt_sw3	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
gptj	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
gptsan_japanese	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
graphormer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
grounding_dino	Adding grounding dino (#26087 )	2024-04-11 08:32:16 +01:00
groupvit	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
herbert	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
hubert	Fix failing tests on `main` due to torch 2.1 (#26607 )	2023-10-05 10:27:05 +02:00
ibert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
idefics	[`ProcessingIdefics`] Attention mask bug with padding (#29449 )	2024-04-04 10:11:09 +01:00
idefics2	Add Idefics2 (#30253 )	2024-04-15 17:03:03 +01:00
imagegpt	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
informer	Fix 29807 sinusoidal positional encodings in Flaubert, Informer and XLM (#29904 )	2024-04-02 10:27:26 +02:00
instructblip	BLIP - fix pt-tf equivalence test (#30258 )	2024-04-16 17:46:53 +01:00
jamba	Add jamba (#29943 )	2024-04-18 11:04:02 +02:00
jukebox	Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915 )	2024-03-28 09:53:31 +00:00
kosmos2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
layoutlm	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
layoutlmv2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
layoutlmv3	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
layoutxlm	Add correct batched handling for apply_chat_template (#29222 )	2024-03-20 15:50:22 +00:00
led	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
levit	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
lilt	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
llama	Fix slow tests for important models to be compatible with A10 runners (#29905 )	2024-04-09 13:28:54 +02:00
llava	Fix Llava chat template examples (#30130 )	2024-04-11 10:38:24 +02:00
llava_next	Fix vipllava for generation (#29874 )	2024-04-03 17:00:08 +01:00
longformer	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
longt5	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
luke	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
lxmert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
m2m_100	Add Flash Attention 2 to M2M100 model (#30256 )	2024-04-18 10:27:58 +02:00
mamba	Mamba `slow_forward` gradient fix (#29563 )	2024-03-27 04:52:12 +01:00
marian	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
markuplm	Add correct batched handling for apply_chat_template (#29222 )	2024-03-20 15:50:22 +00:00
mask2former	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
maskformer	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
mbart	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
mbart50	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
mega	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
megatron_bert	CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266 )	2023-08-02 20:22:36 +02:00
megatron_gpt2	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
mgp_str	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
mistral	Enable fx tracing for Mistral (#30209 )	2024-04-17 14:38:48 +05:00
mixtral	Enable fx tracing for Mistral (#30209 )	2024-04-17 14:38:48 +05:00
mluke	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
mobilebert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
mobilenet_v1	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
mobilenet_v2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
mobilevit	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
mobilevitv2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
mpnet	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
mpt	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
mra	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
mt5	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
musicgen	Add Flash Attention 2 support to Musicgen and Musicgen Melody (#29939 )	2024-04-02 11:23:49 +01:00
musicgen_melody	Add Flash Attention 2 support to Musicgen and Musicgen Melody (#29939 )	2024-04-02 11:23:49 +01:00
mvp	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
nat	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
nezha	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
nllb	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
nllb_moe	Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915 )	2024-03-28 09:53:31 +00:00
nougat	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
nystromformer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
olmo	Add OLMo model family (#29890 )	2024-04-17 17:59:07 +02:00
oneformer	Fix OneFormer `post_process_instance_segmentation` for panoptic tasks (#29304 )	2024-03-04 11:04:49 +00:00
openai	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
opt	Proper build() methods for TF (#27794 )	2023-12-14 15:17:30 +00:00
owlv2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
owlvit	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
patchtsmixer	PatchtTST and PatchTSMixer fixes (#28083 )	2024-01-29 10:09:26 +00:00
patchtst	PatchtTST and PatchTSMixer fixes (#28083 )	2024-01-29 10:09:26 +00:00
pegasus	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
pegasus_x	device agnostic models testing (#27146 )	2023-10-31 18:12:14 +01:00
perceiver	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
persimmon	[tests] add `require_bitsandbytes` marker (#30116 )	2024-04-08 12:49:31 +01:00
phi	RoPE models: add numerical sanity-check test for RoPE scaling (#29808 )	2024-03-28 11:25:50 +00:00
phobert	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
pix2struct	BLIP - fix pt-tf equivalence test (#30258 )	2024-04-16 17:46:53 +01:00
plbart	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
poolformer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
pop2piano	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
prophetnet	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
pvt	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
pvt_v2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
qdqbert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
qwen2	Enable fx tracing for Mistral (#30209 )	2024-04-17 14:38:48 +05:00
qwen2_moe	Enable fx tracing for Mistral (#30209 )	2024-04-17 14:38:48 +05:00
rag	Add `dataset_revision` argument to `RagConfig` (#29610 )	2024-03-14 16:48:11 +01:00
realm	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
recurrent_gemma	Fix `RecurrentGemmaIntegrationTest.test_2b_sample` (#30222 )	2024-04-12 17:53:25 +02:00
reformer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
regnet	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
rembert	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
resnet	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
roberta	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
roberta_prelayernorm	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
roc_bert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
roformer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
rwkv	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
sam	Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915 )	2024-03-28 09:53:31 +00:00
seamless_m4t	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
seamless_m4t_v2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
segformer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
seggpt	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
sew	Fix typo (#25966 )	2023-09-05 10:12:25 +02:00
sew_d	Fix typo (#25966 )	2023-09-05 10:12:25 +02:00
siglip	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
speech_encoder_decoder	Generate: missing generation config eos token setting in encoder-decoder tests (#29146 )	2024-02-20 16:17:51 +00:00
speech_to_text	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
speech_to_text_2	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
speecht5	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
splinter	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
squeezebert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
stablelm	[`StableLm`] Add QK normalization and Parallel Residual Support (#29745 )	2024-04-08 23:51:58 +02:00
starcoder2	Fix FA2 tests (#29909 )	2024-04-01 07:51:00 +00:00
superpoint	Update output of SuperPointForKeypointDetection (#29809 )	2024-04-11 14:59:30 +02:00
swiftformer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
swin	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
swin2sr	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
swinv2	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
switch_transformers	Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915 )	2024-03-28 09:53:31 +00:00
t5	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
table_transformer	Add tests for batching support (#29297 )	2024-03-12 17:46:19 +00:00
tapas	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
time_series_transformer	Add tests for batching support (#29297 )	2024-03-12 17:46:19 +00:00
timesformer	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
timm_backbone	[AutoBackbone] Add test (#26094 )	2023-09-18 23:47:54 +02:00
trocr	CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266 )	2023-08-02 20:22:36 +02:00
tvlt	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
tvp	Enable instantiating model with pretrained backbone weights (#28214 )	2024-01-23 11:01:50 +00:00
udop	fix: Replace deprecated `assertEquals` with `assertEqual` (#30241 )	2024-04-15 09:36:06 +01:00
umt5	Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915 )	2024-03-28 09:53:31 +00:00
unispeech	[Wav2Vec2 and Co] Update init tests for PT 2.1 (#26494 )	2023-10-03 10:52:34 +02:00
unispeech_sat	Byebye torch 1.10 (#28207 )	2024-01-11 16:18:27 +01:00
univnet	Add tests for batching support (#29297 )	2024-03-12 17:46:19 +00:00
upernet	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
videomae	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
vilt	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
vipllava	Add LLaVa-1.6, bis (#29586 )	2024-03-20 15:51:12 +00:00
vision_encoder_decoder	Fix `VisionEncoderDecoder` Positional Arg (#29497 )	2024-03-07 20:45:51 +00:00
vision_text_dual_encoder	[`Styling`] stylify using ruff (#27144 )	2023-11-16 17:43:19 +01:00
visual_bert	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
vit	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
vit_hybrid	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
vit_mae	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
vit_msn	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
vitdet	mark `test_initialization` as flaky in 2 model tests (#27906 )	2023-12-08 14:54:32 +01:00
vitmatte	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
vits	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
vivit	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
wav2vec2	Fix `skip_special_tokens` for `Wav2Vec2CTCTokenizer._decode` (#29311 )	2024-04-02 16:55:11 +02:00
wav2vec2_bert	Add new meta w2v2-conformer BERT-like model (#28165 )	2024-01-18 13:37:34 +00:00
wav2vec2_conformer	device agnostic models testing (#27146 )	2023-10-31 18:12:14 +01:00
wav2vec2_phoneme	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
wav2vec2_with_lm	Fix some tests using `"common_voice"` (#27147 )	2023-10-30 15:27:15 +01:00
wavlm	Byebye torch 1.10 (#28207 )	2024-01-11 16:18:27 +01:00
whisper	Fix slow tests for important models to be compatible with A10 runners (#29905 )	2024-04-09 13:28:54 +02:00
x_clip	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
xglm	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
xlm	Fix 29807 sinusoidal positional encodings in Flaubert, Informer and XLM (#29904 )	2024-04-02 10:27:26 +02:00
xlm_prophetnet	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
xlm_roberta	Adds pretrained IDs directly in the tests (#29534 )	2024-03-13 14:53:27 +01:00
xlm_roberta_xl	Revert low cpu mem tie weights (#29135 )	2024-02-20 12:06:46 +00:00
xlnet	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
xmod	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
yolos	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
yoso	Remove static pretrained maps from the library's internals (#29112 )	2024-03-25 10:33:38 +01:00
__init__.py	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00