transformers/tests/models
Abhi Venigalla 005b957fb8
Add DBRX Model (#29921)
* wip

* fix __init__.py

* add docs

* Apply suggestions from code review

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* address comments 1

* work on make fixup

* pass configs down

* add sdpa attention

* remove DbrxBlock

* add to configuration_auto

* docstring now passes formatting test

* fix style

* update READMEs

* add dbrx to modeling_auto

* make fix-copies generated this

* add DBRX_PRETRAINED_CONFIG_ARCHIVE_MAP

* config docstring passes formatting test

* rename moe_loss_weight to router_aux_loss_coef

* add to flash-attn documentation

* fix model-path in tests

* Explicitly make `"suli"` the default `ffn_act_fn`

Co-authored-by: Wing Lian <wing.lian@gmail.com>

* default to using router_aux_loss_coef over ffn_config[moe_loss_weight]

* fix _flash_attn_uses_top_left_mask and is_causal

* fix tests path

* don't use token type IDs

* follow Llama and remove token_type_ids from test

* init ConfigTester differently so tests pass

* remove multiple choice test

* remove question + answer test

* remove sequence classification test

* remove token classification test

* copy Llama tests and remove token_type_ids from test inputs

* do not test pruning or headmasking; style code

* add _tied_weights_keys parameter to pass test

* add type hints

* fix type check

* update config tester

* remove masked_lm test

* remove encoder tests

* initialize DbrxModelTester with correct params

* style

* torch_dtype does not rely on torch

* run make fixup, fix-copies

* use https://huggingface.co/v2ray/dbrx-base-fixed/blob/main/modeling_dbrx.py

* add copyright info

* fix imports and DbrxRotaryEmbedding

* update DbrxModel docstring

* use copies

* change model path in docstring

* use config in DbrxFFN

* fix flashattention2, sdpaattention

* input config to DbrXAttention, DbrxNormAttentionNorm

* more fixes

* fix

* fix again!

* add informative comment

* fix ruff?

* remove print statement + style

* change doc-test

* fix doc-test

* fix docstring

* delete commented out text

* make defaults match dbrx-instruct

* replace `router_aux_loss_coef` with `moe_loss_weight`

* is_decoder=True

* remove is_decoder from configtester

* implement sdpa properly

* make is_decoder pass tests

* start on the GenerationTesterMixin tests

* add dbrx to sdpa documentation

* skip weight typing test

* style

* initialize smaller model

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Add DBRX to toctree

* skip test_new_cache_format

* make config defaults smaller again

* add pad_token_id

* remove pad_token_id from config

* Remove all references to DBRX_PRETRAINED_CONFIG_ARCHIVE_MAP

* Update src/transformers/models/dbrx/__init__.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/dbrx/modeling_dbrx.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update docs/source/en/model_doc/dbrx.md

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Update src/transformers/models/dbrx/configuration_dbrx.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update docs/source/en/model_doc/dbrx.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix typo

* Apply suggestions from code review

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* update docs, fix configuration_auto.py

* address pr comments

* remove is_decoder flag

* slice

* fix requires grad

* remove grad

* disconnect differently

* remove grad

* enable grads

* patch

* detach expert

* nissan al ghaib

* Update modeling_dbrx.py

* Update src/transformers/models/dbrx/modeling_dbrx.py

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* replace "Gemma" with "Dbrx"

* remove # type: ignore

* don't hardcode vocab_size

* remove ToDo

* Re-add removed idefics2 line

* Update test to use tiny-random!

* Remove TODO

* Remove one more case of loading the entire dbrx-instruct in the tests

* Update src/transformers/models/dbrx/modeling_dbrx.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* address some comments

* small model

* add dbrx to tokenization_auto

* More docstrings with add_start_docstrings

* Dbrx for now

* add PipelineTesterMixin

* Update src/transformers/models/dbrx/configuration_dbrx.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* remove flash-attn2 import error

* fix docstring

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add useage example

* put on one line

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix ffn_act_fn

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* change "dbrx" to "DBRX" for display purposes.

* fix __init__.py?

* fix __init__.py

* fix README

* return the aux_loss

* remove extra spaces

* fix configuration_auto.py

* fix format in tokenization_auto

* remove new line

* add more useage examples

---------

Co-authored-by: Abhi Venigalla <abhi.venigalla@databricks.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Eitan Turok <eitan.turok@databricks.com>
Co-authored-by: Eitan Turok <150733043+eitanturok@users.noreply.github.com>
Co-authored-by: Wing Lian <wing.lian@gmail.com>
Co-authored-by: Eitan Turok <eitanturok@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Your Name <you@example.com>
Co-authored-by: Mihir Patel <mihir.v.patel7@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-04-18 15:18:52 +02:00
..
albert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
align Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
altclip Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
audio_spectrogram_transformer fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
auto Fix auto tests (#30067) 2024-04-05 17:49:46 +02:00
autoformer Add tests for batching support (#29297) 2024-03-12 17:46:19 +00:00
bark Fix FA2 tests (#29909) 2024-04-01 07:51:00 +00:00
bart Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
barthez Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
bartpho Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
beit Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
bert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
bert_generation Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
bert_japanese Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
bertweet Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
big_bird Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
bigbird_pegasus Generate: left-padding test, revisited (#29515) 2024-03-08 10:06:46 +00:00
biogpt [`generate`] fix breaking change for patch (#29976) 2024-04-02 09:51:45 +02:00
bit Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
blenderbot Generate: left-padding test, revisited (#29515) 2024-03-08 10:06:46 +00:00
blenderbot_small Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
blip BLIP - fix pt-tf equivalence test (#30258) 2024-04-16 17:46:53 +01:00
blip_2 BLIP - fix pt-tf equivalence test (#30258) 2024-04-16 17:46:53 +01:00
bloom Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
bridgetower Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
bros [tests] add the missing `require_torch_multi_gpu` flag (#30250) 2024-04-15 16:30:52 +01:00
byt5 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
camembert Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
canine Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
chinese_clip Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
clap Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
clip Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
clipseg Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
clvp Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
code_llama Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
codegen Add token type ids to CodeGenTokenizer (#29265) 2024-04-17 12:19:18 +02:00
cohere Fix `torch.fx` symbolic tracing for LLama (#30047) 2024-04-05 15:14:09 +02:00
conditional_detr Add tests for batching support (#29297) 2024-03-12 17:46:19 +00:00
convbert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
convnext Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
convnextv2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
cpm Fix PipelineTests skip conditions (#22320) 2023-03-22 20:02:24 +01:00
cpmant Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
ctrl Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
cvt Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
data2vec Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
dbrx Add DBRX Model (#29921) 2024-04-18 15:18:52 +02:00
deberta Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
deberta_v2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
decision_transformer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
deformable_detr DeformableDETR support bfloat16 (#29232) 2024-03-04 14:18:09 +00:00
deit Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
depth_anything Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
deta [`YOLOS`] Fix - return padded annotations (#29300) 2024-03-01 09:42:13 +00:00
detr Add tests for batching support (#29297) 2024-03-12 17:46:19 +00:00
dinat Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
dinov2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
distilbert Fix FA2 tests (#29909) 2024-04-01 07:51:00 +00:00
dit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
donut Fix donut token2json multiline (#30300) 2024-04-18 09:30:40 +01:00
dpr Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
dpt Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
efficientformer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
efficientnet Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
electra Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
encodec fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
encoder_decoder Generate: missing generation config eos token setting in encoder-decoder tests (#29146) 2024-02-20 16:17:51 +00:00
ernie Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
ernie_m Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
esm Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
falcon Fix falcon with SDPA, alibi but no passed mask (#30123) 2024-04-08 22:25:07 +08:00
fastspeech2_conformer Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
flaubert Fix 29807 sinusoidal positional encodings in Flaubert, Informer and XLM (#29904) 2024-04-02 10:27:26 +02:00
flava Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
fnet Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915) 2024-03-28 09:53:31 +00:00
focalnet Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
fsmt Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
funnel Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
fuyu Update tiny model summary file (#27388) 2023-11-23 21:00:39 +01:00
gemma Fix slow tests for important models to be compatible with A10 runners (#29905) 2024-04-09 13:28:54 +02:00
git fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
glpn Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
gpt2 Adding Flash Attention 2 Support for GPT2 (#29226) 2024-03-28 09:31:24 +00:00
gpt_bigcode CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
gpt_neo Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
gpt_neox RoPE models: add numerical sanity-check test for RoPE scaling (#29808) 2024-03-28 11:25:50 +00:00
gpt_neox_japanese Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
gpt_sw3 Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
gptj Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
gptsan_japanese Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
graphormer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
grounding_dino Adding grounding dino (#26087) 2024-04-11 08:32:16 +01:00
groupvit Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
herbert Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
hubert Fix failing tests on `main` due to torch 2.1 (#26607) 2023-10-05 10:27:05 +02:00
ibert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
idefics [`ProcessingIdefics`] Attention mask bug with padding (#29449) 2024-04-04 10:11:09 +01:00
idefics2 Add Idefics2 (#30253) 2024-04-15 17:03:03 +01:00
imagegpt Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
informer Fix 29807 sinusoidal positional encodings in Flaubert, Informer and XLM (#29904) 2024-04-02 10:27:26 +02:00
instructblip BLIP - fix pt-tf equivalence test (#30258) 2024-04-16 17:46:53 +01:00
jamba Add jamba (#29943) 2024-04-18 11:04:02 +02:00
jukebox Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915) 2024-03-28 09:53:31 +00:00
kosmos2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
layoutlm Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
layoutlmv2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
layoutlmv3 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
layoutxlm Add correct batched handling for apply_chat_template (#29222) 2024-03-20 15:50:22 +00:00
led Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
levit Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
lilt Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
llama Fix slow tests for important models to be compatible with A10 runners (#29905) 2024-04-09 13:28:54 +02:00
llava Fix Llava chat template examples (#30130) 2024-04-11 10:38:24 +02:00
llava_next Fix vipllava for generation (#29874) 2024-04-03 17:00:08 +01:00
longformer Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
longt5 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
luke Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
lxmert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
m2m_100 Add Flash Attention 2 to M2M100 model (#30256) 2024-04-18 10:27:58 +02:00
mamba Mamba `slow_forward` gradient fix (#29563) 2024-03-27 04:52:12 +01:00
marian Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
markuplm Add correct batched handling for apply_chat_template (#29222) 2024-03-20 15:50:22 +00:00
mask2former fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
maskformer fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
mbart Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
mbart50 Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
mega Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
megatron_bert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
megatron_gpt2 Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
mgp_str Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
mistral Enable fx tracing for Mistral (#30209) 2024-04-17 14:38:48 +05:00
mixtral Enable fx tracing for Mistral (#30209) 2024-04-17 14:38:48 +05:00
mluke Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
mobilebert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
mobilenet_v1 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
mobilenet_v2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
mobilevit Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
mobilevitv2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
mpnet Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
mpt Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
mra Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
mt5 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
musicgen Add Flash Attention 2 support to Musicgen and Musicgen Melody (#29939) 2024-04-02 11:23:49 +01:00
musicgen_melody Add Flash Attention 2 support to Musicgen and Musicgen Melody (#29939) 2024-04-02 11:23:49 +01:00
mvp Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
nat Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
nezha Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
nllb Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
nllb_moe Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915) 2024-03-28 09:53:31 +00:00
nougat Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
nystromformer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
olmo Add OLMo model family (#29890) 2024-04-17 17:59:07 +02:00
oneformer Fix OneFormer `post_process_instance_segmentation` for panoptic tasks (#29304) 2024-03-04 11:04:49 +00:00
openai Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
opt Proper build() methods for TF (#27794) 2023-12-14 15:17:30 +00:00
owlv2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
owlvit Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
patchtsmixer PatchtTST and PatchTSMixer fixes (#28083) 2024-01-29 10:09:26 +00:00
patchtst PatchtTST and PatchTSMixer fixes (#28083) 2024-01-29 10:09:26 +00:00
pegasus Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
pegasus_x device agnostic models testing (#27146) 2023-10-31 18:12:14 +01:00
perceiver Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
persimmon [tests] add `require_bitsandbytes` marker (#30116) 2024-04-08 12:49:31 +01:00
phi RoPE models: add numerical sanity-check test for RoPE scaling (#29808) 2024-03-28 11:25:50 +00:00
phobert Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
pix2struct BLIP - fix pt-tf equivalence test (#30258) 2024-04-16 17:46:53 +01:00
plbart Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
poolformer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
pop2piano Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
prophetnet Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
pvt Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
pvt_v2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
qdqbert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
qwen2 Enable fx tracing for Mistral (#30209) 2024-04-17 14:38:48 +05:00
qwen2_moe Enable fx tracing for Mistral (#30209) 2024-04-17 14:38:48 +05:00
rag Add `dataset_revision` argument to `RagConfig` (#29610) 2024-03-14 16:48:11 +01:00
realm Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
recurrent_gemma Fix `RecurrentGemmaIntegrationTest.test_2b_sample` (#30222) 2024-04-12 17:53:25 +02:00
reformer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
regnet Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
rembert fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
resnet Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
roberta Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
roberta_prelayernorm Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
roc_bert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
roformer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
rwkv Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
sam Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915) 2024-03-28 09:53:31 +00:00
seamless_m4t Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
seamless_m4t_v2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
segformer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
seggpt Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
sew Fix typo (#25966) 2023-09-05 10:12:25 +02:00
sew_d Fix typo (#25966) 2023-09-05 10:12:25 +02:00
siglip Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
speech_encoder_decoder Generate: missing generation config eos token setting in encoder-decoder tests (#29146) 2024-02-20 16:17:51 +00:00
speech_to_text fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
speech_to_text_2 Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
speecht5 fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
splinter Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
squeezebert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
stablelm [`StableLm`] Add QK normalization and Parallel Residual Support (#29745) 2024-04-08 23:51:58 +02:00
starcoder2 Fix FA2 tests (#29909) 2024-04-01 07:51:00 +00:00
superpoint Update output of SuperPointForKeypointDetection (#29809) 2024-04-11 14:59:30 +02:00
swiftformer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
swin Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
swin2sr Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
swinv2 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
switch_transformers Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915) 2024-03-28 09:53:31 +00:00
t5 Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
table_transformer Add tests for batching support (#29297) 2024-03-12 17:46:19 +00:00
tapas Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
time_series_transformer Add tests for batching support (#29297) 2024-03-12 17:46:19 +00:00
timesformer Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
timm_backbone [AutoBackbone] Add test (#26094) 2023-09-18 23:47:54 +02:00
trocr CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
tvlt fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
tvp Enable instantiating model with pretrained backbone weights (#28214) 2024-01-23 11:01:50 +00:00
udop fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) 2024-04-15 09:36:06 +01:00
umt5 Tests: replace `torch.testing.assert_allclose` by `torch.testing.assert_close` (#29915) 2024-03-28 09:53:31 +00:00
unispeech [Wav2Vec2 and Co] Update init tests for PT 2.1 (#26494) 2023-10-03 10:52:34 +02:00
unispeech_sat Byebye torch 1.10 (#28207) 2024-01-11 16:18:27 +01:00
univnet Add tests for batching support (#29297) 2024-03-12 17:46:19 +00:00
upernet Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
videomae Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
vilt Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
vipllava Add LLaVa-1.6, bis (#29586) 2024-03-20 15:51:12 +00:00
vision_encoder_decoder Fix `VisionEncoderDecoder` Positional Arg (#29497) 2024-03-07 20:45:51 +00:00
vision_text_dual_encoder [`Styling`] stylify using ruff (#27144) 2023-11-16 17:43:19 +01:00
visual_bert Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
vit Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
vit_hybrid Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
vit_mae Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
vit_msn Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
vitdet mark `test_initialization` as flaky in 2 model tests (#27906) 2023-12-08 14:54:32 +01:00
vitmatte Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
vits Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
vivit Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
wav2vec2 Fix `skip_special_tokens` for `Wav2Vec2CTCTokenizer._decode` (#29311) 2024-04-02 16:55:11 +02:00
wav2vec2_bert Add new meta w2v2-conformer BERT-like model (#28165) 2024-01-18 13:37:34 +00:00
wav2vec2_conformer device agnostic models testing (#27146) 2023-10-31 18:12:14 +01:00
wav2vec2_phoneme Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
wav2vec2_with_lm Fix some tests using `"common_voice"` (#27147) 2023-10-30 15:27:15 +01:00
wavlm Byebye torch 1.10 (#28207) 2024-01-11 16:18:27 +01:00
whisper Fix slow tests for important models to be compatible with A10 runners (#29905) 2024-04-09 13:28:54 +02:00
x_clip Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
xglm Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
xlm Fix 29807 sinusoidal positional encodings in Flaubert, Informer and XLM (#29904) 2024-04-02 10:27:26 +02:00
xlm_prophetnet Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
xlm_roberta Adds pretrained IDs directly in the tests (#29534) 2024-03-13 14:53:27 +01:00
xlm_roberta_xl Revert low cpu mem tie weights (#29135) 2024-02-20 12:06:46 +00:00
xlnet Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
xmod Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
yolos Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
yoso Remove static pretrained maps from the library's internals (#29112) 2024-03-25 10:33:38 +01:00
__init__.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00