transformers/tests/models
Susnato Dhar 0e59c93983
update remaining `Pop2Piano` checkpoints (#25827)
update checkpoints
2023-08-29 18:00:40 +01:00
..
albert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
align CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
altclip Hotfix 2023-08-19 11:15:38 +02:00
audio_spectrogram_transformer CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
auto Remote code improvements (#23959) 2023-06-06 14:31:14 -04:00
autoformer Compute `dropout_probability` only in training mode (#24486) 2023-06-26 18:36:47 +02:00
bark Update Bark generation configs and tests (#25409) 2023-08-09 18:28:02 +02:00
bart CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
barthez
bartpho
beit Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
bert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
bert_generation CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
bert_japanese
bertweet
big_bird Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
bigbird_pegasus CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
biogpt CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
bit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
blenderbot CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
blenderbot_small CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
blip Input data format (#25464) 2023-08-16 17:45:02 +01:00
blip_2 CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
bloom Fix failing `test_batch_generation` for bloom (#25718) 2023-08-24 11:15:29 +02:00
bridgetower Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
byt5
camembert Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
canine CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
chinese_clip Input data format (#25464) 2023-08-16 17:45:02 +01:00
clap CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
clip Add FlaxCLIPTextModelWithProjection (#25254) 2023-08-25 10:58:14 +02:00
clipseg Fix `test_model_parallelism` (#25359) 2023-08-08 10:48:45 +02:00
code_llama [`CodeLlama`] Add support for `CodeLlama` (#25740) 2023-08-25 18:57:40 +02:00
codegen CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
conditional_detr Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
convbert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
convnext Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
convnextv2 Update `ConvNextV2ModelIntegrationTest::test_inference_image_classification_head` (#23402) 2023-05-16 23:35:11 +02:00
cpm
cpmant CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
ctrl CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
cvt Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
data2vec Fix `test_model_parallelism` (#25359) 2023-08-08 10:48:45 +02:00
deberta CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
deberta_v2 CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
decision_transformer
deformable_detr Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
deit Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
deta Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
detr fixing name position_embeddings to object_queries (#24652) 2023-08-29 09:09:45 +01:00
dinat Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
dinov2 [DINOv2] Add backbone class (#25520) 2023-08-29 11:05:27 +01:00
distilbert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
dit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
donut Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
dpr CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
dpt Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
efficientformer Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
efficientnet 🚨🚨🚨 Remove softmax for EfficientNetForImageClassification 🚨🚨🚨 (#25501) 2023-08-14 17:08:47 +01:00
electra CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
encodec Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
encoder_decoder Move TF building to an actual build() method (#23760) 2023-06-06 18:30:51 +01:00
ernie CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
ernie_m CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
esm Fix `test_model_parallelism` (#25359) 2023-08-08 10:48:45 +02:00
falcon CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
flaubert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
flava Input data format (#25464) 2023-08-16 17:45:02 +01:00
fnet CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
focalnet Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
fsmt update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
funnel Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
git CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
glpn Input data format (#25464) 2023-08-16 17:45:02 +01:00
gpt2 Correct attention mask dtype for Flax GPT2 (#25636) 2023-08-25 17:36:37 +02:00
gpt_bigcode CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
gpt_neo CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
gpt_neox CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
gpt_neox_japanese CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
gpt_sw3
gptj CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
gptsan_japanese CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
graphormer Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
groupvit CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
herbert
hubert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
ibert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
idefics [idefics] idefics-9b test use 4bit quant (#25734) 2023-08-24 08:33:14 -07:00
imagegpt Input data format (#25464) 2023-08-16 17:45:02 +01:00
informer Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
instructblip Update InstructBLIP & Align values after rescale update (#25209) 2023-08-03 11:01:10 +01:00
jukebox Replaces calls to `.cuda` with `.to(torch_device)` in tests (#25571) 2023-08-18 12:40:40 +02:00
layoutlm CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
layoutlmv2 [`split_special_tokens`] Add support for `split_special_tokens` argument to encode (#25081) 2023-08-18 13:26:27 +02:00
layoutlmv3 [`split_special_tokens`] Add support for `split_special_tokens` argument to encode (#25081) 2023-08-18 13:26:27 +02:00
layoutxlm [`split_special_tokens`] Add support for `split_special_tokens` argument to encode (#25081) 2023-08-18 13:26:27 +02:00
led Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
levit Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
lilt Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
llama [`LlamaTokenizer`] `tokenize` nits. (#25793) 2023-08-29 15:08:14 +02:00
longformer Fix more offload edge cases (#25342) 2023-08-07 17:45:41 +02:00
longt5 CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
luke CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
lxmert Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
m2m_100 update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
marian Marian: post-hack-fix correction (#25459) 2023-08-16 11:49:29 +01:00
markuplm [`split_special_tokens`] Add support for `split_special_tokens` argument to encode (#25081) 2023-08-18 13:26:27 +02:00
mask2former Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
maskformer Fix `MaskFormerModelIntegrationTest` OOM (#25544) 2023-08-16 18:11:24 +02:00
mbart CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
mbart50
mega CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
megatron_bert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
megatron_gpt2
mgp_str Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
mluke
mobilebert Hotfix 2023-08-19 11:15:38 +02:00
mobilenet_v1 Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
mobilenet_v2 Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
mobilevit Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
mobilevitv2 Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
mpnet CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
mpt Fix test_modeling_mpt typo in model id (#25606) 2023-08-21 11:11:21 +02:00
mra CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
mt5 Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
musicgen [MusicGen] Fix integration tests (#25169) 2023-07-28 18:50:15 +01:00
mvp CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
nat Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
nezha CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
nllb 🚨🚨🚨 `[NLLB Tokenizer]` Fix the prefix tokens 🚨🚨🚨 (#22313) 2023-04-04 14:53:06 +02:00
nllb_moe [`NllbMoe`] Update code to properly support loss computation (#25429) 2023-08-17 17:21:56 +02:00
nystromformer CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
oneformer Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
openai CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
opt Replaces calls to `.cuda` with `.to(torch_device)` in tests (#25571) 2023-08-18 12:40:40 +02:00
owlvit Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
pegasus CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
pegasus_x CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
perceiver Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
phobert
pix2struct Input data format (#25464) 2023-08-16 17:45:02 +01:00
plbart CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
poolformer Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
pop2piano update remaining `Pop2Piano` checkpoints (#25827) 2023-08-29 18:00:40 +01:00
prophetnet CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
pvt Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
qdqbert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
rag Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
realm CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
reformer Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
regnet Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
rembert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
resnet Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
roberta Fix `test_model_parallelism` (#25359) 2023-08-08 10:48:45 +02:00
roberta_prelayernorm Fix `test_model_parallelism` (#25359) 2023-08-08 10:48:45 +02:00
roc_bert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
roformer CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
rwkv CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
sam Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
segformer Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
sew CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
sew_d CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
speech_encoder_decoder
speech_to_text Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
speech_to_text_2 CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
speecht5 Add Number Normalisation for SpeechT5 (#25447) 2023-08-22 08:12:57 +02:00
splinter CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
squeezebert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
swiftformer Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
swin Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
swin2sr Input data format (#25464) 2023-08-16 17:45:02 +01:00
swinv2 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
switch_transformers Switch Transformers: remove overwritten beam sample test (#25458) 2023-08-11 13:16:01 +01:00
t5 [`LlamaTokenizer`] `tokenize` nits. (#25793) 2023-08-29 15:08:14 +02:00
table_transformer Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
tapas CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
time_series_transformer Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
timesformer CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
timm_backbone Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
transfo_xl CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
trocr CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
tvlt Input data format (#25464) 2023-08-16 17:45:02 +01:00
umt5 CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
unispeech CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
unispeech_sat CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
upernet Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
videomae Input data format (#25464) 2023-08-16 17:45:02 +01:00
vilt Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
vision_encoder_decoder Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vision_text_dual_encoder Fix `VisionTextDualEncoderIntegrationTest` (#24661) 2023-07-05 13:44:30 +02:00
visual_bert CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
vit Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
vit_hybrid fix vit hybrid test (#25543) 2023-08-16 17:02:57 +02:00
vit_mae CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
vit_msn CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
vitdet Add ViTDet (#25524) 2023-08-29 10:03:52 +01:00
vivit Input data format (#25464) 2023-08-16 17:45:02 +01:00
wav2vec2 Mark flaky tests (#25463) 2023-08-11 15:26:45 +01:00
wav2vec2_conformer CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
wav2vec2_phoneme
wav2vec2_with_lm Fix `test_word_time_stamp_integration` for `Wav2Vec2ProcessorWithLMTest` (#22800) 2023-04-17 12:41:55 +02:00
wavlm CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
whisper Skip `test_beam_search_xla_generate_simple` for `T5` (#25566) 2023-08-17 15:30:46 +02:00
x_clip CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
xglm Replaces calls to `.cuda` with `.to(torch_device)` in tests (#25571) 2023-08-18 12:40:40 +02:00
xlm CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
xlm_prophetnet
xlm_roberta Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
xlm_roberta_xl CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
xlnet Skip `test_contrastive_generate` for `TFXLNet` (#25574) 2023-08-17 18:56:34 +02:00
xmod CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
yolos Refactor image processor testers (#25450) 2023-08-11 11:30:18 +01:00
yoso CI with `num_hidden_layers=2` 🚀🚀🚀 (#25266) 2023-08-02 20:22:36 +02:00
__init__.py