transformers/tests/models/llama
Poedator a0779b9e19
Llama: fix custom 4D masks, v2 (#30348)
* 4D mask fixes
* Update custom 4D mask logic
* test moved to mixin
* extra tests for 4D masks
* update 4D mask and StaticCache handling
* added Mask4DTestHard to Mistral tests
* post-rebase fixes
* test fixes for StaticCache
* make fix-copies
* update 1 after #30476
* fix common tests
* remove `elif attention_mask.dim() == 4:` branch
* tests combined, fixed, Mixtral supported
* BigBird style change reverted
* remove `if attention_mask.dim() == 2` branch
* modeling_llama formatting change

---------

Co-authored-by: Joao Gante <joao@huggingface.co>
2024-05-13 13:46:06 +02:00
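
For context, the "custom 4D mask" feature referenced above lets callers hand the model an attention mask of shape (batch, 1, query_len, kv_len) directly, instead of the usual 2D padding mask that gets expanded to a causal mask internally. Below is a minimal sketch of building and passing such a mask, assuming the inverted/additive convention enforced around this release (0.0 = attend, dtype minimum = blocked, so the mask's max must be 0); the checkpoint name is only illustrative:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # illustrative; any Llama checkpoint works
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)

inputs = tokenizer("The quick brown fox", return_tensors="pt")
input_ids = inputs.input_ids
batch, seq_len = input_ids.shape

# Start from a plain causal pattern: 1 = attend, 0 = blocked.
causal = torch.tril(torch.ones(seq_len, seq_len))
# Convert to the inverted/additive 4D form: 0.0 where attention is
# allowed, the dtype minimum where it is blocked.
min_dtype = torch.finfo(model.dtype).min
mask_4d = ((1.0 - causal) * min_dtype)[None, None, :, :]
mask_4d = mask_4d.to(model.dtype).expand(batch, 1, seq_len, seq_len)

with torch.no_grad():
    logits = model(input_ids=input_ids, attention_mask=mask_4d).logits

In the Mask4DTestHard tests this shape is used to pack several sequences sharing a common prefix into one row, in which case matching position_ids must also be supplied.
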
__init__.py LLaMA Implementation (#21955) 2023-03-16 09:00:53 -04:00
test_modeling_flax_llama.py Add Llama Flax Implementation (#24587) 2023-12-07 07:05:00 +01:00
test_modeling_llama.py Llama: fix custom 4D masks, v2 (#30348) 2024-05-13 13:46:06 +02:00
test_tokenization_llama.py [`LlamaTokenizerFast`] Refactor default llama (#28881) 2024-04-23 23:12:59 +02:00
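
Since test_modeling_llama.py also covers the StaticCache interaction mentioned in the commit, here is a hedged sketch of driving that path through generate(), assuming the cache_implementation="static" switch available in transformers releases of this era:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # illustrative checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)

inputs = tokenizer("Hello", return_tensors="pt")
# StaticCache pre-allocates the key/value tensors to a fixed length;
# the commit's mask handling must slice any custom 4D mask against that
# pre-allocated length rather than the current sequence length.
out = model.generate(**inputs, max_new_tokens=16, cache_implementation="static")
print(tokenizer.decode(out[0], skip_special_tokens=True))
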