transformers/tests/models/mamba
Anton Vlasjuk cefb819f7a
Mamba `slow_forward` gradient fix (#29563)
* FIX: Cached slow forward in mamba
- additionally added mamba cached test
- added unused test (mamba causal lm forward and backward)
- fixed typo: "causl" --> "causal"

* formatting

* fix: use real `slow_forward` call instead of torch module's

* add shape assertion for mixer block test

* adjust shape assertion
2024-03-27 04:52:12 +01:00
..
__init__.py [`Add Mamba`] Adds support for the `Mamba` models (#28094) 2024-03-05 20:01:06 +09:00
test_modeling_mamba.py Mamba `slow_forward` gradient fix (#29563) 2024-03-27 04:52:12 +01:00