Examples of using the CUTLASS Python interface
- 00_basic_gemm

  Shows how to declare, configure, compile, and run a CUTLASS GEMM using the Python interface.

- 01_epilogue

  Shows how to fuse elementwise activation functions to GEMMs via the Python interface.

- 02_pytorch_extension_grouped_gemm

  Shows how to declare, compile, and run a grouped GEMM operation via the Python interface, and how to export the emitted kernel to a PyTorch CUDA extension.

- 03_basic_conv2d

  Shows how to declare, configure, compile, and run a CUTLASS Conv2d using the Python interface.

- 04_epilogue_visitor

  Shows how to fuse elementwise activation functions to GEMMs via the Python Epilogue Visitor interface.
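
For orientation, the sketch below spells out in plain NumPy what the GEMM-related notebooks compute. It is not the CUTLASS Python interface and runs on the CPU only. The helper names (`reference_gemm`, `relu`, `reference_grouped_gemm`) are hypothetical and used purely for illustration. It assumes the conventional GEMM form `D = alpha * (A @ B) + beta * C`, with an optional elementwise activation standing in for a fused epilogue, and treats a grouped GEMM as a list of independent problems whose sizes may differ.

```python
import numpy as np


def reference_gemm(A, B, C, alpha=1.0, beta=0.0, activation=None):
    """CPU reference: D = activation(alpha * (A @ B) + beta * C).

    Hypothetical helper for illustration; not part of CUTLASS.
    """
    D = alpha * (A @ B) + beta * C
    # A fused epilogue applies the activation to the GEMM result
    # before it is written out; here it is just a function call.
    return activation(D) if activation is not None else D


def relu(x):
    """Elementwise ReLU, used as an example epilogue activation."""
    return np.maximum(x, 0)


def reference_grouped_gemm(As, Bs, Cs, alpha=1.0, beta=0.0):
    """Run one independent GEMM per (A, B, C) triple.

    Each problem in the group may have its own M, N, and K.
    """
    return [reference_gemm(A, B, C, alpha, beta) for A, B, C in zip(As, Bs, Cs)]


if __name__ == "__main__":
    A = np.array([[1.0, 2.0], [3.0, 4.0]])
    B = np.array([[1.0, 0.0], [0.0, -1.0]])
    C = np.ones((2, 2))

    # GEMM followed by an elementwise ReLU "epilogue".
    print(reference_gemm(A, B, C, alpha=1.0, beta=1.0, activation=relu))

    # Grouped GEMM over two problems of different shapes.
    As = [np.ones((2, 3)), np.ones((5, 1))]
    Bs = [np.ones((3, 4)), np.ones((1, 2))]
    Cs = [np.zeros((2, 4)), np.zeros((5, 2))]
    print([D.shape for D in reference_grouped_gemm(As, Bs, Cs)])
```

A reference like this can be handy for checking the output of a compiled kernel against a known-good result, within the tolerance appropriate for the data type in use.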