burn/crates
Guillaume Lagrange aa79e36a8d
Add more quantization support for burn-jit (#2275)
* Add cubecl quantization kernels and QTensorOps for burn-jit

* Fix typo

* Fix output vec factor

* Fix output dtype size_of

* Remove unused code in dequantize test

* Fix dequantize vectorization

* Handle tensors when number of elems is not a multiple of 4

* Support quantize for tensors with less than 4 elems (no vectorization)

* Fix equal 0 test

* Add quantize/dequantize tests

* Add q_to_device

* Refactor kernels for latest cubecl

* intermediate i32 cast

* Fix size_of output type

* Use strict=false to ignore floating point precision issues with qparams equality

* Only check that lhs & rhs strategies match (but not strict on qparams values)

* Use assert_approx_eq on dequant values

* Reduce precision for flaky test

* Remove todo comment

* Add comment for cast to unsigned

* More comment

---------

Co-authored-by: louisfd <louisfd94@gmail.com>
2024-09-17 10:08:20 -04:00

| Name | Last commit | Date |
|---|---|---|
| burn | Fix CI (#2268) | 2024-09-10 12:13:48 -04:00 |
| burn-autodiff | Refactor burn-tensor: Split conv backward ops to allow conditional gradient computation (#2278) | 2024-09-16 10:15:27 -04:00 |
| burn-candle | Add ops w/ default implementation for `QTensorOps` (#2125) | 2024-09-09 12:21:47 -04:00 |
| burn-common | Make compatible with thumbv6m-none-eabi + add raspberry pi pico example (#2096) | 2024-08-23 07:39:39 -04:00 |
| burn-core | Remove copy restriction for const generic modules (#2222) | 2024-09-03 09:39:12 -04:00 |
| burn-cuda | Bump burn version to 0.15.0 | 2024-08-27 15:13:40 -04:00 |
| burn-dataset | Fix compile for dataset crate with vision feature (#2228) | 2024-09-01 17:03:37 -04:00 |
| burn-derive | Fix module derive with generics (#2127) | 2024-08-08 16:24:51 -04:00 |
| burn-fusion | Fix comments | 2024-09-16 09:10:34 -04:00 |
| burn-import | simplify scope tracking in burn-import (#2207) | 2024-09-09 12:19:26 -04:00 |
| burn-jit | Add more quantization support for burn-jit (#2275) | 2024-09-17 10:08:20 -04:00 |
| burn-ndarray | Change ndarray mask_where implementation to correctly deal with NaNs (#2272) | 2024-09-13 15:16:39 -04:00 |
| burn-no-std-tests | Refactor xtask to use tracel-xtask and refactor CI workflow (#2063) | 2024-08-28 15:57:13 -04:00 |
| burn-tch | Add ops w/ default implementation for `QTensorOps` (#2125) | 2024-09-09 12:21:47 -04:00 |
| burn-tensor | Add more quantization support for burn-jit (#2275) | 2024-09-17 10:08:20 -04:00 |
| burn-tensor-testgen | Fix Cargo.toml repository links (#1749) | 2024-05-09 15:40:05 -04:00 |
| burn-train | Fix panic messages being invisible in tui mode (#2226) | 2024-09-06 16:22:00 -04:00 |
| burn-wgpu | Bump burn version to 0.15.0 | 2024-08-27 15:13:40 -04:00 |
| onnx-ir | Perf/slice (#2252) | 2024-09-09 11:08:39 -04:00 |