burn/crates/burn-candle
Guillaume Lagrange eb899db16c
Add ops w/ default implementation for `QTensorOps` (#2125)
* Add q_* ops to match float ops

* Refactor q_* ops w/ dequant_op_quant macro

* Comparison ops are already implemented by default to compare dequantized values

* Add default arg min/max implementation and fix tch implementation

* Avoid division by zero scale

* Add default q_gather implementation (tch does not support on quantized tensor)

* Add warning instead for tch quantize_dynamic

* Call chunk backend implementation

* Add QFloat check for q_ ops

* Add tch q_min/max_dim_with_indices

* Add q_ ops tests

* Clippy fix

* Remove dead code/comments

* Fix quantization tests precision

* Set higher tolerance for ndarray backend

* Remove comment
2024-09-09 12:21:47 -04:00
..
src Add ops w/ default implementation for `QTensorOps` (#2125) 2024-09-09 12:21:47 -04:00
Cargo.toml Bump burn version to 0.15.0 2024-08-27 15:13:40 -04:00
LICENSE-APACHE Update licenses symlinks (#1613) 2024-04-12 14:43:58 -04:00
LICENSE-MIT Update licenses symlinks (#1613) 2024-04-12 14:43:58 -04:00
README.md [refactor] Move burn crates to their own crates directory (#1336) 2024-02-20 13:57:55 -05:00

README.md

Burn Candle Backend

This crate provides a backend for Burn based on the Candle framework.

It is still in alpha stage, not all operations are supported. It is usable for some use cases, like for inference.

It can be used with CPU or CUDA. On macOS computations can be accelerated by using the Accelerate framework.

Feature Flags

The following features are supported:

  • cuda - Cuda GPU device (NVIDIA only)
  • accelerate - Accelerate framework (macOS only)