<div align="center">
<img src="https://raw.githubusercontent.com/burn-rs/burn/main/assets/logo-burn-full.png" width="200px"/>
[![Current Crates.io Version](https://img.shields.io/crates/v/burn.svg)](https://crates.io/crates/burn)
[![Test Status](https://github.com/burn-rs/burn/actions/workflows/test.yml/badge.svg)](https://github.com/burn-rs/burn/actions/workflows/test.yml)
[![Documentation](https://docs.rs/burn/badge.svg)](https://docs.rs/burn)
[![Rust Version](https://img.shields.io/badge/Rust-1.65.0-blue)](https://releases.rs/docs/released/1.65.0)
[![license](https://shields.io/badge/license-MIT%2FApache--2.0-blue)](https://github.com/burn-rs/burn/blob/master/LICENSE)
> This library aims to be a complete deep learning framework with extreme flexibility, written in Rust.
> The goal is to satisfy researchers as well as practitioners, making it easier to experiment with, train, and deploy your models.
<div align="left">
__Sections__
* [Features](#features)
* [Get Started](#get-started)
* [Examples](#examples)
* [Components](#components)
* [Backend](#backend)
* [Tensor](#tensor)
* [Module](#module)
* [Config](#config)
* [Learner](#learner)
* [License](#license)
## Features
* Flexible and intuitive custom neural network [module](#module) 🔥
* [Training](#learner) with full support for `metric`, `logging` and `checkpointing` 📈
* [Tensor](#tensor) crate with pluggable backends 🔧
* [Tch](https://github.com/burn-rs/burn/tree/main/burn-tch) backend with CPU/GPU support 🚀
* [NdArray](https://github.com/burn-rs/burn/tree/main/burn-ndarray) backend with fast compile time 👌
* [Autodiff](https://github.com/burn-rs/burn/tree/main/burn-autodiff) backend making any backend differentiable 🌟
* [Dataset](https://github.com/burn-rs/burn/tree/main/burn-dataset) crate with multiple utilities and sources 📚 (see the sketch right after this list)
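
As a quick taste of the dataset crate, here is a minimal sketch; `InMemDataset` and the `Dataset` trait methods shown are assumptions about the `burn-dataset` API, so check the crate documentation for the exact names:

```rust
use burn_dataset::{Dataset, InMemDataset};

fn main() {
    // A small in-memory dataset; the crate also provides transforms
    // and external sources (assumed API).
    let dataset = InMemDataset::new(vec![1.0_f32, 2.0, 3.0]);

    println!("{}", dataset.len()); // 3
    println!("{:?}", dataset.get(1)); // Some(2.0)
}
```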
## Get Started
The best way to get started with `burn` is to clone the repo and play with the [examples](#examples).
It may also be a good idea to take a look at the main [components](#components) of `burn` to get a quick overview of its fundamental building blocks.
### Examples
* [MNIST](https://github.com/burn-rs/burn/tree/main/examples/mnist): train a model on CPU/GPU using different backends.
* [Text Classification](https://github.com/burn-rs/burn/tree/main/examples/text-classification): train a transformer encoder from scratch on GPU.
### Components
Knowing the main components will be of great help when you start playing with `burn`.
#### Backend
Almost everything is based on the `Backend` trait, which allows you to run tensor operations with different implementations without having to change your code.
A backend does not necessarily have autodiff capabilities; the `ADBackend` trait is there to specify when autodiff is required.
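
In practice, switching implementations comes down to changing a single type alias, with all tensor code left untouched. A minimal sketch, reusing the `random`/`matmul` API from the tensor example below; calling `shape()` on the result is an assumption about the tensor API:

```rust
use burn::tensor::{Distribution, Tensor};
use burn_ndarray::NdArrayBackend;

// Swapping this alias (e.g. to `burn_tch::TchBackend<f32>`) is the only
// change needed to run the same code on another backend.
type MyBackend = NdArrayBackend<f32>;

fn main() {
    let x = Tensor::<MyBackend, 2>::random([2, 2], Distribution::Standard);
    let y = Tensor::<MyBackend, 2>::random([2, 2], Distribution::Standard);
    let z = x.matmul(&y);

    println!("{:?}", z.shape()); // Assumed accessor for the tensor's shape.
}
```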
#### Tensor
The `Tensor` struct is at the core of the `burn` framework.
It takes two generic parameters, the `Backend` and the number of dimensions `D`.
Backpropagation is also supported on any backend by making it auto-differentiable using a simple decorator.
```rust
use burn::tensor::backend::{ADBackend, Backend};
use burn::tensor::{Distribution, Tensor};
use burn_autodiff::ADBackendDecorator;
use burn_ndarray::NdArrayBackend;
use burn_tch::TchBackend;

fn simple_function<B: Backend>() -> Tensor<B, 2> {
    let x = Tensor::<B, 2>::random([3, 3], Distribution::Standard);
    let y = Tensor::<B, 2>::random([3, 3], Distribution::Standard);

    x.matmul(&y)
}

fn simple_function_grads<B: ADBackend>() -> B::Gradients {
    let z = simple_function::<B>();

    z.backward()
}

fn main() {
    let _z = simple_function::<NdArrayBackend<f32>>(); // Compiles
    let _z = simple_function::<TchBackend<f32>>(); // Compiles

    // The following two lines are left commented out because they don't
    // compile: plain backends have no gradients.
    // let _grads = simple_function_grads::<NdArrayBackend<f32>>();
    // let _grads = simple_function_grads::<TchBackend<f32>>();

    type ADNdArrayBackend = ADBackendDecorator<NdArrayBackend<f32>>;
    type ADTchBackend = ADBackendDecorator<TchBackend<f32>>;

    let _grads = simple_function_grads::<ADNdArrayBackend>(); // Compiles
    let _grads = simple_function_grads::<ADTchBackend>(); // Compiles
}
```
#### Module
The `Module` derive lets you create your own neural network modules, similar to PyTorch.
```rust
use burn::nn;
use burn::module::{Param, Module};
use burn::tensor::backend::Backend;

#[derive(Module, Debug)]
struct MyModule<B: Backend> {
    my_param: Param<nn::Linear<B>>,
    repeat: usize,
}
```
Note that only the fields wrapped inside `Param` are updated during training; the other fields should implement `Clone`.
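
For illustration, a forward pass building on the `MyModule` struct defined above could look like the sketch below. This is an assumption-heavy example: it presumes that `Param` dereferences to the wrapped `nn::Linear` and that `Linear::forward` takes the input tensor by value, both of which may differ between versions, so treat it as a shape of the API rather than a definitive implementation.

```rust
use burn::tensor::backend::Backend;
use burn::tensor::Tensor;

impl<B: Backend> MyModule<B> {
    /// A hypothetical forward pass: apply the wrapped linear layer
    /// `repeat` times in a row.
    fn forward(&self, mut x: Tensor<B, 2>) -> Tensor<B, 2> {
        for _ in 0..self.repeat {
            // Assumes `Param<T>` dereferences to `T`, so the inner
            // `Linear::forward` can be called directly.
            x = self.my_param.forward(x);
        }
        x
    }
}
```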
#### Config
The `Config` derive lets you define serializable and deserializable configurations or hyper-parameters for your [modules](#module) or any other component.
```rust
use burn::config::Config;

#[derive(Config)]
struct MyConfig {
    #[config(default = 1.0e-6)]
    pub epsilon: f64,
    pub dim: usize,
}
```
The derive also adds useful methods to your config.
```rust
fn main() {
    let config = MyConfig::new(100);
    println!("{}", config.epsilon); // 1.0e-6
    println!("{}", config.dim); // 100

    let config = MyConfig::new(100).with_epsilon(1.0e-8);
    println!("{}", config.epsilon); // 1.0e-8
}
```
#### Learner
The `Learner` is the main `struct` that lets you train a neural network with support for `logging`, `metric`, `checkpointing`, and more.
To create a learner, you must use the `LearnerBuilder`.
```rust
use burn::train::LearnerBuilder;
use burn::train::metric::{AccuracyMetric, LossMetric};

fn main() {
    let dataloader_train = ...;
    let dataloader_valid = ...;

    let model = ...;
    let optim = ...;

    let learner = LearnerBuilder::new("/tmp/artifact_dir")
        .metric_train_plot(AccuracyMetric::new())
        .metric_valid_plot(AccuracyMetric::new())
        .metric_train(LossMetric::new())
        .metric_valid(LossMetric::new())
        .with_file_checkpointer::<f32>(2)
        .num_epochs(10)
        .build(model, optim);

    let _model_trained = learner.fit(dataloader_train, dataloader_valid);
}
```
See this [example](https://github.com/burn-rs/burn/tree/main/examples/mnist) for real-world usage.
## License
Burn is distributed under the terms of both the MIT license and the Apache License (Version 2.0).
See [LICENSE-APACHE](./LICENSE-APACHE) and [LICENSE-MIT](./LICENSE-MIT) for details.
Opening a pull request is assumed to signal agreement with these licensing terms.