Version 2.9 of 🤗 Transformers introduces a new [`Trainer`](https://github.com/huggingface/transformers/blob/master/src/transformers/trainer.py) class for PyTorch, and its equivalent [`TFTrainer`](https://github.com/huggingface/transformers/blob/master/src/transformers/trainer_tf.py) for TF 2.
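For a quick feel for the API, here is a minimal sketch of a `Trainer` training loop. The `model`, `train_dataset`, and `eval_dataset` objects are placeholders you would build for your own task:

```python
from transformers import Trainer, TrainingArguments

training_args = TrainingArguments(
    output_dir="./models/demo",  # where checkpoints and outputs are written
    num_train_epochs=1,
    save_steps=20000,
)

trainer = Trainer(
    model=model,                  # any pretrained PyTorch model from the library
    args=training_args,
    train_dataset=train_dataset,  # a torch.utils.data.Dataset of encoded examples
    eval_dataset=eval_dataset,
)

trainer.train()     # runs the training loop
trainer.evaluate()  # computes evaluation metrics on eval_dataset
```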
| Task | Example datasets | Trainer support | TFTrainer support | pytorch-lightning | Colab |
|---|---|:---:|:---:|:---:|:---:|
| [**`language-modeling`**](https://github.com/huggingface/transformers/tree/master/examples/language-modeling) | Raw text | ✅ | - | - | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/01_how_to_train.ipynb) |
To make sure you can successfully run the latest versions of the example scripts, you have to install the library from source and install the example-specific requirements.
When using TensorFlow, TPUs are supported out of the box as a `tf.distribute.Strategy`.
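As a sketch of what this looks like in practice (the exact strategy class and the TPU name/address depend on your TensorFlow version and environment, so treat the details below as assumptions):

```python
import tensorflow as tf

# Resolve and initialize the TPU system. With no arguments, the resolver
# reads the TPU address from the environment; pass an explicit name or
# grpc:// address if your setup requires it.
resolver = tf.distribute.cluster_resolver.TPUClusterResolver()
tf.config.experimental_connect_to_cluster(resolver)
tf.tpu.experimental.initialize_tpu_system(resolver)

strategy = tf.distribute.experimental.TPUStrategy(resolver)

with strategy.scope():
    # Build and compile your model here; variables created inside this
    # scope are replicated across the TPU cores.
    ...
```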
When using PyTorch, we support TPUs thanks to `pytorch/xla`. For more context and information on how to set up your TPU environment, refer to Google's documentation and to the
very detailed [pytorch/xla README](https://github.com/pytorch/xla/blob/master/README.md).
In this repo, we provide a very simple launcher script named [xla_spawn.py](https://github.com/huggingface/transformers/tree/master/examples/xla_spawn.py) that lets you run our example scripts on multiple TPU cores without any boilerplate.
Just pass a `--num_cores` flag to this script, followed by your regular training script with its arguments (this is similar to the `torch.distributed.launch` helper for `torch.distributed`).
For example, for `run_glue`:
```bash
python examples/xla_spawn.py --num_cores 8 \
examples/text-classification/run_glue.py \
--model_name_or_path bert-base-cased \
--task_name mnli \
--data_dir ./data/glue_data/MNLI \
--output_dir ./models/tpu \
--overwrite_output_dir \
--do_train \
--do_eval \
--num_train_epochs 1 \
--save_steps 20000
```
Feedback, further use cases, and benchmarks involving TPUs are welcome; please share with the community.
To use Weights & Biases, install the wandb package with:
```bash
pip install wandb
```
Then log in on the command line:
```bash
wandb login
```
If you are in Jupyter or Colab, you should log in with:
```python
import wandb
wandb.login()
```
Whenever you use the `Trainer` or `TFTrainer` classes, your losses, evaluation metrics, model topology, and gradients (for `Trainer` only) will automatically be logged.
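A couple of environment variables influence the integration; as an assumption about the current behavior, `WANDB_PROJECT` picks the W&B project the run is logged to (the integration falls back to a default if it is unset) and `WANDB_WATCH` controls gradient/parameter logging:

```python
import os

# Assumed knobs for the built-in W&B integration; set them before
# creating the Trainer.
os.environ["WANDB_PROJECT"] = "my-transformers-runs"  # hypothetical project name
os.environ["WANDB_WATCH"] = "gradients"               # e.g. "gradients" or "all"

# ... build and run your Trainer as usual; metrics are logged to this project.
```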
When using 🤗 Transformers with PyTorch Lightning, runs can be tracked through `WandbLogger`. Refer to the related [documentation & examples](https://docs.wandb.com/library/integrations/lightning).
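A minimal sketch, assuming a `LightningModule` named `MyTransformerModule` that wraps a 🤗 Transformers model:

```python
from pytorch_lightning import Trainer
from pytorch_lightning.loggers import WandbLogger

# Hypothetical project name; MyTransformerModule is a placeholder
# LightningModule wrapping a 🤗 Transformers model.
wandb_logger = WandbLogger(project="transformers-lightning")

trainer = Trainer(logger=wandb_logger, max_epochs=1)
trainer.fit(MyTransformerModule())
```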