transformers/examples/README.md

# Examples

Version 2.9 of 🤗 Transformers introduced a new [`Trainer`](https://github.com/huggingface/transformers/blob/master/src/transformers/trainer.py) class for PyTorch, and its equivalent [`TFTrainer`](https://github.com/huggingface/transformers/blob/master/src/transformers/trainer_tf.py) for TF 2.
Running the examples requires PyTorch 1.3.1+ or TensorFlow 2.2+.

Here is the list of all our examples:
- **grouped by task** (all official examples work for multiple models)
- with information on whether they are **built on top of `Trainer`/`TFTrainer`** (if not, they still work, they might
  just lack some features),
- whether or not they leverage the [🤗 Datasets](https://github.com/huggingface/datasets) library.
- links to **Colab notebooks** to walk through the scripts and run them easily,
- links to **Cloud deployments** to be able to deploy large-scale trainings in the Cloud with little to no setup.


## Important note

**Important**

To make sure you can successfully run the latest versions of the example scripts, you have to **install the library from source** and install some example-specific requirements.
Execute the following steps in a new virtual environment:

```bash
git clone https://github.com/huggingface/transformers
cd transformers
pip install .
pip install -r ./examples/requirements.txt
```

Alternatively, you can run the version of the examples as they were for your current version of Transformers via (for instance with v3.4.0):
```bash
git checkout tags/v3.4.0
```

## The Big Table of Tasks

| Task | Example datasets | Trainer support | TFTrainer support | 🤗 Datasets | Colab
|---|---|:---:|:---:|:---:|:---:|
| [**`language-modeling`**](https://github.com/huggingface/transformers/tree/master/examples/language-modeling)       | Raw text        | ✅ | -  | ✅ | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/01_how_to_train.ipynb)
| [**`text-classification`**](https://github.com/huggingface/transformers/tree/master/examples/text-classification)   | GLUE, XNLI      | ✅ | ✅ | ✅ | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://github.com/huggingface/notebooks/blob/master/examples/text_classification.ipynb)
| [**`token-classification`**](https://github.com/huggingface/transformers/tree/master/examples/token-classification) | CoNLL NER       | ✅ | ✅ | ✅ | -
| [**`multiple-choice`**](https://github.com/huggingface/transformers/tree/master/examples/multiple-choice)           | SWAG, RACE, ARC | ✅ | ✅ | - | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/ViktorAlm/notebooks/blob/master/MPC_GPU_Demo_for_TF_and_PT.ipynb)
| [**`question-answering`**](https://github.com/huggingface/transformers/tree/master/examples/question-answering)     | SQuAD           | ✅ | ✅ | - | -
| [**`text-generation`**](https://github.com/huggingface/transformers/tree/master/examples/text-generation)           | -               | n/a | n/a | - | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/02_how_to_generate.ipynb)
| [**`distillation`**](https://github.com/huggingface/transformers/tree/master/examples/distillation)                 | All             | - | -  | - | -
| [**`summarization`**](https://github.com/huggingface/transformers/tree/master/examples/seq2seq)                     | CNN/Daily Mail  | ✅  | - | - | -
| [**`translation`**](https://github.com/huggingface/transformers/tree/master/examples/seq2seq)                       | WMT             | ✅  | - | - | -
| [**`bertology`**](https://github.com/huggingface/transformers/tree/master/examples/bertology)                       | -               | - | - | - | -
| [**`adversarial`**](https://github.com/huggingface/transformers/tree/master/examples/adversarial)                   | HANS            | ✅ | - | - | -


<br>

## One-click Deploy to Cloud (wip)

**Coming soon!**

## Running on TPUs

When using Tensorflow, TPUs are supported out of the box as a `tf.distribute.Strategy`.

When using PyTorch, we support TPUs thanks to `pytorch/xla`. For more context and information on how to setup your TPU environment refer to Google's documentation and to the
very detailed [pytorch/xla README](https://github.com/pytorch/xla/blob/master/README.md).

In this repo, we provide a very simple launcher script named [xla_spawn.py](https://github.com/huggingface/transformers/tree/master/examples/xla_spawn.py) that lets you run our example scripts on multiple TPU cores without any boilerplate.
Just pass a `--num_cores` flag to this script, then your regular training script with its arguments (this is similar to the `torch.distributed.launch` helper for torch.distributed). 
Note that this approach does not work for examples that use `pytorch-lightning`.

For example for `run_glue`:

```bash
python examples/xla_spawn.py --num_cores 8 \
	examples/text-classification/run_glue.py \
	--model_name_or_path bert-base-cased \
	--task_name mnli \
	--data_dir ./data/glue_data/MNLI \
	--output_dir ./models/tpu \
	--overwrite_output_dir \
	--do_train \
	--do_eval \
	--num_train_epochs 1 \
	--save_steps 20000
```

Feedback and more use cases and benchmarks involving TPUs are welcome, please share with the community.

## Logging & Experiment tracking

You can easily log and monitor your runs code. The following are currently supported:

* [TensorBoard](https://www.tensorflow.org/tensorboard)
* [Weights & Biases](https://docs.wandb.com/library/integrations/huggingface)
* [Comet ML](https://www.comet.ml/docs/python-sdk/huggingface/)

### Weights & Biases

To use Weights & Biases, install the wandb package with:

```bash
pip install wandb
```

Then log in the command line:

```bash
wandb login
```

If you are in Jupyter or Colab, you should login with:

```python
import wandb
wandb.login()
```

Whenever you use `Trainer` or `TFTrainer` classes, your losses, evaluation metrics, model topology and gradients (for `Trainer` only) will automatically be logged.

When using 🤗 Transformers with PyTorch Lightning, runs can be tracked through `WandbLogger`. Refer to related [documentation & examples](https://docs.wandb.com/library/integrations/lightning).

### Comet.ml

To use `comet_ml`, install the Python package with:

```bash
pip install comet_ml
```

or if in a Conda environment:

```bash
conda install -c comet_ml -c anaconda -c conda-forge comet_ml
```
Fix examples titles and optimization doc page (#5408) 2020-07-01 20:11:25 +08:00			`# Examples`
Better examples 2019-09-07 00:00:12 +08:00
Move installation instructions to the top (#8106) 2020-10-28 05:32:20 +08:00			Version 2.9 of 🤗 Transformers introduced a new [`Trainer`](https://github.com/huggingface/transformers/blob/master/src/transformers/trainer.py) class for PyTorch, and its equivalent [`TFTrainer`](https://github.com/huggingface/transformers/blob/master/src/transformers/trainer_tf.py) for TF 2.
Rework TF trainer (#6038) * Fully rework training/prediction loops * fix method name * Fix variable name * Fix property name * Fix scope * Fix method name * Fix tuple index * Fix tuple index * Fix indentation * Fix variable name * fix eval before log * Add drop remainder for test dataset * Fix step number + fix logging datetime * fix eval loss value * use global step instead of step + fix logging at step 0 * Fix logging datetime * Fix global_step usage * Fix breaking loop + logging datetime * Fix step in prediction loop * Fix step breaking * Fix train/test loops * Force TF at least 2.2 for the trainer * Use assert_cardinality to facilitate the dataset size computation * Log steps per epoch * Make tfds compliant with TPU * Make tfds compliant with TPU * Use TF dataset enumerate instead of the Python one * revert previous commit * Fix data_dir * Apply style * rebase on master * Address Sylvain's comments * Address Sylvain's and Lysandre comments * Trigger CI * Remove unused import 2020-07-30 02:32:01 +08:00			`Running the examples requires PyTorch 1.3.1+ or TensorFlow 2.2+.`
Examples readme.md (#4215) * README * Update README.md 2020-05-08 03:00:06 +08:00
			`Here is the list of all our examples:`
			`- grouped by task (all official examples work for multiple models)`
Finalize lm examples (#8188) * Finish the cleanup of the language-modeling examples * Update main README * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Propagate changes Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> 2020-10-31 02:20:18 +08:00			- with information on whether they are built on top of `Trainer`/`TFTrainer` (if not, they still work, they might
			`just lack some features),`
			`- whether or not they leverage the [🤗 Datasets](https://github.com/huggingface/datasets) library.`
Examples readme.md (#4215) * README * Update README.md 2020-05-08 03:00:06 +08:00			`- links to Colab notebooks to walk through the scripts and run them easily,`
			`- links to Cloud deployments to be able to deploy large-scale trainings in the Cloud with little to no setup.`


			`## Important note`
Better examples 2019-09-07 00:00:12 +08:00
Support for torch-lightning in NER examples (#2890) * initial pytorch lightning commit * tested multigpu * Fix learning rate schedule * black formatting * fix flake8 * isort * isort * . Co-authored-by: Check your git settings! <chris@chris-laptop> 2020-02-21 00:50:05 +08:00			`Important`
Move installation instructions to the top (#8106) 2020-10-28 05:32:20 +08:00
			`To make sure you can successfully run the latest versions of the example scripts, you have to install the library from source and install some example-specific requirements.`
fix #1450 - add doc 2019-12-05 18:26:55 +08:00			`Execute the following steps in a new virtual environment:`
update the documentation 2019-11-21 01:13:38 +08:00
			```bash
Uniformize #1952 2019-11-28 00:05:18 +08:00			`git clone https://github.com/huggingface/transformers`
update the documentation 2019-11-21 01:13:38 +08:00			`cd transformers`
Remove [--editable] in install instructions. Use -e only in docs targeted at contributors. If a user copy-pastes command line with [--editable], they will hit an error. If they don't know the --editable option, we're giving them a choice to make before they can move forwards, but this isn't a choice they need to make right now. 2019-12-24 15:46:08 +08:00			`pip install .`
fix #1450 - add doc 2019-12-05 18:26:55 +08:00			`pip install -r ./examples/requirements.txt`
update the documentation 2019-11-21 01:13:38 +08:00			```

Move installation instructions to the top (#8106) 2020-10-28 05:32:20 +08:00			`Alternatively, you can run the version of the examples as they were for your current version of Transformers via (for instance with v3.4.0):`
			```bash
			`git checkout tags/v3.4.0`
			```

			`## The Big Table of Tasks`

Finalize lm examples (#8188) * Finish the cleanup of the language-modeling examples * Update main README * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Propagate changes Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> 2020-10-31 02:20:18 +08:00			`\| Task \| Example datasets \| Trainer support \| TFTrainer support \| 🤗 Datasets \| Colab`
			`\|---\|---\|:---:\|:---:\|:---:\|:---:\|`
			\| [`language-modeling`](https://github.com/huggingface/transformers/tree/master/examples/language-modeling) \| Raw text \| ✅ \| - \| ✅ \| [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/01_how_to_train.ipynb)
			\| [`text-classification`](https://github.com/huggingface/transformers/tree/master/examples/text-classification) \| GLUE, XNLI \| ✅ \| ✅ \| ✅ \| [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://github.com/huggingface/notebooks/blob/master/examples/text_classification.ipynb)
Add new token classification example (#8340) * Add new token classification example * Remove txt file * Add test * With actual testing done * Less warmup is better * Update examples/token-classification/run_ner_new.py Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Address review comments * Fix test * Make Lysandre happy * Last touches and rename * Rename in tests * Address review comments * More run_ner -> run_ner_old Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> 2020-11-10 00:39:55 +08:00			\| [`token-classification`](https://github.com/huggingface/transformers/tree/master/examples/token-classification) \| CoNLL NER \| ✅ \| ✅ \| ✅ \| -
Finalize lm examples (#8188) * Finish the cleanup of the language-modeling examples * Update main README * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Propagate changes Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> 2020-10-31 02:20:18 +08:00			\| [`multiple-choice`](https://github.com/huggingface/transformers/tree/master/examples/multiple-choice) \| SWAG, RACE, ARC \| ✅ \| ✅ \| - \| [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/ViktorAlm/notebooks/blob/master/MPC_GPU_Demo_for_TF_and_PT.ipynb)
			\| [`question-answering`](https://github.com/huggingface/transformers/tree/master/examples/question-answering) \| SQuAD \| ✅ \| ✅ \| - \| -
			\| [`text-generation`](https://github.com/huggingface/transformers/tree/master/examples/text-generation) \| - \| n/a \| n/a \| - \| [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/02_how_to_generate.ipynb)
			\| [`distillation`](https://github.com/huggingface/transformers/tree/master/examples/distillation) \| All \| - \| - \| - \| -
			\| [`summarization`](https://github.com/huggingface/transformers/tree/master/examples/seq2seq) \| CNN/Daily Mail \| ✅ \| - \| - \| -
			\| [`translation`](https://github.com/huggingface/transformers/tree/master/examples/seq2seq) \| WMT \| ✅ \| - \| - \| -
			\| [`bertology`](https://github.com/huggingface/transformers/tree/master/examples/bertology) \| - \| - \| - \| - \| -
			\| [`adversarial`](https://github.com/huggingface/transformers/tree/master/examples/adversarial) \| HANS \| ✅ \| - \| - \| -
Move installation instructions to the top (#8106) 2020-10-28 05:32:20 +08:00

			`<br>`

[examples] Streamline doc 2020-05-15 08:34:31 +08:00			`## One-click Deploy to Cloud (wip)`

[doc] rm Azure buttons as not implemented yet 2020-10-01 05:31:08 +08:00			`Coming soon!`
[examples] Streamline doc 2020-05-15 08:34:31 +08:00
Examples readme.md (#4215) * README * Update README.md 2020-05-08 03:00:06 +08:00			`## Running on TPUs`
Table of contents 2019-09-07 00:08:36 +08:00
[TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223) * [TPU] Doc, fix xla_spawn.py, only preprocess dataset once * Update examples/README.md * [xla_spawn] Add `_mp_fn` to other Trainer scripts * [TPU] Fix: eval dataloader was None 2020-05-09 02:10:05 +08:00			When using Tensorflow, TPUs are supported out of the box as a `tf.distribute.Strategy`.

			When using PyTorch, we support TPUs thanks to `pytorch/xla`. For more context and information on how to setup your TPU environment refer to Google's documentation and to the
			`very detailed [pytorch/xla README](https://github.com/pytorch/xla/blob/master/README.md).`

per_device instead of per_gpu/error thrown when argument unknown (#4618) * per_device instead of per_gpu/error thrown when argument unknown * [docs] Restore examples.md symlink * Correct absolute links so that symlink to the doc works correctly * Update src/transformers/hf_argparser.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Warning + reorder * Docs * Style * not for squad Co-authored-by: Julien Chaumond <chaumond@gmail.com> 2020-05-27 23:36:55 +08:00			`In this repo, we provide a very simple launcher script named [xla_spawn.py](https://github.com/huggingface/transformers/tree/master/examples/xla_spawn.py) that lets you run our example scripts on multiple TPU cores without any boilerplate.`
examples/docs: caveat that PL examples don't work on TPU (#8309) 2020-11-09 21:55:22 +08:00			Just pass a `--num_cores` flag to this script, then your regular training script with its arguments (this is similar to the `torch.distributed.launch` helper for torch.distributed).
			Note that this approach does not work for examples that use `pytorch-lightning`.
[TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223) * [TPU] Doc, fix xla_spawn.py, only preprocess dataset once * Update examples/README.md * [xla_spawn] Add `_mp_fn` to other Trainer scripts * [TPU] Fix: eval dataloader was None 2020-05-09 02:10:05 +08:00
			For example for `run_glue`:

			```bash
			`python examples/xla_spawn.py --num_cores 8 \`
Corrected typo in readme (#8320) 2020-11-05 20:48:36 +08:00			`examples/text-classification/run_glue.py \`
[TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223) * [TPU] Doc, fix xla_spawn.py, only preprocess dataset once * Update examples/README.md * [xla_spawn] Add `_mp_fn` to other Trainer scripts * [TPU] Fix: eval dataloader was None 2020-05-09 02:10:05 +08:00			`--model_name_or_path bert-base-cased \`
			`--task_name mnli \`
			`--data_dir ./data/glue_data/MNLI \`
			`--output_dir ./models/tpu \`
			`--overwrite_output_dir \`
			`--do_train \`
			`--do_eval \`
			`--num_train_epochs 1 \`
			`--save_steps 20000`
			```

			`Feedback and more use cases and benchmarks involving TPUs are welcome, please share with the community.`
docs(wandb): explain how to use W&B integration (#5607) * docs(wandb): explain how to use W&B integration fix #5262 * Also mention TensorBoard Co-authored-by: Julien Chaumond <chaumond@gmail.com> 2020-07-14 17:12:33 +08:00
			`## Logging & Experiment tracking`

Adds comet_ml to the list of auto-experiment loggers (#6176) * Support for Comet.ml * Need to import comet first * Log this model, not the one in the backprop step * Log args as hyperparameters; use framework to allow fine control * Log hyperparameters with context * Apply black formatting * isort fix integrations * isort fix __init__ * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer_tf.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Address review comments * Style + Quality, remove Tensorboard import test Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> 2020-08-06 23:31:30 +08:00			`You can easily log and monitor your runs code. The following are currently supported:`

			`* [TensorBoard](https://www.tensorflow.org/tensorboard)`
			`* [Weights & Biases](https://docs.wandb.com/library/integrations/huggingface)`
			`* [Comet ML](https://www.comet.ml/docs/python-sdk/huggingface/)`

			`### Weights & Biases`
docs(wandb): explain how to use W&B integration (#5607) * docs(wandb): explain how to use W&B integration fix #5262 * Also mention TensorBoard Co-authored-by: Julien Chaumond <chaumond@gmail.com> 2020-07-14 17:12:33 +08:00
			`To use Weights & Biases, install the wandb package with:`

			```bash
			`pip install wandb`
			```

			`Then log in the command line:`

			```bash
			`wandb login`
			```

			`If you are in Jupyter or Colab, you should login with:`

			```python
			`import wandb`
			`wandb.login()`
			```

			Whenever you use `Trainer` or `TFTrainer` classes, your losses, evaluation metrics, model topology and gradients (for `Trainer` only) will automatically be logged.

correct pl link in readme (#6364) 2020-08-10 15:08:46 +08:00			When using 🤗 Transformers with PyTorch Lightning, runs can be tracked through `WandbLogger`. Refer to related [documentation & examples](https://docs.wandb.com/library/integrations/lightning).
Adds comet_ml to the list of auto-experiment loggers (#6176) * Support for Comet.ml * Need to import comet first * Log this model, not the one in the backprop step * Log args as hyperparameters; use framework to allow fine control * Log hyperparameters with context * Apply black formatting * isort fix integrations * isort fix __init__ * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer_tf.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Address review comments * Style + Quality, remove Tensorboard import test Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> 2020-08-06 23:31:30 +08:00
			`### Comet.ml`

			To use `comet_ml`, install the Python package with:

			```bash
			`pip install comet_ml`
			```

			`or if in a Conda environment:`

			```bash
			`conda install -c comet_ml -c anaconda -c conda-forge comet_ml`
			```