transformers/examples
Sam Shleifer a5638b2b3a
[s2s] dynamic batch size with --max_tokens_per_batch (#7030)
2020-09-17 15:19:34 -04:00
..
adversarial Black 20 release 2020-08-26 17:20:22 +02:00
benchmarking Black 20 release 2020-08-26 17:20:22 +02:00
bert-loses-patience [logging] remove no longer needed verbosity override (#7100) 2020-09-15 04:01:14 -04:00
bertology Black 20 release 2020-08-26 17:20:22 +02:00
contrib Transformer-XL: Remove unused parameters (#7087) 2020-09-17 06:10:34 -04:00
deebert [logging] remove no longer needed verbosity override (#7100) 2020-09-15 04:01:14 -04:00
distillation [logging] remove no longer needed verbosity override (#7100) 2020-09-15 04:01:14 -04:00
language-modeling Add cache_dir to save features TextDataset (#6879) 2020-09-01 11:42:17 -04:00
longform-qa Fix CI with change of name of nlp (#7054) 2020-09-10 14:51:08 -04:00
lxmert Demoing LXMERT with raw images by incorporating the FRCNN model for roi-pooled extraction and bounding-box predction on the GQA answer set. (#6986) 2020-09-14 10:07:04 -04:00
movement-pruning [logging] remove no longer needed verbosity override (#7100) 2020-09-15 04:01:14 -04:00
multiple-choice Black 20 release 2020-08-26 17:20:22 +02:00
question-answering [logging] remove no longer needed verbosity override (#7100) 2020-09-15 04:01:14 -04:00
seq2seq [s2s] dynamic batch size with --max_tokens_per_batch (#7030) 2020-09-17 15:19:34 -04:00
text-classification [logging] remove no longer needed verbosity override (#7100) 2020-09-15 04:01:14 -04:00
text-generation feat: allow prefix for any generative model (#5885) 2020-09-07 03:03:45 -04:00
token-classification Fix the TF Trainer gradient accumulation and the TF NER example (#6713) 2020-08-27 08:45:34 -04:00
README.md correct pl link in readme (#6364) 2020-08-10 03:08:46 -04:00
conftest.py enable easy checkout switch (#5645) 2020-07-31 04:34:46 -04:00
lightning_base.py clearly indicate shuffle=False (#6312) 2020-08-30 19:26:10 +08:00
requirements.txt [examples testing] restore code (#7099) 2020-09-14 08:54:23 -04:00
test_examples.py [examples testing] restore code (#7099) 2020-09-14 08:54:23 -04:00
test_xla_examples.py Add setup for TPU CI to run every hour. (#6219) 2020-08-07 11:17:07 -04:00
xla_spawn.py [TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223) 2020-05-08 14:10:05 -04:00

README.md

Examples

Version 2.9 of 🤗 Transformers introduces a new Trainer class for PyTorch, and its equivalent TFTrainer for TF 2. Running the examples requires PyTorch 1.3.1+ or TensorFlow 2.2+.

Here is the list of all our examples:

  • grouped by task (all official examples work for multiple models)
  • with information on whether they are built on top of Trainer/TFTrainer (if not, they still work, they might just lack some features),
  • whether they also include examples for pytorch-lightning, which is a great fully-featured, general-purpose training library for PyTorch,
  • links to Colab notebooks to walk through the scripts and run them easily,
  • links to Cloud deployments to be able to deploy large-scale trainings in the Cloud with little to no setup.

This is still a work-in-progress in particular documentation is still sparse so please contribute improvements/pull requests.

The Big Table of Tasks

Task Example datasets Trainer support TFTrainer support pytorch-lightning Colab
language-modeling Raw text - - Open In Colab
text-classification GLUE, XNLI Open In Colab
token-classification CoNLL NER -
multiple-choice SWAG, RACE, ARC - Open In Colab
question-answering SQuAD - -
text-generation - n/a n/a n/a Open In Colab
distillation All - - - -
summarization CNN/Daily Mail - - -
translation WMT - - -
bertology - - - - -
adversarial HANS - - -

Important note

Important To make sure you can successfully run the latest versions of the example scripts, you have to install the library from source and install some example-specific requirements. Execute the following steps in a new virtual environment:

git clone https://github.com/huggingface/transformers
cd transformers
pip install .
pip install -r ./examples/requirements.txt

One-click Deploy to Cloud (wip)

Azure

Deploy to Azure

Running on TPUs

When using Tensorflow, TPUs are supported out of the box as a tf.distribute.Strategy.

When using PyTorch, we support TPUs thanks to pytorch/xla. For more context and information on how to setup your TPU environment refer to Google's documentation and to the very detailed pytorch/xla README.

In this repo, we provide a very simple launcher script named xla_spawn.py that lets you run our example scripts on multiple TPU cores without any boilerplate. Just pass a --num_cores flag to this script, then your regular training script with its arguments (this is similar to the torch.distributed.launch helper for torch.distributed).

For example for run_glue:

python examples/xla_spawn.py --num_cores 8 \
	examples/text-classification/run_glue.py
	--model_name_or_path bert-base-cased \
	--task_name mnli \
	--data_dir ./data/glue_data/MNLI \
	--output_dir ./models/tpu \
	--overwrite_output_dir \
	--do_train \
	--do_eval \
	--num_train_epochs 1 \
	--save_steps 20000

Feedback and more use cases and benchmarks involving TPUs are welcome, please share with the community.

Logging & Experiment tracking

You can easily log and monitor your runs code. The following are currently supported:

Weights & Biases

To use Weights & Biases, install the wandb package with:

pip install wandb

Then log in the command line:

wandb login

If you are in Jupyter or Colab, you should login with:

import wandb
wandb.login()

Whenever you use Trainer or TFTrainer classes, your losses, evaluation metrics, model topology and gradients (for Trainer only) will automatically be logged.

When using 🤗 Transformers with PyTorch Lightning, runs can be tracked through WandbLogger. Refer to related documentation & examples.

Comet.ml

To use comet_ml, install the Python package with:

pip install comet_ml

or if in a Conda environment:

conda install -c comet_ml -c anaconda -c conda-forge comet_ml