transformers/examples/research_projects/performer
Sanchit Gandhi e6f221c8d4
[JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* (#18361)
* [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_*

* fix double tree_util
2022-09-09 15:18:56 +02:00
..
README.md Adding performer fine-tuning research exampke (#9239) 2020-12-21 21:19:41 +01:00
full_script.sh Adding performer fine-tuning research exampke (#9239) 2020-12-21 21:19:41 +01:00
modeling_flax_performer.py Adding performer fine-tuning research exampke (#9239) 2020-12-21 21:19:41 +01:00
modeling_flax_performer_utils.py Black preview (#17217) 2022-05-12 16:25:55 -04:00
run_mlm_performer.py [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* (#18361) 2022-09-09 15:18:56 +02:00
sanity_script.sh Adding performer fine-tuning research exampke (#9239) 2020-12-21 21:19:41 +01:00

README.md

Performer fine-tuning

Example authors: @TevenLeScao, @Patrickvonplaten

Paper authors: Krzysztof Choromanski, Valerii Likhosherstov, David Dohan, Xingyou Song, Andreea Gane, Tamas Sarlos, Peter Hawkins, Jared Davis, Afroz Mohiuddin, Lukasz Kaiser, David Belanger, Lucy Colwell, Adrian Weller

Requirements

datasets, flax and jax. wandb integration is built-in if you want to use it.

Examples

sanity_script.sh will launch performer fine-tuning from the bert-base-cased checkpoint on the Simple Wikipedia dataset (a small, easy-language English Wikipedia) from datasets. full_script.sh will launch performer fine-tuning from the bert-large-cased checkpoint on the English Wikipedia dataset from datasets.

Here are a few key arguments:

  • Remove the --performer argument to use a standard Bert model.

  • Add --reinitialize to start from a blank model rather than a Bert checkpoint.

  • You may change the Bert size by passing a different checkpoint to the --model_name_or_path argument.

  • Passing your user name to the --wandb_user_name argument will trigger weights and biases logging.

  • You can choose a dataset with --dataset_name and --dataset_config. Our viewer will help you find what you need.