transformers

Commit Graph

Author	SHA1	Message	Date
Sam Shleifer	83dba10b8f	[s2s] distributed_eval.py saves better speed info (#7242 )	2020-09-18 15:46:01 -04:00
Dat Quoc Nguyen	af2322c7a0	Add new pre-trained models BERTweet and PhoBERT (#6129 ) * Add BERTweet and PhoBERT models * Update modeling_auto.py Re-add `bart` to LM_MAPPING * Update tokenization_auto.py Re-add `from .configuration_mobilebert import MobileBertConfig` not sure why it's replaced by `from transformers.configuration_mobilebert import MobileBertConfig` * Add BERTweet and PhoBERT to pretrained_models.rst * Update tokenization_auto.py Remove BertweetTokenizer and PhobertTokenizer out of tokenization_auto.py (they are currently not supported by AutoTokenizer. * Update BertweetTokenizer - without nltk * Update model card for BERTweet * PhoBERT - with Auto mode - without import fastBPE * PhoBERT - with Auto mode - without import fastBPE * BERTweet - with Auto mode - without import fastBPE * Add PhoBERT and BERTweet to TF modeling auto * Improve Docstrings for PhobertTokenizer and BertweetTokenizer * Update PhoBERT and BERTweet model cards * Fixed a merge conflict in tokenization_auto * Used black to reformat BERTweet- and PhoBERT-related files * Used isort to reformat BERTweet- and PhoBERT-related files * Reformatted BERTweet- and PhoBERT-related files based on flake8 * Updated test files * Updated test files * Updated tf test files * Updated tf test files * Updated tf test files * Updated tf test files * Update commits from huggingface * Delete unnecessary files * Add tokenizers to auto and init files * Add test files for tokenizers * Revised model cards * Update save_vocabulary function in BertweetTokenizer and PhobertTokenizer and test files * Revised test files * Update orders of Phobert and Bertweet tokenizers in auto tokenization file	2020-09-18 13:16:43 -04:00
Patrick von Platen	9397436ea5	Create README.md	2020-09-18 16:52:00 +02:00
Patrick von Platen	7eeca4d399	Create README.md	2020-09-18 16:44:02 +02:00
Patrick von Platen	31516c776a	Update README.md	2020-09-18 16:37:14 +02:00
Patrick von Platen	4c14669a78	Update README.md	2020-09-18 16:35:11 +02:00
Yih-Dar	3a03bab9db	Fix a few countings (steps / epochs) in trainer_tf.py (#7175 )	2020-09-18 09:28:56 -04:00
Stefan Schweter	ee9eae4e06	token-classification: update url of GermEval 2014 dataset (#6571 )	2020-09-18 06:18:06 -04:00
Julien Chaumond	eef8d94d19	[model_cards] We use ISO 639-1 cc @gentaiscool	2020-09-18 12:09:24 +02:00
Patrick von Platen	afd6a9f827	Create README.md	2020-09-18 11:41:12 +02:00
Patrick von Platen	9f1544b9e0	Create README.md	2020-09-18 11:37:20 +02:00
Sameer Zahid	5c1d5ea667	Fixed typo in README (#7233 )	2020-09-18 04:52:43 -04:00
Yuta Hayashibe	7719ecd19f	Fix a typo (#7225 )	2020-09-18 04:23:33 -04:00
Manuel Romero	4a26e8ac5f	Create README.md (#7205 )	2020-09-18 03:24:30 -04:00
Manuel Romero	94320c5b81	Add customized text to widget (#7204 )	2020-09-18 03:24:23 -04:00
Manuel Romero	3aefb24b20	Create README.md (#7209 )	2020-09-18 03:24:10 -04:00
Manuel Romero	a22e7a8dd4	Create README.md (#7210 )	2020-09-18 03:23:58 -04:00
Manuel Romero	c028b26481	Create README.md (#7212 )	2020-09-18 03:23:49 -04:00
Genta Indra Winata	c7cdd7b4fd	Create README.md for indobert-lite-base-p1 (#7182 )	2020-09-18 03:22:32 -04:00
Genta Indra Winata	bfb9150b8f	Create README.md for indobert-lite-large-p1 (#7184 ) * Create README.md * Update README.md	2020-09-18 03:22:11 -04:00
Genta Indra Winata	d193593403	Create README.md (#7183 )	2020-09-18 03:21:54 -04:00
Genta Indra Winata	e65d846674	Create README.md (#7185 )	2020-09-18 03:21:39 -04:00
Genta Indra Winata	e27d86d48d	Create README.md for indobert-large-p2 model card (#7181 )	2020-09-18 03:21:28 -04:00
Genta Indra Winata	881c0783e9	Create README.md for indobert-large-p1 model card (#7180 )	2020-09-18 03:21:16 -04:00
Genta Indra Winata	e0d58a5c87	Create README.md (#7179 )	2020-09-18 03:20:59 -04:00
Genta Indra Winata	1313a1d2a8	Create README.md for indobert-base-p2 (#7178 )	2020-09-18 03:20:29 -04:00
tuner007	cf24f43e76	Create README.md (#7095 ) Create model card for Pegasus QA	2020-09-18 03:19:45 -04:00
Sam Shleifer	67d9fc50d9	[s2s] remove double assert (#7223 )	2020-09-17 18:32:31 -04:00
Stas Bekman	edbaad2c5c	[model cards] fix metadata - 3rd attempt (#7218 )	2020-09-17 16:57:06 -04:00
Stas Bekman	999a1c957a	skip failing FSMT CUDA tests until investigated (#7220 )	2020-09-17 16:53:14 -04:00
Stas Bekman	51c4adf54c	[model cards] fix dataset yaml (#7216 )	2020-09-17 15:29:39 -04:00
Sam Shleifer	a5638b2b3a	[s2s] dynamic batch size with --max_tokens_per_batch (#7030 )	2020-09-17 15:19:34 -04:00
Stas Bekman	efeab6a3f1	[s2s] run_eval/run_eval_search tweaks (#7192 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-09-17 14:26:38 -04:00
Stas Bekman	9c5bcab5b0	[model cards] fix yaml in cards (#7207 )	2020-09-17 14:11:17 -04:00
Sohee Yang	e643a29722	Change to use relative imports in some files & Add python prompt symbols to example codes (#7202 ) * Move 'from transformers' statements to relative imports in some files * Add python prompt symbols in front of the example codes * Reformat the code * Add one missing space Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-09-17 12:30:45 -04:00
Stas Bekman	0fe6e435b6	[model cards] ported allenai Deep Encoder, Shallow Decoder models (#7153 ) * [model cards] ported allenai Deep Encoder, Shallow Decoder models * typo * fix references * add allenai/wmt19-de-en-6-6 model cards * fill-in the missing info for the build script as provided by the searcher.	2020-09-17 17:58:49 +02:00
Stas Bekman	1eeb206bef	[ported model] FSMT (FairSeq MachineTranslation) (#6940 ) * ready for PR * cleanup * correct FSMT_PRETRAINED_MODEL_ARCHIVE_LIST * fix * perfectionism * revert change from another PR * odd, already committed this one * non-interactive upload workaround * backup the failed experiment * store langs in config * workaround for localizing model path * doc clean up as in https://github.com/huggingface/transformers/pull/6956 * style * back out debug mode * document: run_eval.py --num_beams 10 * remove unneeded constant * typo * re-use bart's Attention * re-use EncoderLayer, DecoderLayer from bart * refactor * send to cuda and fp16 * cleanup * revert (moved to another PR) * better error message * document run_eval --num_beams * solve the problem of tokenizer finding the right files when model is local * polish, remove hardcoded config * add a note that the file is autogenerated to avoid losing changes * prep for org change, remove unneeded code * switch to model4.pt, update scores * s/python/bash/ * missing init (but doesn't impact the finetuned model) * cleanup * major refactor (reuse-bart) * new model, new expected weights * cleanup * cleanup * full link * fix model type * merge porting notes * style * cleanup * have to create a DecoderConfig object to handle vocab_size properly * doc fix * add note (not a public class) * parametrize * - add bleu scores integration tests * skip test if sacrebleu is not installed * cache heavy models/tokenizers * some tweaks * remove tokens that aren't used * more purging * simplify code * switch to using decoder_start_token_id * add doc * Revert "major refactor (reuse-bart)" This reverts commit `226dad15ca`. 
* decouple from bart * remove unused code #1 * remove unused code #2 * remove unused code #3 * update instructions * clean up * move bleu eval to examples * check import only once * move data+gen script into files * reuse via import * take less space * add prepare_seq2seq_batch (auto-tested) * cleanup * recode test to use json instead of yaml * ignore keys not needed * use the new -y in transformers-cli upload -y * [xlm tok] config dict: fix str into int to match definition (#7034) * [s2s] --eval_max_generate_length (#7018) * Fix CI with change of name of nlp (#7054) * nlp -> datasets * More nlp -> datasets * Woopsie * More nlp -> datasets * One last * extending to support allen_nlp wmt models - allow a specific checkpoint file to be passed - more arg settings - scripts for allen_nlp models * sync with changes * s/fsmt-wmt/wmt/ in model names * s/fsmt-wmt/wmt/ in model names (p2) * s/fsmt-wmt/wmt/ in model names (p3) * switch to a better checkpoint * typo * make non-optional args such - adjust tests where possible or skip when there is no other choice * consistency * style * adjust header * cards moved (model rename) * use best custom hparams * update info * remove old cards * cleanup * s/stas/facebook/ * update scores * s/allen_nlp/allenai/ * url maps aren't needed * typo * move all the doc / build /eval generators to their own scripts * cleanup * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * fix indent * duplicated line * style * use the correct add_start_docstrings * oops * resizing can't be done with the core approach, due to 2 dicts * check that the arg is a list * style * style Co-authored-by: Sam Shleifer <sshleifer@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-09-17 11:31:29 -04:00
Sylvain Gugger	492bb6aa48	Trainer multi label (#7191 ) * Trainer accep multiple labels * Missing import * Fix dosctrings	2020-09-17 08:15:37 -04:00
RafaelWO	709745927b	Transformer-XL: Remove unused parameters (#7087 ) * Removed 'tgt_len' and 'ext_len' from Transfomer-XL * Some changes are still to be done * Removed 'tgt_len' and 'ext_len' from Transfomer-XL (2) * Removed comments * Fixed quality * Changed warning to info	2020-09-17 06:10:34 -04:00
Dhaval Taunk	c183d81e27	added multilabel text classification notebook using distilbert to community notebooks (#7201 ) * added multilabel classification using distilbert notebook to community notebooks * added multilabel classification using distilbert notebook to community notebooks	2020-09-17 05:58:57 -04:00
Stas Bekman	79111b77d2	remove deprecated flag (#7171 ) ``` /home/circleci/.local/lib/python3.6/site-packages/isort/main.py:915: UserWarning: W0501: The following deprecated CLI flags were used and ignored: --recursive! "W0501: The following deprecated CLI flags were used and ignored: " ```	2020-09-17 05:52:12 -04:00
Stas Bekman	0cdafbf7ec	remove duplicated code (#7173 )	2020-09-17 05:51:40 -04:00
Sam Shleifer	45b0b1ff2f	[s2s] fix kwarg typo (#7196 )	2020-09-16 21:58:57 -04:00
Sam Shleifer	0203ad43bc	[s2s] distributed eval cleanup (#7186 )	2020-09-16 15:38:37 -04:00
sgugger	3babef815c	Formatting	2020-09-16 14:57:09 -04:00
Stas Bekman	42049b8e12	use the correct add_start_docstrings (#7174 )	2020-09-16 14:40:35 -04:00
Stas Bekman	fdaf8ab349	[s2s run_eval] new features (#7109 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-09-16 13:59:57 -04:00
Antoine Louis	df165065c3	[model_cards] antoiloui/belgpt2 🇧🇪 (#7166 ) * Create README.md * Update README.md	2020-09-16 12:16:01 -04:00
Sylvain Gugger	108c9aefcc	Update README (#7133 ) * Rewrite and update README * Typo and migration guide * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Address Clem's comments Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-09-16 12:12:12 -04:00
Donna Choi	9e376e156a	Add condition (#7161 )	2020-09-16 09:15:10 -04:00

5232 Commits, All Branches
