transformers

Commit Graph

Author	SHA1	Message	Date
Manuel Romero	ff82a2aa93	Create README.md (#8015 )	2020-10-29 08:19:35 -04:00
Zhiqi Huang	0a3b9733cb	Add model_cards for DynaBERT (#8012 ) * Update README.md * Add dynabert_overview.png * Update README.md * Create README.md * Add dynabert_overview.png * Update README.md * Update README.md * Delete dynabert_overview.png * Update README.md * Delete dynabert_overview.png * Update README.md	2020-10-29 08:19:17 -04:00
Patrick von Platen	afa21504b1	add tags (#8147 )	2020-10-29 12:45:55 +01:00
Joe Davison	556709ad92	rm multiclass option from model card	2020-10-27 17:11:43 -04:00
Julien Chaumond	55bc0c599a	[model_cards] Switch to a more explicit domain for the media bucket	2020-10-27 18:08:05 +01:00
Philip May	8bbb74f211	[Model Card] new cross lingual sentence model for German and English (#8026 ) * mc for new cross lingual sentence model * fat text * url spelling fix * more url spelling fixes * slight thanks change * small improvements in text * multilingual word xchange * change colab link * xval fold number * add model links * line break in model names * Update README.md * Update README.md * new examples link * new examples link * add evaluation dataset name * add more about multi lingual * typo fix * typo * typos * hyperparameter typos * hyperparameter typo * add metadata * add metadata * Update README.md * typo fix * Small improvement	2020-10-26 14:48:26 -04:00
Joe Davison	fbcddb8544	add mutliclass field to default zero shot example	2020-10-26 11:07:51 -04:00
Joe Davison	b0a907615a	minor model card description updates (#8051 )	2020-10-26 10:04:20 -04:00
Julien Chaumond	7087d9b1c0	[model_cards] bert-base-danish Fixup #8030	2020-10-26 09:38:21 +01:00
Julien Chaumond	efc4a21ffa	Fixup #8025 Close #8030	2020-10-26 09:32:07 +01:00
Sam Longenbach	5148f43309	[Model Card] DJSammy/bert-base-danish-uncased_BotXO,ai (#8025 ) * Create README.md * Update README.md	2020-10-25 15:20:46 +08:00
Yixin Nie	00602f7840	Create model card for pre-trained NLI models. (#7864 ) * Create README.md * Update model_cards/ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Add Meta information for dataset identifier. Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-24 03:16:07 -04:00
Sacha Arbonel	59b5953d89	Create model card for bert-italian-cased-finetuned-pos (#8003 ) * Create README.md * Update model_cards/sachaarbonel/bert-italian-cased-finetuned-pos/README.md * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-23 10:58:05 -04:00
Zhiqi Huang	43fdafef89	Create README.md (#7997 )	2020-10-23 10:53:37 -04:00
Blaise Cruz	627e813734	Added model cards for Tagalog ELECTRA models (#7996 ) Co-authored-by: Jan Christian Blaise Cruz <jcblaise@Blaises-MacBook-Pro.local>	2020-10-23 10:52:21 -04:00
Philip May	9865e1fe52	model card for German Sentence Embeddings V2 (#7952 ) * model card German Sentence Embeddings V2 - for German RoBERTa for Sentence Embeddings V2 - marked old as outdated * small correction * small improvement in description * small spelling fix * spelling fix * add evaluation results * spearman explanation * add number of trials	2020-10-23 10:45:54 -04:00
Joe Davison	64b24bb3c2	change zero shot widget default example (#7992 )	2020-10-22 15:19:41 -06:00
Joe Davison	077c99bb5f	add zero shot pipeline tags & examples (#7983 ) * add zero shot pipeline tags * rm default and fix yaml format * rm DS_Store * add bart large default * don't add more typos Co-authored-by: Julien Chaumond <chaumond@gmail.com> * add multiple multilingual examples * improve multilingual examples for single-label Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-22 13:01:23 -06:00
Julien Chaumond	3479787edc	Disable inference API for t5-11b (#7978 )	2020-10-22 09:08:37 -04:00
Julien Chaumond	a7db81c33f	[model_card] t5-11b move disclaimer to top of page cc @Narsil @patrickvonplaten	2020-10-22 14:35:31 +02:00
Julien Chaumond	f8d3695e8c	[model_cards] camembert: dataset = oscar Hat/tip @pjox	2020-10-21 14:17:56 -04:00
Ali Hamdi Ali Fadel	bf162ce8ca	Add AI-SOCO models (#7867 )	2020-10-21 09:24:43 -04:00
Fangyu Liu	58fb25f25b	Create README.md (#7857 ) * Create README.md model card for cambridgeltl/BioRedditBERT-uncased. * Update model_cards/cambridgeltl/BioRedditBERT-uncased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-21 08:41:41 -04:00
Manuel Romero	2b07ec7823	Model card for German BERT fine-tuned for LER/NER (#7855 )	2020-10-21 08:31:41 -04:00
MichalPleban	35d2ad5b83	Create README.md (#7819 )	2020-10-21 08:30:01 -04:00
Wuwei Lan	bdda4f2249	Create README.md (#7625 ) * Create README.md * Update model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md * Update model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-21 08:29:39 -04:00
Manuel Romero	8e23749649	Add missing comma (#7870 )	2020-10-21 08:24:12 -04:00
Manuel Romero	3eaa007d78	Create README.md (#7899 )	2020-10-21 08:23:55 -04:00
Julien Chaumond	758572cad8	[model_cards] move hatmimoha/arabic-ner to correct location see `16d3cc187d` and https://github.com/huggingface/transformers/pull/7836	2020-10-21 14:13:17 +02:00
quentinheinrich	006a16483f	update model cards of Illuin models (#7930 )	2020-10-21 08:05:53 -04:00
Patrick von Platen	5cd9e2cba1	Update README.md	2020-10-21 12:43:42 +02:00
Patrick von Platen	220b5f97ca	Create README.md	2020-10-21 12:34:46 +02:00
Patrick von Platen	8ffd7fb12d	Update README.md	2020-10-21 12:27:09 +02:00
Patrick von Platen	613ab364eb	Update README.md	2020-10-21 12:23:17 +02:00
Patrick von Platen	f7eb17dc47	Update README.md	2020-10-21 12:19:44 +02:00
Patrick von Platen	0264048660	Update README.md	2020-10-20 16:13:49 +02:00
Patrick von Platen	f3312515b7	Add note for WikiSplit	2020-10-20 15:42:29 +02:00
Patrick von Platen	0724c0f3a2	Fix EncoderDecoder WikiSplit Example	2020-10-20 15:13:22 +02:00
Weizhen	2422cda01b	ProphetNet (#7157 ) * add new model prophetnet prophetnet modified modify codes as suggested v1 add prophetnet test files * still bugs, because of changed output formats of encoder and decoder * move prophetnet into the latest version * clean integration tests * clean tokenizers * add xlm config to init * correct typo in init * further refactoring * continue refactor * save parallel * add decoder_attention_mask * fix use_cache vs. past_key_values * fix common tests * change decoder output logits * fix xlm tests * make common tests pass * change model architecture * add tokenizer tests * finalize model structure * no weight mapping * correct n-gram stream attention mask as discussed with qweizhen * remove unused import * fix index.rst * fix tests * delete unnecessary code * add fast integration test * rename weights * final weight remapping * save intermediate * Descriptions for Prophetnet Config File * finish all models * finish new model outputs * delete unnecessary files * refactor encoder layer * add dummy docs * code quality * fix tests * add model pages to doctree * further refactor * more refactor, more tests * finish code refactor and tests * remove unnecessary files * further clean up * add docstring template * finish tokenizer doc * finish prophetnet * fix copies * fix typos * fix tf tests * fix fp16 * fix tf test 2nd try * fix code quality * add test for each model * merge new tests to branch * Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * Update src/transformers/modeling_prophetnet.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * Update utils/check_repo.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * apply sams and sylvains comments * make style * remove unnecessary code * Update README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/configuration_prophetnet.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * implement lysandres comments * correct docs * fix isort * fix tokenizers * fix copies Co-authored-by: weizhen <weizhen@mail.ustc.edu.cn> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-10-19 17:36:09 +02:00
Jordi Mas	ea1507fb45	Julibert model card (#7868 ) * Julibert model card * Fix text	2020-10-19 06:50:52 -04:00
Patrick von Platen	dc552b9b70	Fix typo in sequence model card	2020-10-16 16:05:06 +02:00
rmroczkowski	7b13bd01df	Herbert polish model (#7798 ) * HerBERT transformer model for Polish language understanding. * HerbertTokenizerFast generated with HerbertConverter * Herbert base and large model cards * Herbert model cards with tags * Herbert tensorflow models * Herbert model tests based on Bert test suit * src/transformers/tokenization_herbert.py edited online with Bitbucket * src/transformers/tokenization_herbert.py edited online with Bitbucket * docs/source/model_doc/herbert.rst edited online with Bitbucket * Herbert tokenizer tests and bug fixes * src/transformers/configuration_herbert.py edited online with Bitbucket * Copyrights and tests for TFHerbertModel * model_cards/allegro/herbert-base-cased/README.md edited online with Bitbucket * model_cards/allegro/herbert-large-cased/README.md edited online with Bitbucket * Bug fixes after testing * Reformat modified_only_fixup * Proper order of configuration * Herbert proper documentation formatting * Formatting with make modified_only_fixup * Dummies fixed * Adding missing models to documentation * Removing HerBERT model as it is a simple extension of BERT * Update model_cards/allegro/herbert-base-cased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Update model_cards/allegro/herbert-large-cased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> * HerbertTokenizer deprecated configuration removed Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-16 03:06:51 -04:00
David S. Lim	9c71cca316	model card for bert-base-NER (#7799 ) * model card for bert-base-NER * add meta data up top Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-15 21:55:00 +02:00
Julien Chaumond	e7aa64838c	[model_cards] facebook/bart-large-mnli: register ZSC for the inference API cc @Narsil @mfuntowicz @joeddav	2020-10-15 19:02:10 +02:00
Julien Chaumond	6f45dd2fac	[model_cards] Fix yaml for Facebook/wmt19-* see `d99ed7ad61`	2020-10-15 16:14:08 +02:00
Julien Chaumond	d99ed7ad61	[model_cards] Facebook: add thumbnail	2020-10-15 12:53:29 +02:00
Nils Reimers	3032de9369	Model Card (#7752 ) * Create README.md * Update model_cards/sentence-transformers/LaBSE/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-14 13:30:58 -04:00
sarahlintang	3fdbeba83c	[model_cards] sarahlintang/IndoBERT (#7748 ) * Create README.md * Update model_cards/sarahlintang/IndoBERT/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-14 13:10:31 -04:00
Julien Chaumond	ba654270b3	[model_cards] rename to correct model name	2020-10-14 19:02:48 +02:00
Zhuosheng Zhang	08978487e7	Create README.md (#7722 )	2020-10-14 12:56:12 -04:00
Sagor Sarker	3557509127	added evaluation results for classification task (#7790 )	2020-10-14 12:50:43 -04:00
XiaoqiJiao	890e790e16	[model_cards] TinyBERT (HUAWEI Noah's Ark Lab) (#7775 )	2020-10-14 09:31:01 -04:00
Alex Combessie	aacac8f708	Add license info to nlptown/bert-base-multilingual-uncased-sentiment (#7738 )	2020-10-12 11:56:10 -04:00
Andrew Kane	26d5475d4b	Added license information for default and distilbert models (#7688 )	2020-10-10 03:55:11 -04:00
Joe Davison	a1ac082879	add license to xlm-roberta-large-xnli card	2020-10-09 09:16:06 -04:00
Blaise Cruz	aee7967fc4	Added model cards for Tagalog BERT models (#7603 )	2020-10-07 16:49:20 -04:00
Bobby Donchev	b1c06140f4	Create README.md for IsRoBERTa language model (#7640 ) * Create README.md * Update README.md * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-07 16:46:03 -04:00
Keshan	e10d389561	[Model card] SinhalaBERTo model. (#7558 ) * [Model card] SinhalaBERTo model. This is the model card for keshan/SinhalaBERTo model. * Update model_cards/keshan/SinhalaBERTo/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-07 16:40:52 -04:00
Amine Abdaoui	167bce56f2	[model_card] bert-base-5lang-cased (#7573 ) Co-authored-by: Amin <amin.geotrend@gmail.com>	2020-10-07 16:38:14 -04:00
Abed khooli	923dd4e5ef	Create README.md (#7581 )	2020-10-07 16:37:40 -04:00
dartrevan	85ead0fec4	Update README.md (#7590 )	2020-10-07 16:37:10 -04:00
Ilias Chalkidis	c6b9c72eac	Update README.md (#7629 ) Minor changes: Add arxiv link + Layout improvement + fix typos	2020-10-07 16:36:08 -04:00
Abhilash Majumder	048b4bd2c6	Create Model Card For "abhilash1910/french-roberta" Model (#7544 )	2020-10-07 16:35:28 -04:00
Julien Chaumond	c2e0d8ac52	[model_card] nikokons/gpt2-greek by @nikkon3	2020-10-07 16:28:47 -04:00
Ahmed Elnaggar	aa6c3c14b4	typo fix (#7611 ) It should be T5-3B not T5-3M.	2020-10-06 15:32:52 +02:00
cedspam	8d2c248df7	Update README.md (#7612 )	2020-10-06 08:46:55 -04:00
Ilias Chalkidis	1c80b2c604	Create README.md (LEGAL-BERT Model card) (#7607 ) * Create README.md Model description for all LEGAL-BERT models, published as part of "LEGAL-BERT: The Muppets straight out of Law School". Chalkidis et al., 2018, In Findings of EMNLP 2020 * Update model_cards/nlpaueb/legal-bert-base-uncased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-06 08:46:17 -04:00
Ahmed Elnaggar	66c72082d0	Add ProtT5-XL-BFD model card (#7606 ) * Add ProtT5-XL-BFD model card * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-10-06 12:19:21 +02:00
Joshua H	1a00f46c74	Update Code example according to deprecation of AutoModeWithLMHead (#7555 ) 'The class `AutoModelWithLMHead` is deprecated and will be removed in a future version. Please use `AutoModelForCausalLM` for causal language models, `AutoModelForMaskedLM` for masked language models and `AutoModelForSeq2SeqLM` for encoder-decoder models.' I dont know how to change the 'How to use this model directly from the 🤗/transformers library:' part since it is not part of the model-paper	2020-10-05 08:21:21 -04:00
Nathan Cooper	071970feb8	[Model card] Java Code Summarizer model (#7568 ) * Create README.md * Update model_cards/ncoop57/bart-base-code-summarizer-java-v0/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-05 04:49:17 -04:00
Forrest Iandola	02ef825be2	SqueezeBERT architecture (#7083 ) * configuration_squeezebert.py thin wrapper around bert tokenizer fix typos wip sb model code wip modeling_squeezebert.py. Next step is to get the multi-layer-output interface working set up squeezebert to use BertModelOutput when returning results. squeezebert documentation formatting allow head mask that is an array of [None, ..., None] docs docs cont'd path to vocab docs and pointers to cloud files (WIP) line length and indentation squeezebert model cards formatting of model cards untrack modeling_squeezebert_scratchpad.py update aws paths to vocab and config files get rid of stub of NSP code, and advise users to pretrain with mlm only fix rebase issues redo rebase of modeling_auto.py fix issues with code formatting more code format auto-fixes move squeezebert before bert in tokenization_auto.py and modeling_auto.py because squeezebert inherits from bert tests for squeezebert modeling and tokenization fix typo move squeezebert before bert in modeling_auto.py to fix inheritance problem disable test_head_masking, since squeezebert doesn't yet implement head masking fix issues exposed by the test_modeling_squeezebert.py fix an issue exposed by test_tokenization_squeezebert.py fix issue exposed by test_modeling_squeezebert.py auto generated code style improvement issue that we inherited from modeling_xxx.py: SqueezeBertForMaskedLM.forward() calls self.cls(), but there is no self.cls, and I think the goal was actually to call self.lm_head() update copyright resolve failing 'test_hidden_states_output' and remove unused encoder_hidden_states and encoder_attention_mask docs add integration test. rename squeezebert-mnli --> squeezebert/squeezebert-mnli autogenerated formatting tweaks integrate feedback from patrickvonplaten and sgugger to programming style and documentation strings * tiny change to order of imports	2020-10-05 04:25:43 -04:00
Julien Chaumond	e32390931d	[model_card] distilbert-base-german-cased	2020-10-01 09:08:49 -04:00
Julien Chaumond	9a4e163b58	[model_card] Fix metadata, adalbertojunior/PTT5-SMALL-SUM	2020-10-01 08:54:06 -04:00
Adalberto	8435e10e24	Create README.md (#7299 ) * Create README.md * language metadata Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-01 08:52:28 -04:00
Martin Müller	d727432072	Update README.md (#7459 )	2020-10-01 08:51:26 -04:00
allenyummy	664da5b077	Create README.md (#7468 )	2020-10-01 08:50:26 -04:00
ahotrod	f745f61c99	Update README.md (#7491 ) Model now fine-tuned on Transformers 3.1.0, previous out-of-date model was fine-tuned on Transformers 2.3.0.	2020-10-01 08:50:07 -04:00
Abed khooli	6ef7658c0a	Create README.md (#7349 ) Model card for akhooli/personachat-arabic	2020-10-01 08:48:51 -04:00
Bayartsogt Yadamsuren	15ab3f049b	Creating readme for bert-base-mongolian-cased (#7439 ) * Creating readme for bert-base-mongolian-cased * Update model_cards/bayartsogt/bert-base-mongolian-cased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-01 08:46:27 -04:00
Bayartsogt Yadamsuren	0c2b9fa831	creating readme for bert-base-mongolian-uncased (#7440 )	2020-10-01 08:45:22 -04:00
Pengcheng He	7a0cf0ec93	Add DeBERTa model (#5929 ) * Add DeBERTa model * Remove dependency of deberta * Address comments * Patch DeBERTa Documentation Style * Add final tests * Style * Enable tests + nitpicks * position IDs * BERT -> DeBERTa * Quality * Style * Tokenization * Last updates. * @patrickvonplaten's comments * Not everything can be a copy * Apply most of @sgugger's review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Last reviews * DeBERTa -> Deberta Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-09-30 07:07:30 -04:00
GmailB	205bf0b7ea	Update README.md (#7444 ) Hi, just corrected the example code, add 2 links and fixed some typos	2020-09-29 03:18:01 -04:00
Typicasoft	671b278e25	Create README.md (#7436 ) * Create README.md MagBERT-NER : Added widget (Text) * Rename model_cards/README.md to model_cards/TypicaAI/magbert-ner/README.md	2020-09-28 18:25:25 -04:00
Manuel Romero	a1a8ffa512	Update README.md (#7429 ) Add links to models fine-tuned on a downstream task	2020-09-28 13:40:09 -04:00
Patrick von Platen	8279471506	correct RAG model cards (#7420 )	2020-09-28 11:08:39 +02:00
Patrick von Platen	1a14687e6f	Update README.md	2020-09-25 19:43:48 +02:00
Patrick von Platen	3327c2b0f6	Update README.md	2020-09-25 19:43:36 +02:00
Patrick von Platen	4e5b036bdd	Update README.md	2020-09-25 18:16:46 +02:00
Patrick von Platen	55eccfbb49	Update README.md	2020-09-25 18:16:44 +02:00
Patrick von Platen	5ff0d6d7d0	Update README.md	2020-09-25 16:58:29 +02:00
blinovpd	a9c7849cfa	[model_cards] blinoff/roberta-base-russian-v0 (#7317 )	2020-09-22 18:26:13 -04:00
Pavel Soriano	d6bc72c469	Fixed results of SQuAD-FR evaluation (#7313 ) The score for the F1 metric was reported as the Exact Match and vice-versa.	2020-09-22 12:39:07 -04:00
Thomas Winters	34a1b75f01	Added RobBERT-v2 model card (#7286 ) * Added RobBERT-v2 model card * minor Tweaks Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-09-21 16:17:28 -04:00
jjacampos	6513d16a48	IXAmBERT model card (#7283 ) This PR includes the model card for the IXAmBERT model which has been recently uploaded to the huggingface repository.	2020-09-21 16:15:31 -04:00
Suraj Patil	7a88ed6c2a	[model card] distlbart-mnli model cards (#7278 )	2020-09-21 12:26:18 -04:00
Dat Quoc Nguyen	67c4b0c517	Add model cards for new pre-trained BERTweet-COVID19 models (#7269 ) Two new pre-trained models "vinai/bertweet-covid19-base-cased" and "vinai/bertweet-covid19-base-uncased" are resulted by further pre-training the pre-trained model "vinai/bertweet-base" on a corpus of 23M COVID-19 English Tweets for 40 epochs.	2020-09-21 06:12:51 -04:00
Patrick von Platen	0cbe1139b1	Update README.md	2020-09-21 11:53:08 +02:00
Stas Bekman	4f6e525742	model card improvements (#7221 )	2020-09-19 17:02:05 -04:00
Stas Bekman	eb074af75e	fsmt tiny model card + script (#7244 )	2020-09-19 14:37:12 -04:00
Manuel Romero	1d90d0f386	Add title to model card (#7240 )	2020-09-19 02:10:45 -04:00
Manuel Romero	c9b7ef042f	Create README.md (#7239 )	2020-09-19 02:09:29 -04:00
Dat Quoc Nguyen	af2322c7a0	Add new pre-trained models BERTweet and PhoBERT (#6129 ) * Add BERTweet and PhoBERT models * Update modeling_auto.py Re-add `bart` to LM_MAPPING * Update tokenization_auto.py Re-add `from .configuration_mobilebert import MobileBertConfig` not sure why it's replaced by `from transformers.configuration_mobilebert import MobileBertConfig` * Add BERTweet and PhoBERT to pretrained_models.rst * Update tokenization_auto.py Remove BertweetTokenizer and PhobertTokenizer out of tokenization_auto.py (they are currently not supported by AutoTokenizer. * Update BertweetTokenizer - without nltk * Update model card for BERTweet * PhoBERT - with Auto mode - without import fastBPE * PhoBERT - with Auto mode - without import fastBPE * BERTweet - with Auto mode - without import fastBPE * Add PhoBERT and BERTweet to TF modeling auto * Improve Docstrings for PhobertTokenizer and BertweetTokenizer * Update PhoBERT and BERTweet model cards * Fixed a merge conflict in tokenization_auto * Used black to reformat BERTweet- and PhoBERT-related files * Used isort to reformat BERTweet- and PhoBERT-related files * Reformatted BERTweet- and PhoBERT-related files based on flake8 * Updated test files * Updated test files * Updated tf test files * Updated tf test files * Updated tf test files * Updated tf test files * Update commits from huggingface * Delete unnecessary files * Add tokenizers to auto and init files * Add test files for tokenizers * Revised model cards * Update save_vocabulary function in BertweetTokenizer and PhobertTokenizer and test files * Revised test files * Update orders of Phobert and Bertweet tokenizers in auto tokenization file	2020-09-18 13:16:43 -04:00
Patrick von Platen	9397436ea5	Create README.md	2020-09-18 16:52:00 +02:00
Patrick von Platen	7eeca4d399	Create README.md	2020-09-18 16:44:02 +02:00
Patrick von Platen	31516c776a	Update README.md	2020-09-18 16:37:14 +02:00
Patrick von Platen	4c14669a78	Update README.md	2020-09-18 16:35:11 +02:00
Julien Chaumond	eef8d94d19	[model_cards] We use ISO 639-1 cc @gentaiscool	2020-09-18 12:09:24 +02:00
Patrick von Platen	afd6a9f827	Create README.md	2020-09-18 11:41:12 +02:00
Patrick von Platen	9f1544b9e0	Create README.md	2020-09-18 11:37:20 +02:00
Manuel Romero	4a26e8ac5f	Create README.md (#7205 )	2020-09-18 03:24:30 -04:00
Manuel Romero	94320c5b81	Add customized text to widget (#7204 )	2020-09-18 03:24:23 -04:00
Manuel Romero	3aefb24b20	Create README.md (#7209 )	2020-09-18 03:24:10 -04:00
Manuel Romero	a22e7a8dd4	Create README.md (#7210 )	2020-09-18 03:23:58 -04:00
Manuel Romero	c028b26481	Create README.md (#7212 )	2020-09-18 03:23:49 -04:00
Genta Indra Winata	c7cdd7b4fd	Create README.md for indobert-lite-base-p1 (#7182 )	2020-09-18 03:22:32 -04:00
Genta Indra Winata	bfb9150b8f	Create README.md for indobert-lite-large-p1 (#7184 ) * Create README.md * Update README.md	2020-09-18 03:22:11 -04:00
Genta Indra Winata	d193593403	Create README.md (#7183 )	2020-09-18 03:21:54 -04:00
Genta Indra Winata	e65d846674	Create README.md (#7185 )	2020-09-18 03:21:39 -04:00
Genta Indra Winata	e27d86d48d	Create README.md for indobert-large-p2 model card (#7181 )	2020-09-18 03:21:28 -04:00
Genta Indra Winata	881c0783e9	Create README.md for indobert-large-p1 model card (#7180 )	2020-09-18 03:21:16 -04:00
Genta Indra Winata	e0d58a5c87	Create README.md (#7179 )	2020-09-18 03:20:59 -04:00
Genta Indra Winata	1313a1d2a8	Create README.md for indobert-base-p2 (#7178 )	2020-09-18 03:20:29 -04:00
tuner007	cf24f43e76	Create README.md (#7095 ) Create model card for Pegasus QA	2020-09-18 03:19:45 -04:00
Stas Bekman	edbaad2c5c	[model cards] fix metadata - 3rd attempt (#7218 )	2020-09-17 16:57:06 -04:00
Stas Bekman	51c4adf54c	[model cards] fix dataset yaml (#7216 )	2020-09-17 15:29:39 -04:00
Stas Bekman	9c5bcab5b0	[model cards] fix yaml in cards (#7207 )	2020-09-17 14:11:17 -04:00
Stas Bekman	0fe6e435b6	[model cards] ported allenai Deep Encoder, Shallow Decoder models (#7153 ) * [model cards] ported allenai Deep Encoder, Shallow Decoder models * typo * fix references * add allenai/wmt19-de-en-6-6 model cards * fill-in the missing info for the build script as provided by the searcher.	2020-09-17 17:58:49 +02:00
Stas Bekman	1eeb206bef	[ported model] FSMT (FairSeq MachineTranslation) (#6940 ) * ready for PR * cleanup * correct FSMT_PRETRAINED_MODEL_ARCHIVE_LIST * fix * perfectionism * revert change from another PR * odd, already committed this one * non-interactive upload workaround * backup the failed experiment * store langs in config * workaround for localizing model path * doc clean up as in https://github.com/huggingface/transformers/pull/6956 * style * back out debug mode * document: run_eval.py --num_beams 10 * remove unneeded constant * typo * re-use bart's Attention * re-use EncoderLayer, DecoderLayer from bart * refactor * send to cuda and fp16 * cleanup * revert (moved to another PR) * better error message * document run_eval --num_beams * solve the problem of tokenizer finding the right files when model is local * polish, remove hardcoded config * add a note that the file is autogenerated to avoid losing changes * prep for org change, remove unneeded code * switch to model4.pt, update scores * s/python/bash/ * missing init (but doesn't impact the finetuned model) * cleanup * major refactor (reuse-bart) * new model, new expected weights * cleanup * cleanup * full link * fix model type * merge porting notes * style * cleanup * have to create a DecoderConfig object to handle vocab_size properly * doc fix * add note (not a public class) * parametrize * - add bleu scores integration tests * skip test if sacrebleu is not installed * cache heavy models/tokenizers * some tweaks * remove tokens that aren't used * more purging * simplify code * switch to using decoder_start_token_id * add doc * Revert "major refactor (reuse-bart)" This reverts commit `226dad15ca`. * decouple from bart * remove unused code #1 * remove unused code #2 * remove unused code #3 * update instructions * clean up * move bleu eval to examples * check import only once * move data+gen script into files * reuse via import * take less space * add prepare_seq2seq_batch (auto-tested) * cleanup * recode test to use json instead of yaml * ignore keys not needed * use the new -y in transformers-cli upload -y * [xlm tok] config dict: fix str into int to match definition (#7034) * [s2s] --eval_max_generate_length (#7018) * Fix CI with change of name of nlp (#7054) * nlp -> datasets * More nlp -> datasets * Woopsie * More nlp -> datasets * One last * extending to support allen_nlp wmt models - allow a specific checkpoint file to be passed - more arg settings - scripts for allen_nlp models * sync with changes * s/fsmt-wmt/wmt/ in model names * s/fsmt-wmt/wmt/ in model names (p2) * s/fsmt-wmt/wmt/ in model names (p3) * switch to a better checkpoint * typo * make non-optional args such - adjust tests where possible or skip when there is no other choice * consistency * style * adjust header * cards moved (model rename) * use best custom hparams * update info * remove old cards * cleanup * s/stas/facebook/ * update scores * s/allen_nlp/allenai/ * url maps aren't needed * typo * move all the doc / build /eval generators to their own scripts * cleanup * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * fix indent * duplicated line * style * use the correct add_start_docstrings * oops * resizing can't be done with the core approach, due to 2 dicts * check that the arg is a list * style * style Co-authored-by: Sam Shleifer <sshleifer@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-09-17 11:31:29 -04:00
Antoine Louis	df165065c3	[model_cards] antoiloui/belgpt2 🇧🇪 (#7166 ) * Create README.md * Update README.md	2020-09-16 12:16:01 -04:00
Patrick von Platen	7af2791d77	Create README.md	2020-09-15 16:47:36 +02:00
Sylvain Gugger	153ec2f154	Funnel model cards (#7147 )	2020-09-15 10:40:57 -04:00
Pedro Lima	52d250f6aa	[model_cards] pvl/labse_bert model card From Language-Agnostic BERT Sentence Embedding https://ai.googleblog.com/2020/08/language-agnostic-bert-sentence.html	2020-09-15 08:54:12 -04:00
tuner007	84d64805b0	Create README.md (#7097 ) Model card for PEGASUS finetuned for paraphrasing task	2020-09-15 08:48:25 -04:00
Philip May	52bb7ccce5	German electra model card v3 update (#7089 ) * changed eval table model order * Update install * update mc	2020-09-15 08:48:13 -04:00
李明浩	563ffb3dc3	Create README.md (#7066 )	2020-09-11 15:21:05 -04:00
李明浩	1ad49cde3a	Create README.md (#7067 )	2020-09-11 15:20:54 -04:00
Sagor Sarker	4753816e39	added bangla-bert-base model card and also modified other model cards (#7071 ) * added bangla-bert-base * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-09-11 15:17:25 -04:00
Patrick von Platen	eb2feb5d90	Create README.md	2020-09-10 17:05:50 +02:00
Patrick von Platen	9ccdb1d517	Update README.md	2020-09-10 17:01:19 +02:00
Patrick von Platen	60698936fc	Create README.md	2020-09-10 17:00:10 +02:00
Patrick von Platen	e0c3bc8ee0	Create README.md	2020-09-10 16:51:15 +02:00
Patrick von Platen	c356b9878d	Create README.md	2020-09-10 16:45:44 +02:00
Patrick von Platen	5afd3f6196	Create README.md	2020-09-10 16:44:47 +02:00
Patrick von Platen	63e539459d	Update README.md	2020-09-10 16:34:28 +02:00
Patrick von Platen	054db06b1b	Create README.md	2020-09-10 16:30:46 +02:00
Patrick von Platen	76818cc4c6	Create README.md	2020-09-09 16:26:35 +02:00
Mehrdad Farahani	60fc03290b	README for HooshvareLab/bert-fa-base-uncased (#6990 ) ParsBERT v2.0 is a fine-tuned and vocab-reconstructed version of ParsBERT, and it's able to be used in other scopes! It includes these features: - We added some unused-vocab for use in summarization and other scopes. - We fine-tuned the model on vast styles of writing in the Persian language.	2020-09-07 16:43:50 -04:00
Abed khooli	e9d0d4c75c	Create README.md (#6974 )	2020-09-07 07:31:22 -04:00
Richard Bownes	e20d8895bd	Create README.md model card (#6964 ) * Create README.md * Add some custom prompts Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-09-07 06:01:40 -04:00
Julien Chaumond	10c6f94adc	[model_card] register jplu/tf-xlm-r-ner-40-lang as multilingual	2020-09-07 05:03:40 -04:00

1 2 3 4 5 ...

772 Commits