transformers

Commit Graph

Author	SHA1	Message	Date
Mehrdad Farahani	daaa68451e	Readme for Wiki Summary [Persian] bert2bert (#8558 )	2020-11-16 05:04:46 -05:00
Mehrdad Farahani	06d468d3f0	Readme for News Headline Generation (bert2bert) (#8557 )	2020-11-16 05:04:38 -05:00
zhezhaoa	9b7fb8a368	Create README.md for Chinese RoBERTa Miniatures (#8550 ) * Create README.md * Update model_cards/uer/chinese_roberta_L-2_H-128/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-16 05:01:28 -05:00
Joe Davison	f6f4da8dd4	Add bart-large-mnli model card (#8527 )	2020-11-13 14:07:25 -05:00
Branden Chan	4df6b59318	Update deepset/roberta-base-squad2 model card (#8522 ) * Update README.md * Update README.md	2020-11-13 09:58:27 -05:00
Forrest Iandola	0fa0349883	fix SqueezeBertForMaskedLM (#8479 )	2020-11-12 12:19:37 -05:00
Antonio Lanza	17b1fd804f	Fix typo in roberta-base-squad2-v2 model card (#8489 )	2020-11-12 05:29:37 -05:00
Julien Chaumond	c6c08ebf61	[model_cards] other chars than [\w\-_] not allowed anymore in model names cc @Pierrci	2020-11-12 10:45:29 +01:00
Julien Chaumond	8dda9167de	[model_cards] harmonization	2020-11-11 12:42:50 +01:00
Santiago Castro	8fe6629bb4	Add missing tasks to `pipeline` docstring (#8428 )	2020-11-10 13:44:25 -05:00
Julien Chaumond	70f622fab4	Model versioning (#8324 ) * fix typo * rm use_cdn & references, and implement new hf_bucket_url * I'm pretty sure we don't need to `read` this file * same here * [BIG] file_utils.networking: do not gobble up errors anymore * Fix CI 😇 * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Tiny doc tweak * Add doc + pass kwarg everywhere * Add more tests and explain cc @sshleifer let me know if better Co-Authored-By: Sam Shleifer <sshleifer@gmail.com> * Also implement revision in pipelines In the case where we're passing a task name or a string model identifier * Fix CI 😇 * Fix CI * [hf_api] new methods + command line implem * make style * Final endpoints post-migration * Fix post-migration * Py3.6 compat cc @stefan-it Thank you @stas00 Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-11-10 07:11:02 -05:00
Patrick von Platen	9c83b96e62	[Tests] Add Common Test for Training + Fix a couple of bugs (#8415 ) * add training tests * correct longformer * fix docs * fix some tests * fix some more train tests * remove ipdb * fix multiple edge case model training * fix funnel and prophetnet * clean gpt models * undo renaming of albert	2020-11-09 18:24:41 +01:00
Sylvain Gugger	908a28894c	Add new token classification example (#8340 ) * Add new token classification example * Remove txt file * Add test * With actual testing done * Less warmup is better * Update examples/token-classification/run_ner_new.py Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Address review comments * Fix test * Make Lysandre happy * Last touches and rename * Rename in tests * Address review comments * More run_ner -> run_ner_old Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-11-09 11:39:55 -05:00
dartrevan	507dfb40c3	Update README.md (#8406 )	2020-11-09 16:44:43 +08:00
smanjil	7247d0b4ea	updating tag for exbert viz (#8408 )	2020-11-09 16:43:55 +08:00
Chengxi Guo	0b02489b2c	Add gpt2-medium-chinese model card (#8402 ) * Create README.md * Update model_cards/mymusise/gpt2-medium-chinese/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-08 05:00:19 -05:00
Stas Bekman	187554366f	fix md table (#8395 )	2020-11-08 04:25:14 -05:00
hassoudi	30f2507a07	Update README.md (#8360 ) Fix websitr address	2020-11-06 11:45:46 -05:00
hassoudi	82146496b6	Update README.md (#8338 ) fixes	2020-11-06 06:20:58 -05:00
ktrapeznikov	9e5c4d39ab	Create README.md (#8312 ) * Create README.md * Update model_cards/ktrapeznikov/gpt2-medium-topic-news/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 06:19:59 -05:00
hasantanvir79	06ebc37967	Create README.md (#8255 ) * Create README.md Initial commit * Updated Read me Updated * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 03:34:24 -05:00
Karthik Uppuluri	41cd031cf2	Create README.md (#8169 )	2020-11-06 03:26:07 -05:00
Karthik Uppuluri	f932ddeff5	Create README.md (#8170 )	2020-11-06 03:25:52 -05:00
Karthik Uppuluri	08b92f78fa	Create README.md (#8168 ) * Create README.md * Update README.md	2020-11-06 03:25:33 -05:00
Karthik Uppuluri	77d62e78b0	Create README.md (#8167 ) * Create README.md Telugu BERTU Readme file * Update model_cards/kuppuluri/telugu_bertu/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 03:24:31 -05:00
Yifan Peng	dd6bfcaefb	Create README.md (#8327 )	2020-11-06 03:22:52 -05:00
smanjil	ddeecf08e6	german medbert model details (#8266 ) * model details * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 03:21:13 -05:00
Jiaxin Pei	96baaafd34	Create README.md (#8258 )	2020-11-06 03:19:12 -05:00
Stefan Schweter	185259c261	[model_cards] Update Italian BERT models and introduce new Italian XXL ELECTRA model 🎉 (#8343 )	2020-11-06 03:17:03 -05:00
Manuel Romero	34bbf60bf8	Model card: GPT-2 fine-tuned on CommonGen (#8248 )	2020-11-06 03:15:11 -05:00
Manuel Romero	973218fd3b	Model card: CodeBERT fine-tuned for Insecure Code Detection (#8247 ) * Model card: CodeBERT fine-tuned for Insecure Code Detection * Update model_cards/mrm8488/codebert-base-finetuned-detect-insecure-code/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-06 03:13:45 -05:00
Manuel Romero	f833ca418b	Model card: T5-base fine-tuned on QuaRel (#8334 )	2020-11-06 03:09:55 -05:00
Yifan Peng	638c0b7c50	Create README.md (#8223 ) * Create README.md * Update README.md * Apply suggestions from code review Co-authored-by: Kevin Canwen Xu <canwenxu@126.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-05 03:03:19 -05:00
Victor SANH	969ccac2e9	adding model cards for distilled models (#8300 ) * adding model cards for distil models * forgot the languages	2020-11-04 11:41:45 -05:00
Branden Chan	38630e7a87	Update model cards of deepset/roberta-base-squad2 v1 and v2 (#8241 ) * update deepset/roberta-base-squad2 to v2 * Update model_cards/deepset/roberta-base-squad2/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-04 11:21:25 -05:00
Manuel Romero	04561ecbe6	Model card: T5-base fine-tuned on QASC (#8299 )	2020-11-04 11:20:15 -05:00
Patrick von Platen	5b178f3c87	Create README.md	2020-11-02 20:03:44 +01:00
Patrick von Platen	ebec410c71	Create README.md	2020-11-02 17:53:22 +01:00
Zhiqi Huang	00cc2d1df2	DynaBERT model cards update (#8192 ) * Update README.md * Update README.md	2020-11-02 13:19:38 +08:00
Kushal	aa79aa4e7d	Added 12 model cards for Indian Language Models (#8198 ) * Create README.md * added model cards	2020-11-02 13:17:43 +08:00
Santiago Castro	969859d5f6	Fix doc errors and typos across the board (#8139 ) * Fix doc errors and typos across the board * Fix a typo * Fix the CI * Fix more typos * Fix CI * More fixes * Fix CI * More fixes * More fixes	2020-10-29 10:33:33 -04:00
Ethan	4731a00c3e	Update widget examples. (#8149 ) Co-authored-by: yantan <yantan@effyic.com>	2020-10-29 08:49:16 -04:00
dartrevan	238876068c	Update README.md (#8090 )	2020-10-29 08:31:32 -04:00
Branden Chan	e566adc09c	Add model_cards (#7969 ) * add readme * add readmes * Add metadata	2020-10-29 08:29:54 -04:00
dartrevan	cc8941d881	Create README.md (#8089 )	2020-10-29 08:23:43 -04:00
dartrevan	234a6dc388	Create README.md (#8088 ) * Create README.md * metadata Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-29 08:23:30 -04:00
gurkan08	5d76859531	Create README.md (#8075 ) * Create README.md * Update model_cards/gurkan08/bert-turkish-text-classification/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-29 08:22:33 -04:00
Ethan	b215090eed	Add two model_cards: ethanyt/guwenbert-base and ethanyt/guwenbert-large (#8041 )	2020-10-29 08:21:54 -04:00
Ashwani Tanwar	ba2ad3a98a	Model Card for Gujarati-XLM-R-Base (#8038 ) * Add model card for Gujarati-XLM-R-Base * Update README.md Add the model card for the Gujarati-XLM-R-Base. * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-29 08:21:11 -04:00
Manuel Romero	52cea7de75	Create README.md (#8017 )	2020-10-29 08:19:47 -04:00
Manuel Romero	ff82a2aa93	Create README.md (#8015 )	2020-10-29 08:19:35 -04:00
Zhiqi Huang	0a3b9733cb	Add model_cards for DynaBERT (#8012 ) * Update README.md * Add dynabert_overview.png * Update README.md * Create README.md * Add dynabert_overview.png * Update README.md * Update README.md * Delete dynabert_overview.png * Update README.md * Delete dynabert_overview.png * Update README.md	2020-10-29 08:19:17 -04:00
Patrick von Platen	afa21504b1	add tags (#8147 )	2020-10-29 12:45:55 +01:00
Joe Davison	556709ad92	rm multiclass option from model card	2020-10-27 17:11:43 -04:00
Julien Chaumond	55bc0c599a	[model_cards] Switch to a more explicit domain for the media bucket	2020-10-27 18:08:05 +01:00
Philip May	8bbb74f211	[Model Card] new cross lingual sentence model for German and English (#8026 ) * mc for new cross lingual sentence model * fat text * url spelling fix * more url spelling fixes * slight thanks change * small improvements in text * multilingual word xchange * change colab link * xval fold number * add model links * line break in model names * Update README.md * Update README.md * new examples link * new examples link * add evaluation dataset name * add more about multi lingual * typo fix * typo * typos * hyperparameter typos * hyperparameter typo * add metadata * add metadata * Update README.md * typo fix * Small improvement	2020-10-26 14:48:26 -04:00
Joe Davison	fbcddb8544	add mutliclass field to default zero shot example	2020-10-26 11:07:51 -04:00
Joe Davison	b0a907615a	minor model card description updates (#8051 )	2020-10-26 10:04:20 -04:00
Julien Chaumond	7087d9b1c0	[model_cards] bert-base-danish Fixup #8030	2020-10-26 09:38:21 +01:00
Julien Chaumond	efc4a21ffa	Fixup #8025 Close #8030	2020-10-26 09:32:07 +01:00
Sam Longenbach	5148f43309	[Model Card] DJSammy/bert-base-danish-uncased_BotXO,ai (#8025 ) * Create README.md * Update README.md	2020-10-25 15:20:46 +08:00
Yixin Nie	00602f7840	Create model card for pre-trained NLI models. (#7864 ) * Create README.md * Update model_cards/ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Add Meta information for dataset identifier. Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-24 03:16:07 -04:00
Sacha Arbonel	59b5953d89	Create model card for bert-italian-cased-finetuned-pos (#8003 ) * Create README.md * Update model_cards/sachaarbonel/bert-italian-cased-finetuned-pos/README.md * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-23 10:58:05 -04:00
Zhiqi Huang	43fdafef89	Create README.md (#7997 )	2020-10-23 10:53:37 -04:00
Blaise Cruz	627e813734	Added model cards for Tagalog ELECTRA models (#7996 ) Co-authored-by: Jan Christian Blaise Cruz <jcblaise@Blaises-MacBook-Pro.local>	2020-10-23 10:52:21 -04:00
Philip May	9865e1fe52	model card for German Sentence Embeddings V2 (#7952 ) * model card German Sentence Embeddings V2 - for German RoBERTa for Sentence Embeddings V2 - marked old as outdated * small correction * small improvement in description * small spelling fix * spelling fix * add evaluation results * spearman explanation * add number of trials	2020-10-23 10:45:54 -04:00
Joe Davison	64b24bb3c2	change zero shot widget default example (#7992 )	2020-10-22 15:19:41 -06:00
Joe Davison	077c99bb5f	add zero shot pipeline tags & examples (#7983 ) * add zero shot pipeline tags * rm default and fix yaml format * rm DS_Store * add bart large default * don't add more typos Co-authored-by: Julien Chaumond <chaumond@gmail.com> * add multiple multilingual examples * improve multilingual examples for single-label Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-22 13:01:23 -06:00
Julien Chaumond	3479787edc	Disable inference API for t5-11b (#7978 )	2020-10-22 09:08:37 -04:00
Julien Chaumond	a7db81c33f	[model_card] t5-11b move disclaimer to top of page cc @Narsil @patrickvonplaten	2020-10-22 14:35:31 +02:00
Julien Chaumond	f8d3695e8c	[model_cards] camembert: dataset = oscar Hat/tip @pjox	2020-10-21 14:17:56 -04:00
Ali Hamdi Ali Fadel	bf162ce8ca	Add AI-SOCO models (#7867 )	2020-10-21 09:24:43 -04:00
Fangyu Liu	58fb25f25b	Create README.md (#7857 ) * Create README.md model card for cambridgeltl/BioRedditBERT-uncased. * Update model_cards/cambridgeltl/BioRedditBERT-uncased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-21 08:41:41 -04:00
Manuel Romero	2b07ec7823	Model card for German BERT fine-tuned for LER/NER (#7855 )	2020-10-21 08:31:41 -04:00
MichalPleban	35d2ad5b83	Create README.md (#7819 )	2020-10-21 08:30:01 -04:00
Wuwei Lan	bdda4f2249	Create README.md (#7625 ) * Create README.md * Update model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md * Update model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-21 08:29:39 -04:00
Manuel Romero	8e23749649	Add missing comma (#7870 )	2020-10-21 08:24:12 -04:00
Manuel Romero	3eaa007d78	Create README.md (#7899 )	2020-10-21 08:23:55 -04:00
Julien Chaumond	758572cad8	[model_cards] move hatmimoha/arabic-ner to correct location see `16d3cc187d` and https://github.com/huggingface/transformers/pull/7836	2020-10-21 14:13:17 +02:00
quentinheinrich	006a16483f	update model cards of Illuin models (#7930 )	2020-10-21 08:05:53 -04:00
Patrick von Platen	5cd9e2cba1	Update README.md	2020-10-21 12:43:42 +02:00
Patrick von Platen	220b5f97ca	Create README.md	2020-10-21 12:34:46 +02:00
Patrick von Platen	8ffd7fb12d	Update README.md	2020-10-21 12:27:09 +02:00
Patrick von Platen	613ab364eb	Update README.md	2020-10-21 12:23:17 +02:00
Patrick von Platen	f7eb17dc47	Update README.md	2020-10-21 12:19:44 +02:00
Patrick von Platen	0264048660	Update README.md	2020-10-20 16:13:49 +02:00
Patrick von Platen	f3312515b7	Add note for WikiSplit	2020-10-20 15:42:29 +02:00
Patrick von Platen	0724c0f3a2	Fix EncoderDecoder WikiSplit Example	2020-10-20 15:13:22 +02:00
Weizhen	2422cda01b	ProphetNet (#7157 ) * add new model prophetnet prophetnet modified modify codes as suggested v1 add prophetnet test files * still bugs, because of changed output formats of encoder and decoder * move prophetnet into the latest version * clean integration tests * clean tokenizers * add xlm config to init * correct typo in init * further refactoring * continue refactor * save parallel * add decoder_attention_mask * fix use_cache vs. past_key_values * fix common tests * change decoder output logits * fix xlm tests * make common tests pass * change model architecture * add tokenizer tests * finalize model structure * no weight mapping * correct n-gram stream attention mask as discussed with qweizhen * remove unused import * fix index.rst * fix tests * delete unnecessary code * add fast integration test * rename weights * final weight remapping * save intermediate * Descriptions for Prophetnet Config File * finish all models * finish new model outputs * delete unnecessary files * refactor encoder layer * add dummy docs * code quality * fix tests * add model pages to doctree * further refactor * more refactor, more tests * finish code refactor and tests * remove unnecessary files * further clean up * add docstring template * finish tokenizer doc * finish prophetnet * fix copies * fix typos * fix tf tests * fix fp16 * fix tf test 2nd try * fix code quality * add test for each model * merge new tests to branch * Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * Update src/transformers/modeling_prophetnet.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * Update utils/check_repo.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * apply sams and sylvains comments * make style * remove unnecessary code * Update README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/configuration_prophetnet.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * implement lysandres comments * correct docs * fix isort * fix tokenizers * fix copies Co-authored-by: weizhen <weizhen@mail.ustc.edu.cn> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-10-19 17:36:09 +02:00
Jordi Mas	ea1507fb45	Julibert model card (#7868 ) * Julibert model card * Fix text	2020-10-19 06:50:52 -04:00
Patrick von Platen	dc552b9b70	Fix typo in sequence model card	2020-10-16 16:05:06 +02:00
rmroczkowski	7b13bd01df	Herbert polish model (#7798 ) * HerBERT transformer model for Polish language understanding. * HerbertTokenizerFast generated with HerbertConverter * Herbert base and large model cards * Herbert model cards with tags * Herbert tensorflow models * Herbert model tests based on Bert test suit * src/transformers/tokenization_herbert.py edited online with Bitbucket * src/transformers/tokenization_herbert.py edited online with Bitbucket * docs/source/model_doc/herbert.rst edited online with Bitbucket * Herbert tokenizer tests and bug fixes * src/transformers/configuration_herbert.py edited online with Bitbucket * Copyrights and tests for TFHerbertModel * model_cards/allegro/herbert-base-cased/README.md edited online with Bitbucket * model_cards/allegro/herbert-large-cased/README.md edited online with Bitbucket * Bug fixes after testing * Reformat modified_only_fixup * Proper order of configuration * Herbert proper documentation formatting * Formatting with make modified_only_fixup * Dummies fixed * Adding missing models to documentation * Removing HerBERT model as it is a simple extension of BERT * Update model_cards/allegro/herbert-base-cased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Update model_cards/allegro/herbert-large-cased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> * HerbertTokenizer deprecated configuration removed Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-16 03:06:51 -04:00
David S. Lim	9c71cca316	model card for bert-base-NER (#7799 ) * model card for bert-base-NER * add meta data up top Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-15 21:55:00 +02:00
Julien Chaumond	e7aa64838c	[model_cards] facebook/bart-large-mnli: register ZSC for the inference API cc @Narsil @mfuntowicz @joeddav	2020-10-15 19:02:10 +02:00
Julien Chaumond	6f45dd2fac	[model_cards] Fix yaml for Facebook/wmt19-* see `d99ed7ad61`	2020-10-15 16:14:08 +02:00
Julien Chaumond	d99ed7ad61	[model_cards] Facebook: add thumbnail	2020-10-15 12:53:29 +02:00
Nils Reimers	3032de9369	Model Card (#7752 ) * Create README.md * Update model_cards/sentence-transformers/LaBSE/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-14 13:30:58 -04:00
sarahlintang	3fdbeba83c	[model_cards] sarahlintang/IndoBERT (#7748 ) * Create README.md * Update model_cards/sarahlintang/IndoBERT/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-14 13:10:31 -04:00
Julien Chaumond	ba654270b3	[model_cards] rename to correct model name	2020-10-14 19:02:48 +02:00
Zhuosheng Zhang	08978487e7	Create README.md (#7722 )	2020-10-14 12:56:12 -04:00

1 2 3 4 5 ...

772 Commits