* Update transformers.js version
* Use Singleton object in electron tutorial
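  The Electron tutorial's change boils down to lazily creating the pipeline once and reusing it. A minimal sketch of that singleton pattern, where `loadModel` is a hypothetical stand-in for the actual `pipeline(...)` factory call:

  ```javascript
  // Sketch of a lazy singleton: the expensive model load runs at most once,
  // and every later caller awaits the same instance.
  class PipelineSingleton {
    static instance = null;

    static async getInstance(loadModel) {
      if (this.instance === null) {
        this.instance = await loadModel();
      }
      return this.instance;
    }
  }
  ```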
* Create package-lock.json
* Remove models folder
* Remove step for copying models to local folder
* Add `add_special_tokens` option to tokenizers
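  Conceptually, the new option toggles whether special tokens (e.g. `[CLS]`/`[SEP]`-style markers) are wrapped around the encoded ids. A sketch of that behavior with made-up token ids (real ids come from the tokenizer's vocabulary):

  ```javascript
  // Illustrative only: BOS/EOS ids are placeholders, not real vocab entries.
  const BOS = 101;
  const EOS = 102;

  function encode(ids, { add_special_tokens = true } = {}) {
    // When enabled, the sequence is bracketed by the special tokens;
    // when disabled, the raw ids are returned unchanged.
    return add_special_tokens ? [BOS, ...ids, EOS] : [...ids];
  }
  ```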
* Improve error messages for loading processors
* Add `DonutFeatureExtractor`
* Add `DonutSwinModel` and `MBartForCausalLM` models
* Fix `addPastKeyValues` for `VisionEncoderDecoder` models
* Add `Donut` to list of supported models
* Make encode parameters optional
* Support batched decoder input ids
* Remove unused import
* Add `do_thumbnail` for donut image processing
* Fix `TypeError: decoder_input_ids[i].map is not a function`
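  The error above is the classic symptom of a flat (1D) array reaching code that expects a batch of sequences. A hedged sketch of the normalization that fixes it (the helper name is illustrative, not the library's):

  ```javascript
  // Accept either a single sequence ([1, 2, 3]) or a batch ([[1, 2], [3]])
  // and always return the batched form, so per-sequence `.map` calls work.
  function toBatched(decoder_input_ids) {
    if (decoder_input_ids.length > 0 && !Array.isArray(decoder_input_ids[0])) {
      return [decoder_input_ids]; // wrap a single sequence into a batch of one
    }
    return decoder_input_ids;
  }
  ```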
* Only pad if width and height specified in size
* Only pad if `pad_size` is defined
* Only cut `decoder_input_ids` if past model output
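  The idea behind this fix: once the model has cached past key/values, only the newest token needs to be fed to the decoder; on the first step (no cache yet) the full sequence must be kept. A minimal sketch, with a hypothetical helper name:

  ```javascript
  // With a cache, keep only the last token; without one, keep everything.
  function prepareDecoderInputs(decoder_input_ids, pastKeyValues) {
    if (pastKeyValues !== null) {
      return decoder_input_ids.slice(-1);
    }
    return decoder_input_ids;
  }
  ```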
* Add donut model
* Add example usage to JSDoc for `DonutSwinModel`
* Add support for `DocumentQuestionAnsweringPipeline`
* Add simple document question answering unit test
* Add listed support for document QA pipeline
* Add support for `Blenderbot` models
Closes #37
References #29
* Add support for `BlenderbotTokenizer`
* Add blenderbot to supported models
* Add support for `BlenderbotSmallTokenizer`
* Add custom tests for blenderbot-small
* Add support for `BlenderbotSmall` models
* Update list of supported models
* Improve `addPastKeyValues` function
* Allow skipping of adding encoder past key values
* Cleanup JSDoc
* Store mapping between class and name
* Fix `PretrainedMixin`
* Check seq2seq and vision2seq mappings for possible generate-compatible classes
* Add support for `MinNewTokensLengthLogitsProcessor`
* Add support for `MinLengthLogitsProcessor`
* Fix `generation_config` defaults
* Fix `input_ids_seq_length`
* Add unit tests for generation
* Fix generation parameters test case
* Allow specification of multiple `eos_token_ids`
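  Supporting multiple `eos_token_id`s amounts to treating the stopping check as set membership, with a lone id normalized to a one-element list. A sketch of that criterion (function name is illustrative):

  ```javascript
  // Generation stops when the sampled token matches any configured EOS id.
  function isEos(tokenId, eos_token_id) {
    const ids = Array.isArray(eos_token_id) ? eos_token_id : [eos_token_id];
    return ids.includes(tokenId);
  }
  ```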
* Add `CodeLlamaTokenizer`
* Add `codellama` for testing
* Update default quantization settings
* Refactor `PretrainedModel`
* Remove unnecessary error message
* Update llama-code-tokenizer test
* Add support for `GPTNeoX` models
* Fix `GPTNeoXPreTrainedModel` config
* Add support for `GPTJ` models
* Add support for `WavLM` models
* Update list of supported models
- CodeLlama
- GPT NeoX
- GPT-J
- WavLM
* Add support for `XLM` models
* Add support for `ResNet` models
* Add support for `BeiT` models
* Fix casing of `BeitModel`
* Remove duplicate code
* Update variable name
* Remove `ts-ignore`
* Remove unnecessary duplication
* Update demo model sizes
* [demo] Update default summarization parameters
* Update default quantization parameters for new models
* Remove duplication in mapping
* Update list of supported marian models
* Add support for `CamemBERT` models
* Add support for `MBart` models
* Add support for `OPT` models
* Add `MBartTokenizer` and `MBart50Tokenizer`
* Add example of multilingual translation with MBart models
* Add `CamembertTokenizer`
* Add support for `HerBERT` models
* Add support for `XLMTokenizer`
* Fix `fuse_unk` config
* Do not remove duplicate keys for `Unigram` models
See https://huggingface.co/camembert-base for an example of a Unigram tokenizer that has two tokens with the same value (`<unk>`)
* Update HerBERT supported model text
* Update `generate_tests.py`
* Update list of supported models
* Use enum object instead of classes for model types
Fixes https://github.com/xenova/transformers.js/issues/283
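  Replacing per-type classes with an enum-like object keeps model-type dispatch to plain value comparisons. A sketch of the pattern, with illustrative member names (not necessarily the library's exact ones):

  ```javascript
  // A frozen plain object serves as the enum: cheap to compare, and
  // Object.freeze prevents accidental mutation of the members.
  const ModelType = Object.freeze({
    EncoderOnly: 0,
    EncoderDecoder: 1,
    Seq2Seq: 2,
    DecoderOnly: 3,
  });
  ```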
* Add link to issue
* Update dependencies for unit tests
* Add `sentencepiece` as a testing requirement
* Add `protobuf` to test dependency
* Remove duplicated models to test
* Make `// @ts-ignore` obsolete for `_call` overrides by respecting LSP (Liskov substitution principle)
* Revert change: the value can't be `undefined`; restore previous behavior
* Use `...unused` instead to fix LSP errors
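  The trick here: an override stays signature-compatible with its base class (so no `// @ts-ignore` is needed) by absorbing extra arguments in a rest parameter it never reads. A sketch with hypothetical class names:

  ```javascript
  // Base class accepts variadic arguments in `_call`.
  class Pipeline {
    _call(input, ...args) {
      return { input, extra: args.length };
    }
  }

  // The override keeps the same arity via `...unused`, satisfying the
  // base signature even though the extra arguments are ignored.
  class TextPipeline extends Pipeline {
    _call(input, ...unused) {
      return { input: String(input).trim(), extra: 0 };
    }
  }
  ```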
* Add support for `DeiT` models
* Add `Swin` models for image classification
* Add support for `yolos` models
* Add `YolosFeatureExtractor`
* Remove unused import
* Update list of supported models
* Remove SAM for now
Move SAM support to next release
* Add example code for zero shot image classification
* Add example code for text classification pipeline
* Fix links to custom usage from pipelines docs
Reported on Discord https://discord.com/channels/879548962464493619/1142943169068154950/1142943169068154950
* Fix relative links
* Rename .mdx -> .md
GitHub recently changed how mdx files are displayed, breaking a lot of the formatting. So, we just use .md now (same as transformers)
* Add example code for token classification pipeline
* Add example code for fill-mask pipeline
* Add text2text and summarization pipeline examples
* Add example code for image segmentation pipeline
* Remove redundant `@extends Pipeline`
* Add example code for image-to-text pipeline
* Cleanup example code outputs
* Cleanup JSDoc
* Cleanup pipeline example code
* Update codegen example