1074 Commits

Author SHA1 Message Date
Joshua Lochner d0915294ae [version] Update to 2.7.0 2023-10-23 16:39:27 +02:00
Joshua Lochner 68067de199 Add listed support for speecht5 2023-10-23 16:38:25 +02:00
Joshua Lochner 4a991bd911
Add support for `text-to-speech` (w/ Speecht5) (#345)
* Add vocoder to export

* Add tokenizer.json export for speecht5 models

* Update speecht5 supported models

* Create `SpeechT5Tokenizer`

* Add `ones` and `ones_like` tensor functions

* Add support for speecht5 text-to-speech

* Disambiguate `SpeechSeq2Seq` and `Seq2SeqLM`

* Create `TextToAudioPipeline`

* Add listed support for `text-to-audio` / `text-to-speech`

* Use unquantized vocoder by default

* Skip speecht5 unit tests for now

Due to bug in transformers: https://github.com/huggingface/transformers/issues/26547

* Update example pipeline output

* Create simple in-browser TTS demo

* Add template README

* Delete package-lock.json

* Update required transformers.js version

* Add link to Transformers.js

* Double -> Single quotes

* Add link to text-to-speech demo

* Update sample speaker embeddings
2023-10-23 16:31:46 +02:00
Joshua Lochner 983cf3a246 Update example app dependencies 2023-10-22 10:36:23 +02:00
Joshua Lochner 63b4369ccc
Update example app dependencies (#347)
* Update dependency versions

* Update dev dependency versions
2023-10-22 10:26:00 +02:00
Joshua Lochner 40df3cc003 Update link to hosted pretrained models 2023-10-04 17:59:50 +02:00
Joshua Lochner a45b8441d3
Improve electron example template (#342)
* Update transformers.js version

* Use Singleton object in electron tutorial

* Create package-lock.json

* Remove models folder

* Remove step for copying models to local folder
2023-10-04 17:45:25 +02:00
Joshua Lochner e7b71b8f8f
Add more links to example section (#343) 2023-10-04 17:44:46 +02:00
Joshua Lochner 0efa60ffff
Do not post-process `<` and `>` symbols generated from docs (#335)
* Do not post-process `<` and `>` symbols generated from docs

* Fix typo
2023-10-04 15:43:35 +02:00
Joshua Lochner b6bd608322
Update falcon tokenizer (#344)
* Update generate_tests.py

* Do not add token types for `FalconTokenizer`
2023-10-03 12:35:07 +02:00
Joshua Lochner 5b31129218 [version] Update to 2.6.2 2023-09-27 15:15:09 +02:00
Hermann Rolfes 09cbb0c2e7
Fix more `_call` LSP errors + extra typings (#304)
* Types are only inferred through `this` assignments in the constructor

* Typing PreprocessedImage

* LSP + other typings for ImageFeatureExtractor / SamImageProcessor / DetrFeatureExtractor

* Fix SamProcessor error

* Fix PretrainedOptions

* Fix double AnyTypedArray

* Update `unused` variable name

* Mark `_update` image function as private

* Update processor JSDoc

---------

Co-authored-by: Joshua Lochner <admin@xenova.com>
2023-09-27 15:12:59 +02:00
Joshua Lochner df94965ea2
Use `doc-builder` main (#331) 2023-09-27 00:24:54 +02:00
Joshua Lochner f83d079621
Revert doc builder (#330)
* Update pr-documentation.yml

* Update documentation.yml
2023-09-26 21:31:00 +02:00
Joshua Lochner 4d8a590fb3
Add support for `LongT5` models (#316)
* Create `LongT5` classes

* Update model mappings

* Add `LongT5` to supported list of models

* Update list of supported models
2023-09-26 18:25:12 +02:00
Joshua Lochner d307f270dd
Add support for `DonutSwin` models (#320)
* Add `add_special_tokens` option to tokenizers

* Improve error messages for loading processors

* Add `DonutFeatureExtractor`

* Add `DonutSwinModel` and `MBartForCausalLM` models

* Fix `addPastKeyValues` for `VisionEncoderDecoder` models

* Add `Donut` to list of supported models

* Make encode parameters optional

* Support batched decoder input ids

* Remove unused import

* Add `do_thumbnail` for donut image processing

* Fix `TypeError: decoder_input_ids[i].map is not a function`

* Only pad if width and height specified in size

* Only pad if `pad_size` is defined

* Only cut `decoder_input_ids` if past model output

* Add donut model

* Add example usage to JSDoc for `DonutSwinModel`

* Add support for `DocumentQuestionAnsweringPipeline`

* Add simple document question answering unit test

* Add listed support for document QA pipeline
2023-09-26 17:46:26 +02:00
Joshua Lochner 222d33acf0 Use `transformers==4.33.2` in conversion script 2023-09-23 17:06:27 +02:00
Joshua Lochner 69aa5dff58 Freeze conversion script requirements
Will be upgraded when we switch to `onnxruntime>=1.16`
2023-09-23 16:56:52 +02:00
Joshua Lochner dac6f4b073
Improve example browser extension template (#325)
Removes unnecessary complexity
2023-09-23 03:34:10 +02:00
Joshua Lochner 982498c064
Create in-browser version of `semantic-image-search` example app (#326)
* Create in-browser version of `semantic-image-search`

* Add loading model and database message

* Create README.md

* Add link to example app
2023-09-23 02:54:28 +02:00
Joshua Lochner c367f9d68b
Add support for `Blenderbot` and `BlenderbotSmall` (#292)
* Add support for `Blenderbot` models

Closes #37
References #29

* Add support for `BlenderbotTokenizer`

* Add blenderbot to supported models

* Add support for `BlenderbotSmallTokenizer`

* Add custom tests for blenderbot-small

* Add support for `BlenderbotSmall` models

* Update list of supported models

* Improve `addPastKeyValues` function

* Allow skipping of adding encoder past key values
2023-09-19 13:34:00 +02:00
Joshua Lochner c453e6be32 Add `LLaMA`/`Llama2` to list of supported models 2023-09-19 01:36:36 +02:00
Joshua Lochner b3a2a5b00f [version] Update to 2.6.1 2023-09-18 14:56:06 +02:00
Joshua Lochner c8e55041cb Add vanilla JS example link to README 2023-09-18 14:53:34 +02:00
Joshua Lochner 6f1842a274
Fix issues with minification (#307)
* Cleanup JSDoc

* Store mapping between class and name

* Fix `PretrainedMixin`

* Check seq2seq and vision2seq mappings for possible generate-compatible classes
2023-09-18 01:59:37 +02:00
Joshua Lochner 11f6a08090
Add support for `min_length` and `min_new_tokens` generation parameters (#308)
* Add support for `MinNewTokensLengthLogitsProcessor`

* Add support for `MinLengthLogitsProcessor`

* Fix `generation_config` defaults

* Fix `input_ids_seq_length`

* Add unit tests for generation

* Fix generation parameters test case

* Allow specification of multiple `eos_token_ids`
2023-09-17 23:57:13 +02:00
Per Harald Borgen ef27100553
Add JavaScript tutorial to the docs (#271) 2023-09-17 21:47:09 +02:00
Joshua Lochner 8253dab531
Update node and NPM versions (#294)
node -> 18+
npm -> 9+
2023-09-12 19:17:05 +02:00
Doni Rubiagatra 434dcccac7
[docs] Update minimum node version (16 -> 18) 2023-09-11 13:01:05 +02:00
Joshua Lochner 686add8466 Update supported_models.py 2023-09-10 01:37:59 +02:00
Joshua Lochner d7438ce26d Update conversion script
Use latest version of Optimum (source)
2023-09-10 00:44:00 +02:00
Joshua Lochner 5216fb461d
Fix `ByteLevel` pretokenizer
* Re-enable other whisper tests

* Fix `ByteLevel` pretokenizer

Only add prefix space to first word, when option is enabled.
2023-09-10 00:37:04 +02:00
Joshua Lochner ad7e8758bc [version] Update to 2.6.0 2023-09-08 15:41:59 +02:00
Joshua Lochner 9a3339239e
New models and refactoring (#276)
* Add `CodeLlamaTokenizer`

* Add `codellama` for testing

* Update default quantization settings

* Refactor `PretrainedModel`

* Remove unnecessary error message

* Update llama-code-tokenizer test

* Add support for `GPTNeoX` models

* Fix `GPTNeoXPreTrainedModel` config

* Add support for `GPTJ` models

* Add support for `WavLM` models

* Update list of supported models

- CodeLlama
- GPT NeoX
- GPT-J
- WavLM

* Add support for XLM models

* Add support for `ResNet` models

* Add support for `BeiT` models

* Fix casing of `BeitModel`

* Remove duplicate code

* Update variable name

* Remove `ts-ignore`

* Remove unnecessary duplication

* Update demo model sizes

* [demo] Update default summarization parameters

* Update default quantization parameters for new models

* Remove duplication in mapping

* Update list of supported marian models

* Add support for `CamemBERT` models

* Add support for `MBart` models

* Add support for `OPT` models

* Add `MBartTokenizer` and `MBart50Tokenizer`

* Add example of multilingual translation with MBart models

* Add `CamembertTokenizer`

* Add support for `HerBERT` models

* Add support for `XLMTokenizer`

* Fix `fuse_unk` config

* Do not remove duplicate keys for `Unigram` models

See https://huggingface.co/camembert-base for an example of a Unigram tokenizer that has two tokens with the same value (`<unk>`)

* Update HerBERT supported model text

* Update generate_tests.py

* Update list of supported models

* Use enum object instead of classes for model types

Fixes https://github.com/xenova/transformers.js/issues/283

* Add link to issue

* Update dependencies for unit tests

* Add `sentencepiece` as a testing requirement

* Add `protobuf` to test dependency

* Remove duplicated models to test
2023-09-08 15:17:05 +02:00
Joshua Lochner 109a7f9711 Fix unit test 2023-09-04 23:53:05 +02:00
Joshua Lochner dbea8a2990 Update to `checkout@v4`
See https://github.com/actions/checkout/issues/1448 for more info.
2023-09-04 23:20:57 +02:00
Hermann Rolfes 1488079f81
Make // @ts-ignore obsolete for _call overrides by respecting LSP (#278)
* Make // @ts-ignore obsolete for _call overrides by respecting LSP

* oops can't be undefined, back to how it was

* Use `...unused` instead to fix LSP errors
2023-09-04 23:06:44 +02:00
Joshua Lochner 57f2b5cd17
Add support for MPT models (Fixes #166) (#272)
* Add support for MPT models

* Fix `use_cache_branch`

* Update list of supported models
2023-09-02 22:17:01 +02:00
Joshua Lochner 96b9143b33 Update masked-lm tests 2023-09-02 03:47:06 +02:00
Joshua Lochner 9077c21540
Add support for BLOOM models (#273)
* Add support for Bloom models

* Update `BloomTokenizer` to fix the default (invalid) regex

* Update supported models

* Update default quantization settings for bloom models

* Fix `use_cache_branch`
2023-09-01 22:07:04 +02:00
Joshua Lochner 62159eb383 Fix `CustomWhisperOnnxConfig` 2023-09-01 16:14:49 +02:00
Joshua Lochner 0c2dcc7498 [version] Update to 2.5.4 2023-08-28 20:07:06 +02:00
Joshua Lochner 09cf91abd0
Add `DeiT`, `Swin`, and `Yolos` vision models (#262)
* Add support for `DeiT` models

* Add `Swin` models for image classification

* Add support for `yolos` models

* Add `YolosFeatureExtractor`

* Remove unused import

* Update list of supported models

* Remove SAM for now

Move SAM support to next release
2023-08-28 17:29:15 +02:00
Joshua Lochner f0573175fd Add `DeiTFeatureExtractor` 2023-08-26 23:54:27 +02:00
Per Harald Borgen 76b8556110
Rename how-to guides to developer guides (#261) 2023-08-25 17:56:18 +02:00
Joshua Lochner 7076c8e401 [version] Update to 2.5.3 2023-08-22 23:31:00 +02:00
josephrocca 9bb6923242
[docs] Add links and compatible models to supported tasks table (#257) 2023-08-22 23:19:48 +02:00
Joshua Lochner 3fab8265cb
Update whisper unit test (#258) 2023-08-22 22:18:17 +02:00
Joshua Lochner 9c449c151c
Fix caching for LFS files from the Hugging Face Hub (#251)
* Fix model caching for LFS files from the HF Hub

* Ignore local model check on demo site
2023-08-22 18:28:37 +02:00
Joshua Lochner f61cc66e0e Fix link to API reference 2023-08-22 17:19:49 +02:00