Tim Miller
ab633089e9
Merge branch 'master' into embeddings-semantickernel
2023-09-05 13:53:24 +09:00
Yaohui Liu
18294a725e
refactor: remove old version files.
2023-09-02 22:24:07 +08:00
Tim Miller
94a395240a
Bump example, readme
2023-09-02 14:21:02 +09:00
Tim Miller
bea1ca9f8d
Update
2023-09-02 13:54:38 +09:00
Tim Miller
35266d8112
Remove MemorySkill sample
2023-09-02 13:41:59 +09:00
Tim Miller
521f068d64
Add Embedding for Semantic Kernel
2023-09-01 22:42:13 +09:00
Rinne
1533ee7dbf
Merge pull request #138 from drasticactions/semantic-kernel
...
Enable Semantic kernel support
2023-09-01 20:50:46 +08:00
Tim Miller
d4a57fffef
README, Cleanup
2023-09-01 10:03:34 +09:00
Tim Miller
2bde188c64
Change Namespace
2023-08-31 22:22:38 +09:00
Tim Miller
a81edacbfb
Remove embedding for now
2023-08-31 18:26:03 +09:00
Tim Miller
98bfbe1d50
Update
2023-08-31 18:24:07 +09:00
Tim Miller
9a1d6f99f2
Add Semantic Kernel support
2023-08-31 17:24:44 +09:00
Martin Evans
a70c7170dd
- Created a higher level `Grammar` class which is immutable and contains a list of grammar rules. This is the main "entry point" to the grammar system.
...
- Made all the mechanics of grammar parsing (GBNFGrammarParser, ParseState) internal. Just call `Grammar.Parse("whatever")`.
- Added a `GrammarRule` class which validates elements on construction (this allows constructing grammar without parsing GBNF).
- It should be impossible for a `GrammarRule` to represent an invalid rule.
2023-08-31 00:02:50 +01:00
Mihai
71f02e08c2
Add grammar example
2023-08-30 17:24:39 +03:00
Martin Evans
5a6c6de0dc
Merge pull request #115 from martindevans/model_params_record
...
ModelsParams record class
2023-08-24 22:54:23 +01:00
Martin Evans
829f32b27d
- Added `Obsolete` attributes to the entire `OldVersion` namespace, so they can be removed in the future
...
- Minor changes to cleanup some of the compiler warnings
2023-08-24 00:59:32 +01:00
Martin Evans
29df14cd9c
Converted ModelParams into a `record` class. This has several advantages:
...
- Equality, hashing etc all implemented automatically
- Default values are defined in just one place (the properties) instead of the constructor as well
- Added test to ensure that serialization works properly
2023-08-23 00:58:25 +01:00
Martin Evans
2830e5755c
- Applied a lot of minor R# code quality suggestions. Lots of unnecessary imports removed.
...
- Deleted `NativeInfo` (internal class, not used anywhere)
2023-08-22 23:20:13 +01:00
Martin Evans
a45d9089e1
Fixed demos
2023-08-22 01:32:53 +01:00
Martin Evans
a9e6f21ab8
- Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring!
...
- Passing encoding in the `IModelParams`, which reduces how often encoding needs to be passed around
2023-08-22 01:30:13 +01:00
Martin Evans
ae8ef17a4a
- Added various convenience overloads to `LLamaContext.Eval`
...
- Converted `SafeLLamaContextHandle` to take a `ReadOnlySpan` for Eval, narrower type better represents what's really needed
2023-08-22 01:28:28 +01:00
Martin Evans
02a46fc363
Updated demos to use the new loading/multi context system
2023-08-17 23:26:20 +01:00
Martin Evans
e2fe08a9a2
Added a higher level `LLamaWeights` wrapper around `SafeLlamaModelHandle`
2023-08-13 01:10:08 +01:00
Martin Evans
f31bdf6b93
Using the right context for Bob
2023-08-13 01:10:08 +01:00
Martin Evans
f3511e390f
WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving.
...
The biggest single change is renaming `LLamaModel` to `LLamaContext`
2023-08-13 01:10:08 +01:00
Rinne
0269af8c17
Merge branch 'master' into runtime-targets
2023-06-21 16:02:19 +08:00
Yaohui Liu
1062fe1a7e
feat: upgrade the native libraries.
2023-06-21 15:21:27 +08:00
Yaohui Liu
eed96248b5
docs: add the documentations with mkdocs.
2023-06-20 01:53:46 +08:00
Yaohui Liu
2eb2d6df83
test: add 9 examples of the new version.
2023-06-19 22:09:58 +08:00
Tim Miller
bbd2650cf0
Include runtime targets file
2023-06-19 18:44:51 +09:00
Yaohui Liu
f3565d6b2d
refactor: rename Quantizer to LLamaQuantizer.
2023-06-19 02:54:55 +08:00
Yaohui Liu
a3b8186f20
feat: support save and load chat session.
2023-06-12 18:31:37 +08:00
Yaohui Liu
bdbd6aa824
feat: add transforms for chat session.
2023-06-12 18:07:41 +08:00
Yaohui Liu
3bf74ec9b9
feat: add chat session for refactored code.
2023-06-12 02:47:25 +08:00
Yaohui Liu
908b79e855
feat: add stateless executor.
2023-06-11 22:39:31 +08:00
Yaohui Liu
e603a09137
fix: state loading and saving not working.
2023-06-11 09:13:30 +08:00
Yaohui Liu
5679e08718
feat: add ILLamaExecutor.InferAsync.
2023-06-11 05:44:21 +08:00
Yaohui Liu
aaa0cba9f8
test: update the examples after refactoring.
2023-06-10 18:54:26 +08:00
Yaohui Liu
a545ab7eda
test: remove dumplicated space of example output.
2023-05-22 21:28:40 +08:00
Yaohui Liu
56c56b9c51
refactor: drop LLamaModelV1.
2023-05-21 20:40:54 +08:00
Yaohui Liu
18c2ff2395
refactor: instruct mode and examples.
2023-05-21 20:36:49 +08:00
Yaohui Liu
6ffcb5306b
refactor: use official api of quantization instead.
2023-05-13 15:02:19 +08:00
Yaohui Liu
0958bbac2c
feat: add get-embedding api to LLamaModel.
2023-05-13 02:08:03 +08:00
Haiping Chen
21c36cbf80
Added WebAPI.
2023-05-11 21:45:34 -05:00
Yaohui Liu
33067f990f
feat: run quantization in csharp.
2023-05-11 17:38:28 +08:00
Yaohui Liu
648b14fd0f
docs: revise examples.
2023-05-11 13:59:38 +08:00
Yaohui Liu
856d6549de
build: add linux support.
2023-05-11 04:20:56 +08:00
Yaohui Liu
8978410eab
docs: add examples for running LLamaSharp.
2023-05-11 03:21:55 +08:00