Commit Graph

65 Commits

Author SHA1 Message Date
xbotter ab83016fb4
chore: update semantic kernel examples 2023-10-20 10:24:40 +08:00
Martin Evans 328022b13d Fixed merge conflicts 2023-10-19 21:14:45 +01:00
sa_ddam213 b4b4000342
Merge branch 'master' into upstream_master
# Conflicts:
#	LLama.Web/Common/ModelOptions.cs
#	LLama.Web/Services/ConnectionSessionService.cs
#	LLama/LLamaStatelessExecutor.cs
#	LLama/LLamaWeights.cs
2023-10-20 08:02:27 +13:00
Martin Evans e89ca5cc17 Fixed a few minor warnings 2023-10-19 00:43:50 +01:00
Martin Evans d8434ea9d6
Merge pull request #185 from martindevans/wip_major_api_change
Major llama.cpp API Change
2023-10-18 20:50:32 +01:00
Daniel Vaughan f64a54c9c8 Support SemanticKernel 1.0.0-beta1 2023-10-13 13:56:21 +02:00
sa_ddam213 9b8de007dc Propagate ILogger 2023-10-04 13:47:08 +13:00
Martin Evans 669ae47ef7 - Split parameters into two interfaces
- params contains a list of loras, instead of just one
2023-09-30 16:21:18 +01:00
Martin Evans bca55eace0 Initial changes to match the llama.cpp changes 2023-09-29 01:18:21 +01:00
Martin Evans b7379b7124 Moved spinner out to an extension, so it can easily be used in other examples 2023-09-24 19:24:52 +01:00
Martin Evans 08f1615e60 - Converted LLamaStatelessExecutor to run `Exec` calls inside an awaited task. This unblocks async callers while the model is being evaluated.
- Added a "spinner" to the `StatelessModeExecute` demo, which spins while waiting for the next token (demonstrating that it's not blocked).
2023-09-23 15:22:57 +01:00
Haiping Chen 45426aefa9 Fix CodingAssistant InferAsync. 2023-09-17 10:33:31 -05:00
Haiping 9f9903c711
Merge pull request #173 from martindevans/async_only
Remove non-async
2023-09-17 10:19:16 -05:00
Martin Evans 3f80190f85 Minimal changes required to remove non-async inference. 2023-09-14 21:04:14 +01:00
Regenhardt Marlon 0049dcfb66 Coding Assistent: Improve user information and link to source paper 2023-09-13 17:04:08 +02:00
Regenhardt Marlon 71f4e2ae4a Coding Assistent: Use GGUF model 2023-09-13 15:45:44 +02:00
Marlon Regenhardt 2386be58c4 Example: Add code assistant using Code Llama-7b-Instruct as default 2023-09-13 15:21:06 +02:00
Tim Miller ab633089e9 Merge branch 'master' into embeddings-semantickernel 2023-09-05 13:53:24 +09:00
Yaohui Liu 18294a725e
refactor: remove old version files. 2023-09-02 22:24:07 +08:00
Tim Miller 94a395240a Bump example, readme 2023-09-02 14:21:02 +09:00
Tim Miller bea1ca9f8d Update 2023-09-02 13:54:38 +09:00
Tim Miller 35266d8112 Remove MemorySkill sample 2023-09-02 13:41:59 +09:00
Tim Miller 521f068d64 Add Embedding for Semantic Kernel 2023-09-01 22:42:13 +09:00
Rinne 1533ee7dbf
Merge pull request #138 from drasticactions/semantic-kernel
Enable Semantic kernel support
2023-09-01 20:50:46 +08:00
Tim Miller d4a57fffef README, Cleanup 2023-09-01 10:03:34 +09:00
Tim Miller 2bde188c64 Change Namespace 2023-08-31 22:22:38 +09:00
Tim Miller a81edacbfb Remove embedding for now 2023-08-31 18:26:03 +09:00
Tim Miller 98bfbe1d50 Update 2023-08-31 18:24:07 +09:00
Tim Miller 9a1d6f99f2 Add Semantic Kernel support 2023-08-31 17:24:44 +09:00
Martin Evans a70c7170dd - Created a higher level `Grammar` class which is immutable and contains a list of grammar rules. This is the main "entry point" to the grammar system.
- Made all the mechanics of grammar parsing (GBNFGrammarParser, ParseState) internal. Just call `Grammar.Parse("whatever")`.
 - Added a `GrammarRule` class which validates elements on construction (this allows constructing grammar without parsing GBNF).
   - It should be impossible for a `GrammarRule` to represent an invalid rule.
2023-08-31 00:02:50 +01:00
Mihai 71f02e08c2 Add grammar example 2023-08-30 17:24:39 +03:00
Martin Evans 5a6c6de0dc
Merge pull request #115 from martindevans/model_params_record
ModelsParams record class
2023-08-24 22:54:23 +01:00
Martin Evans 829f32b27d - Added `Obsolete` attributes to the entire `OldVersion` namespace, so they can be removed in the future
- Minor changes to cleanup some of the compiler warnings
2023-08-24 00:59:32 +01:00
Martin Evans 29df14cd9c Converted ModelParams into a `record` class. This has several advantages:
- Equality, hashing etc all implemented automatically
 - Default values are defined in just one place (the properties) instead of the constructor as well
 - Added test to ensure that serialization works properly
2023-08-23 00:58:25 +01:00
Martin Evans 2830e5755c - Applied a lot of minor R# code quality suggestions. Lots of unnecessary imports removed.
- Deleted `NativeInfo` (internal class, not used anywhere)
2023-08-22 23:20:13 +01:00
Martin Evans a45d9089e1 Fixed demos 2023-08-22 01:32:53 +01:00
Martin Evans a9e6f21ab8 - Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring!
- Passing encoding in the `IModelParams`, which reduces how often encoding needs to be passed around
2023-08-22 01:30:13 +01:00
Martin Evans ae8ef17a4a - Added various convenience overloads to `LLamaContext.Eval`
- Converted `SafeLLamaContextHandle` to take a `ReadOnlySpan` for Eval, narrower type better represents what's really needed
2023-08-22 01:28:28 +01:00
Martin Evans 02a46fc363 Updated demos to use the new loading/multi context system 2023-08-17 23:26:20 +01:00
Martin Evans e2fe08a9a2 Added a higher level `LLamaWeights` wrapper around `SafeLlamaModelHandle` 2023-08-13 01:10:08 +01:00
Martin Evans f31bdf6b93 Using the right context for Bob 2023-08-13 01:10:08 +01:00
Martin Evans f3511e390f WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving.
The biggest single change is renaming `LLamaModel` to `LLamaContext`
2023-08-13 01:10:08 +01:00
Rinne 0269af8c17
Merge branch 'master' into runtime-targets 2023-06-21 16:02:19 +08:00
Yaohui Liu 1062fe1a7e
feat: upgrade the native libraries. 2023-06-21 15:21:27 +08:00
Yaohui Liu eed96248b5
docs: add the documentations with mkdocs. 2023-06-20 01:53:46 +08:00
Yaohui Liu 2eb2d6df83
test: add 9 examples of the new version. 2023-06-19 22:09:58 +08:00
Tim Miller bbd2650cf0 Include runtime targets file 2023-06-19 18:44:51 +09:00
Yaohui Liu f3565d6b2d
refactor: rename Quantizer to LLamaQuantizer. 2023-06-19 02:54:55 +08:00
Yaohui Liu a3b8186f20
feat: support save and load chat session. 2023-06-12 18:31:37 +08:00
Yaohui Liu bdbd6aa824
feat: add transforms for chat session. 2023-06-12 18:07:41 +08:00