LLamaSharp/LLama
Martin Evans 3b0b2ab224
Merge pull request #721 from martindevans/kv_cache_view
Make `LLamaKvCacheView` Safe
2024-05-10 15:19:36 +01:00
Abstractions fix: typos. 2024-04-29 18:19:20 +08:00
Batched Removed (marked as obsolete) prompting with a string for `Conversation`. Tokenization requires extra parameters (e.g. addBos, special) which require special consideration. For now it's better to tokenize using other tools and pass the tokens directly. 2024-05-06 15:53:21 +01:00
Common fix: typos. 2024-04-29 18:19:20 +08:00
Exceptions Swapped `StatelessExecutor` to use `llama_decode`! 2024-01-20 21:18:35 +00:00
Extensions fix: typos. 2024-04-29 18:19:20 +08:00
Grammars Code cleanup driven by R# suggestions: 2024-01-02 03:20:21 +00:00
Native Merge pull request #721 from martindevans/kv_cache_view 2024-05-10 15:19:36 +01:00
Sampling April 2024 Binary Update (#662) 2024-04-16 23:19:47 +01:00
runtimes April 2024 Binary Update (#662) 2024-04-16 23:19:47 +01:00
AntipromptProcessor.cs Make AntipromptProcessor public 2023-11-14 21:41:12 +13:00
AssemblyAttributes.cs - Created a higher level `Grammar` class which is immutable and contains a list of grammar rules. This is the main "entry point" to the grammar system. 2023-08-31 00:02:50 +01:00
ChatSession.cs InitializeSessionFromHistoryAsync changed 2024-04-30 02:32:14 +03:00
GlobalSuppressions.cs Logging interceptor (#649) 2024-04-05 16:42:27 +01:00
LLamaContext.cs Add method to get BOS token. 2024-05-02 23:29:33 -06:00
LLamaEmbedder.cs - Added tests for generating embeddings with generative model and embedding model 2024-04-19 16:30:32 +01:00
LLamaExecutorBase.cs Refactor executors 2024-05-02 23:30:16 -06:00
LLamaInstructExecutor.cs Refactor executors 2024-05-02 23:30:16 -06:00
LLamaInteractExecutor.cs Refactor executors 2024-05-02 23:30:16 -06:00
LLamaQuantizer.cs Modified `llama_model_quantize` to accept argument by `ref` instead of pointer. 2024-04-26 01:35:13 +01:00
LLamaSharp.Runtime.targets fix: add cuda llava native libraries. 2024-04-04 00:47:33 +08:00
LLamaSharp.csproj release: update release info of packages. 2024-04-06 14:20:36 +08:00
LLamaStatelessExecutor.cs Refactor executors 2024-05-02 23:30:16 -06:00
LLamaTemplate.cs Split template out to a field, so it can be changed more easily. 2024-05-10 00:23:57 +01:00
LLamaTransforms.cs Add LLaMA3 chat session example. 2024-04-29 04:12:19 +08:00
LLamaWeights.cs Added optional `IProgress` parameter to `LoadFromFileAsync` 2024-04-27 15:04:54 +01:00
LLavaWeights.cs - Added `LoadFromFileAsync` method for `LLavaWeights` 2024-04-27 23:31:07 +01:00
StreamingTokenDecoder.cs Assorted small changes to clean up some code warnings 2024-02-17 23:07:10 +00:00
Usings.cs - Added `LLamaTemplate` which efficiently formats a series of messages according to the model template. 2024-05-06 23:43:45 +01:00