899 Commits

Author SHA1 Message Date
Martin Evans 95b9e4d2c7
Merge pull request #385 from martindevans/partial_fix_metadata_override
Metadata Fixes
2023-12-24 22:25:08 +00:00
Martin Evans 889d99bfe2
Merge pull request #386 from edgett/patch-1
Update compile.yml to fix not building for windows
2023-12-24 22:24:57 +00:00
edgett b6dec83ca9
Update compile.yml to fix not building for windows
This fixes the build step: Compile (Windows) (avx512, -DLLAMA_AVX512=ON -LLAMA_AVX512_VBMI=ON -DLLAMA_AVX512_VNNI=ON)

There is a typo in the cmake command: `cmake .. -DLLAMA_NATIVE=OFF -DLLAMA_BUILD_TESTS=OFF -DLLAMA_BUILD_EXAMPLES=OFF -DLLAMA_BUILD_SERVER=OFF -DBUILD_SHARED_LIBS=ON -DLLAMA_AVX512=ON -LLAMA_AVX512_VBMI=ON -DLLAMA_AVX512_VNNI=ON
  cmake --build . --config Release -j ${env:NUMBER_OF_PROCESSORS}`

-LLAMA_AVX512_VBMI
Should be
-DLLAMA_AVX512_VBMI
2023-12-24 17:20:46 -05:00
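
The typo described in the commit above (a cache definition written as `-LLAMA_AVX512_VBMI=ON` instead of `-DLLAMA_AVX512_VBMI=ON`) is the kind of mistake a small lint step could catch. The following is a minimal sketch, not part of the repository: the function names `find_missing_define_prefix` and `fix_missing_define_prefix` are hypothetical, and the heuristic assumes that any token of the form `-NAME=VALUE` with an upper-case name not starting with `D` was meant to be a `-D` definition.

```python
import re
import shlex
from typing import List

# Heuristic (an assumption, not a CMake rule): a token made of a single dash
# followed directly by an upper-case NAME=VALUE pair, where NAME does not start
# with "D", was probably meant to be a -D cache definition.
_SUSPECT = re.compile(r"^-(?!D)([A-Z][A-Z0-9_]*)=(.+)$")


def find_missing_define_prefix(command: str) -> List[str]:
    """Return tokens that look like cache definitions but lack the -D prefix."""
    return [tok for tok in shlex.split(command) if _SUSPECT.match(tok)]


def fix_missing_define_prefix(command: str) -> str:
    """Rewrite suspect tokens so that -NAME=VALUE becomes -DNAME=VALUE."""
    tokens = shlex.split(command)
    fixed = [_SUSPECT.sub(r"-D\1=\2", tok) for tok in tokens]
    return shlex.join(fixed)


if __name__ == "__main__":
    cmd = (
        "cmake .. -DLLAMA_NATIVE=OFF -DBUILD_SHARED_LIBS=ON "
        "-DLLAMA_AVX512=ON -LLAMA_AVX512_VBMI=ON -DLLAMA_AVX512_VNNI=ON"
    )
    print(find_missing_define_prefix(cmd))
    print(fix_missing_define_prefix(cmd))
```

The check cannot detect a missing prefix on a variable whose name itself begins with `D`, since such a token is indistinguishable from a valid `-D` definition.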
Martin Evans 2cded1b296 Fixed alignment of value fields in `LLamaModelMetadataOverride` 2023-12-24 22:02:22 +00:00
Martin Evans 3fc0f34cbe Fixed some issues which were causing metadata overrides not to work (most importantly, converting the key was failing, so all keys were null bytes and thus ignored). 2023-12-24 21:23:37 +00:00
Martin Evans 82baee7133
Merge pull request #384 from martindevans/fixed_metadata_large_values
Fixed loading of very large metadata values
2023-12-24 20:09:42 +00:00
Martin Evans 6be3f62321 Fixed loading of very large metadata values (over 1kb) 2023-12-24 20:01:53 +00:00
Martin Evans f0d7468b22
Merge pull request #356 from xbotter/deps/sk-rc3
bump sk to 1.0.1 & km to 0.18
2023-12-21 17:24:19 +00:00
Martin Evans bf6fc38b58
Merge pull request #377 from scotmcc/master
Updated & Fixed WebAPI
2023-12-21 16:04:33 +00:00
Martin Evans 9b1ff0ba4e
Merge pull request #380 from martindevans/LLamaWeights.Metadata_Property
Added `LLamaWeights.Metadata` property
2023-12-21 14:56:28 +00:00
xbotter 211ce12bf5
LLamaEmbedder exposes the Context 2023-12-21 10:28:37 +08:00
Martin Evans fb606c2488 Fixed incorrect values 2023-12-20 16:30:28 +00:00
Martin Evans 47e4fcef2a Fixed GetString on netstandard2 2023-12-20 16:05:54 +00:00
Martin Evans 2a1e1b6183 Removed unused imports 2023-12-20 15:47:09 +00:00
Martin Evans a2bae178fa Added a `Metadata` property to `LLamaWeights` 2023-12-20 15:45:24 +00:00
xbotter 40ac944fb5
Bump sk to 1.0.1 2023-12-19 08:42:01 +08:00
Scot McConnaughay 2abc2ecfca Fixed WebAPI
Upgraded to .NET8.0
Fixed logging (replaced Console with ILogger)
Fixed warnings (Null Strings, etc).

The application now returns data from the back-end chat services.
2023-12-18 09:03:48 -07:00
Rinne 4635185767
Merge pull request #378 from asmirnov82/fix_docs
Fix broken references in docs
2023-12-18 23:38:22 +08:00
Aleksei Smirnov 1c88bd5a74 fix broken references in docs (after commit "docs: update the docs to follow new version") 2023-12-18 15:53:15 +03:00
Martin Evans 7139281e4d
Merge pull request #373 from martindevans/enhanced_avx512
Improved support for AVX512
2023-12-16 15:52:32 +00:00
Martin Evans a0232a8534
Merge pull request #372 from martindevans/custom_pipeline_stateless_reset
Reset Custom Sampling Pipeline
2023-12-16 15:52:03 +00:00
Martin Evans 1b13f7c717 Improved support for AVX512:
- Enabled more features in the build process (VBMI and VNNI)
- Added runtime checking for these features
- Improved runtime checking to no longer require dotnet8.0
2023-12-16 15:46:28 +00:00
Martin Evans 82d84afaea Resetting the custom sampling pipeline in the stateless executor 2023-12-16 14:48:40 +00:00
Martin Evans c298ab828a
Merge pull request #368 from martindevans/context_set_seed
Context Set Seed
2023-12-15 23:42:47 +00:00
Martin Evans dc8e5d88f7
Update LLama/LLamaContext.cs 2023-12-15 23:14:39 +00:00
Martin Evans a3177ab140
Merge pull request #369 from martindevans/rename_llama_sample_temperature
Renamed `llama_sample_temperature` to `llama_sample_temp`
2023-12-15 23:13:45 +00:00
Martin Evans cbc4c8d9af
Merge pull request #370 from martindevans/copy_grammar
Clone Grammar
2023-12-15 23:10:11 +00:00
Martin Evans db7ecf5a43 Added a method to create a clone of a grammar instance 2023-12-15 23:01:05 +00:00
Martin Evans ea523d2e2a Renamed `llama_sample_temperature` to `llama_sample_temp`, mirroring the same change made in llama.cpp 2023-12-15 22:58:26 +00:00
Martin Evans 2df3e7617e Added a method to set the RNG seed on the context 2023-12-15 22:55:04 +00:00
Martin Evans 8494fd5769
Merge pull request #367 from martindevans/fix_quantisation_pure_field
Added missing field to LLamaModelQuantizeParams
2023-12-15 22:53:10 +00:00
Martin Evans cedef5e45a Added the `pure` field to `LLamaModelQuantizeParams` (it's been added to llama.cpp) 2023-12-15 22:36:58 +00:00
Martin Evans 6650e683f4
Merge pull request #361 from martindevans/update_binaries_dec_2023
Updated Binaries December 2023
2023-12-15 22:19:16 +00:00
Martin Evans 199b4418c3 Fixed JSON serialization for ModelParams 2023-12-14 20:24:22 +00:00
Martin Evans 6128c18824
Merge pull request #366 from asmirnov82/fix_code_in_readme
Fix documentation to reflect changes in ChatSession API
2023-12-14 20:04:50 +00:00
Aleksei Smirnov 8558d109b4 Fix documentation according to the changes in newer versions 2023-12-14 22:35:21 +03:00
Martin Evans 2f0deeadcd Implemented serialization for `MetadataOverride`. Deserialization is broken (converter is never called) 2023-12-14 16:11:31 +00:00
Martin Evans b868b056f7 Added metadata overrides to `IModelParams` 2023-12-14 02:05:40 +00:00
xbotter 8766fb1b03
Merge branch 'deps/sk-rc3' of https://github.com/xbotter/LLamaSharp into deps/sk-rc3 2023-12-14 09:47:37 +08:00
xbotter 213b4be723
bump sk-1.0.0-rc4 2023-12-14 09:47:32 +08:00
xbotter ce20b30e06
Merge branch 'SciSharp:master' into deps/sk-rc3 2023-12-14 09:28:06 +08:00
Martin Evans b22d8b7495 - Added `GroupDisposable` to dispose a collection of items all together
- Renamed `LLamaModelKvOverride` to `LLamaModelMetadataOverride`
2023-12-14 01:23:45 +00:00
Martin Evans 5ad2cd1d3c Added a comment on the type itself 2023-12-14 01:23:44 +00:00
Martin Evans b0270b5788 Added comments on GGMLType 2023-12-14 01:23:44 +00:00
Martin Evans b3e576608b fixed safe handle 2023-12-14 01:23:44 +00:00
Martin Evans bab6b65b61 Added a safe handle for LLamaKvCacheView 2023-12-14 01:23:44 +00:00
Martin Evans 439d14a061 Updated binaries:
- build run: https://github.com/SciSharp/LLamaSharp/actions/runs/7196891440
- commit: 9fb13f9584
2023-12-14 01:23:43 +00:00
Martin Evans 9813a30193
Merge pull request #358 from xbotter/unittest-net8
Upgrade unittest target framework to .NET 8.0
2023-12-14 01:22:57 +00:00
xbotter 340bbbcf48
Move JSON converter for TensorSplitsCollection 2023-12-14 09:10:31 +08:00
Martin Evans 728f7077d8
Merge pull request #360 from martindevans/macos_avx
Added AVX and AVX2 to MacOS x86_64 builds
2023-12-13 14:47:26 +00:00