Commit Graph

1141 Commits

Author SHA1 Message Date
Martin Evans 0592164dc3
Merge pull request #489 from jasoncouture/clblast_nuget
Create nuspec for OpenCL
2024-02-05 19:56:57 +00:00
Jason Couture 1f45bae2cf
Update compile.yml
Fix SHA256 hash path
2024-02-05 13:35:40 -05:00
Jason Couture c963b051e2 Add nuspec for OpenCL (CLBLAST) 2024-02-05 12:21:07 -05:00
Martin Evans d468df08d6
Merge pull request #487 from jasoncouture/clblast_linux
CLBlast for linux
2024-02-05 16:17:50 +00:00
Martin Evans d4f3f642c3
Merge pull request #488 from jasoncouture/concurrency_limit
Only allow one build in parallel per ref
2024-02-05 14:33:53 +00:00
Jason Couture bfe3ad50aa Only allow one build in parallel per ref 2024-02-05 06:07:46 -05:00
Jason Couture f7a6eaa49f Cancel previous builds when a new build is started, due to how expensive this build is. 2024-02-05 06:06:11 -05:00
Jason Couture 277175af4d CLBlast for linux
This builds CLBLAST support for linux, and makes sure to copy the
clblast shared library on both windows and linux to the artifacts.
2024-02-05 06:06:11 -05:00
Martin Evans 7dbaed2d3b
Update README.md 2024-02-05 00:25:40 +00:00
Martin Evans dfac029dde
Merge pull request #485 from zsogitbe/master
KernelMemory EmbeddingMode bug correction
2024-02-04 18:26:28 +00:00
Zoli Somogyi f578fcafa3 KernelMemory EmbeddingMode bug correction 2024-02-04 08:37:08 +01:00
Martin Evans 765c697f77 Fixed number type 2024-02-01 19:40:34 +00:00
Martin Evans b2e815d51e Updated all binaries (from this run: https://github.com/SciSharp/LLamaSharp/actions/runs/7746303349) 2024-02-01 19:34:37 +00:00
Martin Evans 2323988cc7
Merge pull request #478 from martindevans/fixed_artifact_paths
Updated download-artifact to v4
2024-02-01 19:01:27 +00:00
Martin Evans b5674ead97 Updated download-artifact to v4 2024-02-01 19:00:54 +00:00
Martin Evans 15a98b36d8 Updated everything to work with llama.cpp ce32060198b7e2d6a13a9b8e1e1369e3c295ae2a 2024-02-01 16:35:05 +00:00
Martin Evans 48798837fb
Merge pull request #477 from martindevans/updated_cuda_toolkit
Updated compile.yml to use `Jimver/cuda-toolkit@v0.2.14`
2024-02-01 14:41:16 +00:00
Martin Evans 2df7e35c81 Updated compile.yml to use `Jimver/cuda-toolkit@v0.2.14` 2024-02-01 14:40:49 +00:00
Martin Evans 72687ca230
Merge pull request #476 from martindevans/upload_artifact_v4
Updated compile.yml to use `actions/upload-artifact@v4`
2024-02-01 14:32:13 +00:00
Martin Evans d8ed875a2a Updated compile.yml to use `actions/upload-artifact@v4` 2024-02-01 14:31:49 +00:00
Martin Evans faa342ca13
Merge pull request #475 from martindevans/checkout_v4
Updated compile.yml to use checkout v4
2024-02-01 14:27:09 +00:00
Martin Evans ae40d993c4 Updated compile.yml to use checkout v4 2024-02-01 14:26:32 +00:00
Martin Evans 3523c51fa3
Merge pull request #474 from martindevans/embeddings_generator_decode
Swapped `GetEmbeddings` to `llama_decode`
2024-01-31 21:03:04 +00:00
Martin Evans c9c8cd0d62 - Swapped embeddings generator to use `llama_decode`
- Modified `GetEmbeddings` method to be async
2024-01-31 20:28:53 +00:00
Martin Evans 3b08874bec
Merge pull request #468 from jasoncouture/clblast
Add CLBLAST native library to native libraries build
2024-01-31 20:05:47 +00:00
Martin Evans 22aba9a671
Merge pull request #473 from martindevans/base_handle_removed_constructor
Removed `SafeLLamaHandleBase` Constructor
2024-01-31 18:58:12 +00:00
Martin Evans 2488f74bbd
Merge pull request #472 from martindevans/remove_params_interface_set
Removed `IModelParams` and `IContextParams` setters.
2024-01-31 18:57:49 +00:00
Martin Evans 5da2a2f64b - Removed one of the constructors of `SafeLLamaHandleBase`, which implicitly states that memory is owned. Better to be explicit about this kind of thing!
- Also fixed `ToString()` in `SafeLLamaHandleBase`
2024-01-31 18:01:03 +00:00
Martin Evans 9b995510d6 Removed all setters in `IModelParams` and `IContextParams`, allowing implementations to be immutable. 2024-01-31 17:51:50 +00:00
Martin Evans f9a9aaabca
Merge pull request #471 from jasoncouture/master
Fix incorrect event input variable name
2024-01-30 22:24:06 +00:00
Martin Evans 96d7d37f80
Merge pull request #469 from jasoncouture/library_name_fix
Fix missing library name prefix for cuda
2024-01-30 22:22:27 +00:00
Jason Couture 52a85c35e6 Add missing CMAKE prefix 2024-01-30 13:59:26 -05:00
Jason Couture 689ddf0d08 Add missing T to defines for CLBLAST 2024-01-30 13:59:26 -05:00
Jason Couture face505588 Fix syntax error in CLBLAST if statement 2024-01-30 13:59:26 -05:00
Jason Couture ce5fbf0658 Copy clblast dependencies 2024-01-30 13:59:26 -05:00
Jason Couture ad3f895eb3 Make build-deps depend on compile-clblast 2024-01-30 13:59:25 -05:00
Jason Couture 2347a7aa09 Add build job for CLBLAST 2024-01-30 13:59:25 -05:00
Jason Couture 34ca5ff6eb Simplifiy directory creation in build-deps
using bash expansion expression, combined with mkdir -p (--parents), allows us to create all of the directories at once.

`deps/{avx,avx2,avx512,osx-arm64,osx-x64,cu11.7.1,cu12.1.0,clblast}`
expands to: deps/avx deps/avx2 deps/avx512 deps/osx-arm64 deps/osx-x64 deps/cu11.7.1 deps/cu12.1.0 deps/clblast
2024-01-30 13:59:25 -05:00
Jason Couture 9cfbd22499 Fix github variable name
I am not on my game today 🤦
2024-01-30 13:35:54 -05:00
Jason Couture 30e448d2d5 Use event input directly
GHA doesn't seem to load the value into the enviroment variable first, so the workflow always runs on master.
2024-01-30 13:35:54 -05:00
Martin Evans afa6cc0ec4
Merge pull request #470 from jasoncouture/specific_commit
Checkout specific ref for llamacpp when building native libs
2024-01-30 18:13:02 +00:00
Jason Couture 64cb697bbf Checkout specific ref for llamacpp when building native libs 2024-01-30 13:10:13 -05:00
Jason Couture ec59c5bf9e Fix missing library name prefix for cuda 2024-01-30 12:41:23 -05:00
Martin Evans 0f9742c6d0
Merge pull request #465 from jasoncouture/lib_naming
Use llama instead of libllama in `[DllImport]`
2024-01-30 16:35:24 +00:00
Jason Couture 443ce4fff4 While the dllimport changes work, manual path searching needed to be updated 2024-01-30 11:10:51 -05:00
Jason Couture db7e1e88f8 Use llama instead of libllama in `[DllImport]`
This results in windows users not needing to rename the DLL. This allows native llama builds to be dropped in, even on windows.

I also took the time to update the documentation, removing references to renaming the files, since the names now match.

Fixes #463
2024-01-30 02:40:13 -05:00
Martin Evans 4cfdf064b8
Merge pull request #462 from SciSharp/dependabot/nuget/System.Text.Json-8.0.1
build(deps): bump System.Text.Json from 8.0.0 to 8.0.1
2024-01-29 13:18:25 +00:00
dependabot[bot] d8eb817bf5
build(deps): bump System.Text.Json from 8.0.0 to 8.0.1
Bumps [System.Text.Json](https://github.com/dotnet/runtime) from 8.0.0 to 8.0.1.
- [Release notes](https://github.com/dotnet/runtime/releases)
- [Commits](https://github.com/dotnet/runtime/compare/v8.0.0...v8.0.1)

---
updated-dependencies:
- dependency-name: System.Text.Json
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-01-29 06:45:09 +00:00
Martin Evans 5cf481dc8e
Merge pull request #454 from martindevans/kv_cache_instance_methods
kv_cache_instance_methods
2024-01-25 15:26:47 +00:00
Martin Evans 92b9bbe779 Added methods to `SafeLLamaContextHandle` for KV cache manipulation 2024-01-23 16:16:02 +00:00