LLamaSharp/docs/xmldocs/llama.native.llamaftype.md

1.7 KiB

LLamaFtype

Namespace: LLama.Native

Supported model file types

public enum LLamaFtype

Inheritance ObjectValueTypeEnumLLamaFtype
Implements IComparable, IFormattable, IConvertible

Fields

Name Value Description
LLAMA_FTYPE_ALL_F32 0 All f32
LLAMA_FTYPE_MOSTLY_F16 1 Mostly f16
LLAMA_FTYPE_MOSTLY_Q8_0 7 Mostly 8 bit
LLAMA_FTYPE_MOSTLY_Q4_0 2 Mostly 4 bit
LLAMA_FTYPE_MOSTLY_Q4_1 3 Mostly 4 bit
LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16 4 Mostly 4 bit, tok_embeddings.weight and output.weight are f16
LLAMA_FTYPE_MOSTLY_Q5_0 8 Mostly 5 bit
LLAMA_FTYPE_MOSTLY_Q5_1 9 Mostly 5 bit
LLAMA_FTYPE_MOSTLY_Q2_K 10 K-Quant 2 bit
LLAMA_FTYPE_MOSTLY_Q3_K_S 11 K-Quant 3 bit (Small)
LLAMA_FTYPE_MOSTLY_Q3_K_M 12 K-Quant 3 bit (Medium)
LLAMA_FTYPE_MOSTLY_Q3_K_L 13 K-Quant 3 bit (Large)
LLAMA_FTYPE_MOSTLY_Q4_K_S 14 K-Quant 4 bit (Small)
LLAMA_FTYPE_MOSTLY_Q4_K_M 15 K-Quant 4 bit (Medium)
LLAMA_FTYPE_MOSTLY_Q5_K_S 16 K-Quant 5 bit (Small)
LLAMA_FTYPE_MOSTLY_Q5_K_M 17 K-Quant 5 bit (Medium)
LLAMA_FTYPE_MOSTLY_Q6_K 18 K-Quant 6 bit
LLAMA_FTYPE_GUESSED 1024 File type was not specified