f0b0bbcbb7
Modified LLamaBatch to not share tokens with other sequences if logits is true. This ensures that the logit span at the end in used by exactly one sequence - therefore it's safe to mutate. This removes the need for copying _very_ large arrays (vocab size) and simplifies sampling pipelines. |
||
---|---|---|
.. | ||
BatchedExecutor.cs | ||
Conversation.cs | ||
ConversationExtensions.cs | ||
Exceptions.cs |