You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
v1.8.45: Fix node-llama-cpp API misuse causing CONTEXT_OVERFLOW
Fix G1: Correct lastEvaluationContextWindow structure - was passing raw
lastEvaluation object instead of {history, minimumOverlapPercentageToPreventContextShift}.
This silently disabled KV cache reuse, causing full re-tokenization every turn.
Fix G2: Correct contextShift metadata key - was using wrong key name
'lastEvaluationContextWindowHistory' instead of 'lastEvaluationMetadata'.
Fix G3: Add pre-generation diagnostic logging for chatHistory state.
Fix G4: Strip think/thought segment objects from cleanHistory before storing.
node-llama-cpp preserves hidden thought segments in cleanHistory even when
budgets.thoughtTokens=0, causing token count inflation across turns.
0 commit comments