Skip to content

fix: allow oversized Vulkan parameter tensors#1662

Merged
leejet merged 1 commit into
masterfrom
fix/vulkan-large-param-tensors
Jun 15, 2026
Merged

fix: allow oversized Vulkan parameter tensors#1662
leejet merged 1 commit into
masterfrom
fix/vulkan-large-param-tensors

Conversation

@leejet

@leejet leejet commented Jun 15, 2026

Copy link
Copy Markdown
Owner

Summary

  • Treat backend max buffer size as a chunking threshold instead of a hard per-tensor limit when allocating parameter buffers.
  • Allow oversized tensors to be allocated alone so Vulkan can handle large weights such as Qwen embedding tensors.

Related Issue / Discussion

Fix #1659.

Additional Information

N/A

Checklist

@leejet leejet merged commit 6e66a1a into master Jun 15, 2026
14 checks passed
@leejet leejet deleted the fix/vulkan-large-param-tensors branch June 15, 2026 15:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug] regression: Vulkan memory management error after master-691-563137a

1 participant