microsoft / onnxruntime-genai Public

Notifications You must be signed in to change notification settings
Fork 291
Star 1k

Code
Issues 135
Pull requests 40
Discussions
Actions
Projects
Models
Wiki
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Models
Wiki
Security and quality
Insights

Pull requests: microsoft/onnxruntime-genai

Labels 57 Milestones 0

New pull request New

40 Open 1,462 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Auto-detect fixed kv-cache shape in DefaultKeyValueCache 0.14.0

#2166 opened May 18, 2026 by akholodnamdcom Contributor

Loading…

Add MIGraphX execution provider support

#2165 opened May 14, 2026 by aditya-dl

Loading…

Add text-only mode support for Qwen 3.5 model builder 0.14.0

#2157 opened May 12, 2026 by apsonawane Contributor

Loading…

Add per-run profiling config for fine-grained Run() profiling

#2152 opened May 9, 2026 by xiaofeihan1 Contributor • Draft

Add gemma4 unit tests

#2151 opened May 8, 2026 by apsonawane Contributor

Loading…

Nvidia Parakeet Tdt ASR support 0.14.0

#2150 opened May 8, 2026 by nenad1002 Contributor

Loading…

Add VideoChat-Flash (OpenGVLab) language model support 0.14.0

#2147 opened May 8, 2026 by anilmartha Contributor

Loading…

4 tasks

Add Qwen3.5-MoE (35B-A3B) model support 0.14.0

#2146 opened May 8, 2026 by tanzeel-amd

Loading…

Expose mutable sampling parameters on live Generator

#2145 opened May 8, 2026 by qjia7 Contributor

Loading…

4 tasks done

Add HunYuan Dense V1 (hunyuan_v1_dense) model support 0.14.0

#2144 opened May 8, 2026 by anilmartha Contributor

Loading…

fix: enable Generator.rewind_to(0) for multimodal models

#2141 opened May 8, 2026 by justinchuby Contributor • Draft

Enable Qwen3.5 TRT-RTX EP path with CUDA graph 0.14.0

#2139 opened May 7, 2026 by yen-shi

Loading…

Supply JobId to runs-on

#2138 opened May 7, 2026 by baijumeswani Collaborator

Loading…

[Qwen3] Allow packed QKV MatMul under QK-Norm via post-MatMul Split 0.14.0

#2137 opened May 7, 2026 by xiaofeihan1 Contributor

Loading…

model package integration

#2136 opened May 6, 2026 by xiaoyu-work Contributor

Loading…

Fix Gemma4 pixel_values trimming

#2135 opened May 6, 2026 by apsonawane Contributor

Loading…

Cohere Transcribe Support

#2133 opened May 6, 2026 by nenad1002 Contributor

Loading…

Add granitemode support 0.14.0

#2124 opened May 6, 2026 by amdrajeevp1 Contributor

Loading…

4 tasks

Fix multimodal CUDA pipeline: embedding output persistence causes shape mismatch

#2123 opened May 5, 2026 by justinchuby Contributor

Loading…

Add Seed-OSS architecture support (SeedOssForCausalLM)

#2116 opened May 3, 2026 by PMeeske

Loading…

Pipeline-as-Config: Declarative model dispatch replacing model_type string registry

#2115 opened May 2, 2026 by justinchuby Contributor • Draft

Fix AppendNextTokensToSequences heap overflow

#2111 opened Apr 30, 2026 by apsonawane Contributor

Loading…

Fix heap overflow issue

#2110 opened Apr 30, 2026 by apsonawane Contributor

Loading…

Add linux-aarch64 support 0.14.0

#2107 opened Apr 29, 2026 by baijumeswani Collaborator • Draft

Release captured graph resources when generator is destroyed

#2106 opened Apr 29, 2026 by qjia7 Contributor • Draft

Previous 1 2 Next

Previous Next

ProTip! Updated in the last three days: updated:>2026-05-16.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!