Skip to content

Dflash speculator#24837

Open
kashif wants to merge 18 commits into
ggml-org:masterfrom
kashif:dflash-rebase
Open

Dflash speculator#24837
kashif wants to merge 18 commits into
ggml-org:masterfrom
kashif:dflash-rebase

Conversation

@kashif

@kashif kashif commented Jun 20, 2026

Copy link
Copy Markdown
Contributor

Overview

POC DFlash speculator in llama.cpp using the new api vs the previous draft PR #22105

Additional information

Requirements

@kashif kashif requested review from a team, CISC, ggerganov and ngxson as code owners June 20, 2026 11:55
@github-actions github-actions Bot added documentation Improvements or additions to documentation model Model specific examples python python script changes server labels Jun 20, 2026
@ngxson

ngxson commented Jun 20, 2026

Copy link
Copy Markdown
Collaborator

wrong push target ?

@ServeurpersoCom ServeurpersoCom marked this pull request as draft June 20, 2026 12:10
@ServeurpersoCom

ServeurpersoCom commented Jun 20, 2026

Copy link
Copy Markdown
Contributor

I told him he could push in draft. I tested it at home, it's interesting. It looks clean. that would be too much all at once for 1 PR

@pwilkin

pwilkin commented Jun 24, 2026

Copy link
Copy Markdown
Member

I told him he could push in draft. I tested it at home, it's interesting. It looks clean. that would be too much all at once for 1 PR

Nope, it's the minimum number of changes to add the DFlash arch, nothing extra here. It's just spread over a lot of classes. Looks clean to me. @kashif rebase on the newest master, switch to non-draft and tag @am17an and @ggerganov for review.

@kashif

kashif commented Jun 24, 2026

Copy link
Copy Markdown
Contributor Author

let me clean it up! and ping you!

@kashif kashif marked this pull request as ready for review June 24, 2026 13:29
@kashif

kashif commented Jun 24, 2026

Copy link
Copy Markdown
Contributor Author

if you can kindly have a look @am17an

@pwilkin

pwilkin commented Jun 24, 2026

Copy link
Copy Markdown
Member

There's still an error with test-llama-archs, I think you need to take it out of the list of architectures to test there.

@kashif

kashif commented Jun 24, 2026

Copy link
Copy Markdown
Contributor Author

yup fixing

@kashif kashif requested a review from JohannesGaessler as a code owner June 24, 2026 13:53
@github-actions github-actions Bot added the testing Everything test related label Jun 24, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation examples model Model specific python python script changes server testing Everything test related

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants