ONNX Runtime is using neither KV cache nor IO binding 😅 #191

Open

IlyasMoutawwakil

opened

In your benchmark, are you running the onnxruntime model without the KV cache and without IO binding?
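To illustrate why the KV cache matters for a fair comparison, here is a minimal sketch in plain Python. It is not onnxruntime code: `project`, `decode_without_cache`, `decode_with_cache`, and `count_calls` are hypothetical names, and the arithmetic is a stand-in for real attention. It only shows that, without a cache, every decoding step recomputes the key/value pairs of all previous tokens. IO binding (keeping inputs and outputs in device buffers to avoid host/device copies) is a separate cost and is not modeled here.

```python
def project(token):
    """Stand-in for a key/value projection; counts how often it runs."""
    project.calls += 1
    return (token * 2, token * 3)  # hypothetical (key, value) pair


project.calls = 0


def decode_without_cache(tokens):
    """Recompute the key/value pairs of the whole prefix at every step."""
    outputs = []
    for step in range(1, len(tokens) + 1):
        kv = [project(t) for t in tokens[:step]]
        outputs.append(sum(k * v for k, v in kv))
    return outputs


def decode_with_cache(tokens):
    """Compute each key/value pair once and reuse it on later steps."""
    cache = []
    outputs = []
    for t in tokens:
        cache.append(project(t))
        outputs.append(sum(k * v for k, v in cache))
    return outputs


def count_calls(decode, tokens):
    """Run a decode function and report how many projections it needed."""
    project.calls = 0
    result = decode(tokens)
    return result, project.calls
```

Both functions return the same outputs, but for `n` tokens the uncached loop performs `n * (n + 1) / 2` projections while the cached loop performs `n`, so a benchmark that leaves the cache off is likely to understate generation speed.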
