From 1096689734c24421df26311809e881b28069eb6d Mon Sep 17 00:00:00 2001 From: wejoncy Date: Sat, 21 Mar 2026 07:58:32 +0000 Subject: [PATCH] docs: update README with CUDA 13.0, Python 3.11-3.13, new GPU archs --- README.md | 15 +++++---------- 1 file changed, 5 insertions(+), 10 deletions(-) diff --git a/README.md b/README.md index 1b99091..9a33b2f 100644 --- a/README.md +++ b/README.md @@ -34,33 +34,28 @@ Features: - [x] Export to ONNX model, inference by OnnxRuntime *Latest News* 🔥 +- [2026/03] CUDA 13.0 support, PyTorch 2.10, Python 3.11-3.13 +- [2026/03] Support H100/H200 (sm_90), B200/B300 (sm_100), RTX 5090 (sm_120) - [2024/03] ONNX Models export API - [2024/01] Support [HQQ](https://github.com/mobiusml/hqq) algorithm - [2023/12] The first PyPi package released ## Installation -Easy to install qllm from PyPi [cu124] +Easy to install qllm from PyPi `pip install qllm` -Install from release package, CUDA-124 is supported. -[py310,py311,py312] https://github.com/wejoncy/QLLM/releases +Install from release package, CUDA 13.0 is supported. +[py311, py312, py313] https://github.com/wejoncy/QLLM/releases Build from Source **Please set ENV EXCLUDE_EXTENTION_FOR_FAST_BUILD=1 for fast build** -If you are using CUDA-124 ``` pip install git+https://github.com/wejoncy/QLLM.git --no-build-isolation ``` -OR CUDA-118/121 -``` -git clone https://github.com/wejoncy/QLLM.git -cd QLLM -python setup.py install -``` # How to use it