Update cumm dependency to >=0.8.2, add CLAUDE.md and custom build script

Gofinge · claude · Gofinge · commit e4e2dc7c4811 · 2026-04-03T16:07:07.000+08:00
Co-Authored-By: Claude Opus 4.6 (1M context) &lt;noreply@anthropic.com&gt;
diff --git a/.gitignore b/.gitignore
@@ -116,4 +116,4 @@ example/libspconv/cumm
 example/libspconv/spconv/include
 example/libspconv/spconv/src
 
-third_party/boost
+third_party/
diff --git a/CLAUDE.md b/CLAUDE.md
@@ -0,0 +1,88 @@
+# SpConv - Spatial Sparse Convolution Library
+
+## Overview
+SpConv (v2.3.8) implements high-performance sparse convolution operations for 1D/2D/3D/4D data, commonly used in 3D point cloud processing (e.g., autonomous driving). Depends on cumm for GEMM/convolution kernels. Uses PCCM for C++ code generation and pybind11 for bindings.
+
+## Build System
+
+### Build Command (pre-compiled wheel)
+```bash
+export SPCONV_DISABLE_JIT=1
+export CUMM_CUDA_ARCH_LIST=all
+export CUMM_CUDA_VERSION=12.8
+export BOOST_ROOT=/path/to/boost_1_77_0
+pip wheel . --no-deps -w dist/
+```
+
+### Key Environment Variables
+| Variable | Purpose | Example |
+|---|---|---|
+| `CUMM_CUDA_VERSION` | Target CUDA version | `"12.8"`, `""` (CPU) |
+| `SPCONV_DISABLE_JIT` | `"1"` for pre-compiled wheels | `"1"` |
+| `CUMM_CUDA_ARCH_LIST` | GPU architectures | `"all"`, `"8.6"` |
+| `BOOST_ROOT` | Boost 1.77.0 headers path | `/path/to/boost_1_77_0` |
+| `SPCONV_PYTHON_LIST` | Python versions for build script | `"3.10;3.11;3.12;3.13"` |
+| `SPCONV_VERSION_SUFFIX` | Dev version suffix | `"1.0"` → `2.3.8.dev1000` |
+
+### Build Dependencies
+- **Python**: pccm>=0.4.16, ccimport>=0.4.4, pybind11>=2.6.0, fire, numpy
+- **Critical dep**: cumm>=0.8.2 (must be pre-installed for wheel builds)
+- **C++**: NVIDIA CUDA Toolkit, C++ compiler
+- **Headers**: Boost 1.77.0 (header-only, geometry module required)
+
+### Docker Build (Linux manylinux wheels)
+```bash
+# Download Boost first
+mkdir -p third_party
+wget https://boostorg.jfrog.io/artifactory/main/release/1.77.0/source/boost_1_77_0.zip -O third_party/boost.zip
+unzip third_party/boost.zip -d third_party/boost
+
+docker run --rm \
+  -e PLAT=manylinux_2_28_x86_64 \
+  -e CUMM_CUDA_VERSION=12.8 \
+  -e SPCONV_PYTHON_LIST="3.10;3.11;3.12;3.13" \
+  -e BOOST_ROOT=/io/third_party/boost/boost_1_77_0 \
+  -v $(pwd):/io \
+  scrin/manylinux2014-cuda:cu128-devel-1.0.0 \
+  bash -c "source /etc/bashrc && /io/tools/build-wheels.sh"
+```
+
+### Build Order
+cumm must be built and installed BEFORE building spconv (spconv imports from cumm at build time).
+
+### Platform Tags
+- CUDA < 12.4: `manylinux2014_x86_64`
+- CUDA ≥ 12.4: `manylinux_2_28_x86_64`
+
+## Project Structure
+- `spconv/` - Python package
+  - `csrc/` - C++ source definitions via PCCM
+    - `sparse/all.py` (99KB) - SpconvOps main binding
+    - `sparse/convops.py` (99KB) - GemmTuner, ConvTuner, ConvGemmOps
+    - `sparse/indices.py` (84KB) - Sparse index pair generation
+    - `sparse/alloc.py` - Memory allocation (thrust, external)
+  - `pytorch/` - PyTorch integration, quantization
+  - `core.py` - Kernel parameter definitions (SIMT, Volta, Turing, Ampere)
+  - `constants.py` - Package constants, Boost path, JIT settings
+  - `core_cc/` - Generated C++ extensions (build output)
+- `test/` - Test suite
+- `example/` - MNIST, sparse conv examples
+- `tools/` - Build scripts
+- `docs/` - API, performance, quantization guides
+
+## Key Files
+- `setup.py` - Package build, cumm dependency constraint, kernel compilation
+- `spconv/core.py` - **Critical**: GEMM/conv kernel parameters for all GPU architectures
+- `spconv/constants.py` - Boost path, JIT settings, weight layout
+- `tools/build-wheels.sh` - Linux wheel build script (uses SPCONV_PYTHON_LIST)
+
+## Kernel Architecture Support (core.py)
+- `SHUFFLE_SIMT_PARAMS` - f32/f16 kernels for all GPUs (fallback)
+- `SHUFFLE_VOLTA_PARAMS` - Volta tensor core (sm_70)
+- `SHUFFLE_TURING_PARAMS` - Turing tensor core (sm_75)
+- `SHUFFLE_AMPERE_PARAMS` - Ampere (currently empty, uses NVRTC)
+- `IMPLGEMM_*_PARAMS` - Implicit GEMM variants for each arch
+
+## Package Naming
+- CPU: `spconv`
+- CUDA: `spconv-cu{VER}` (e.g., `spconv-cu128`)
diff --git a/pyproject.toml b/pyproject.toml
@@ -1,5 +1,5 @@
 [build-system]
-requires = ["setuptools>=41.0", "wheel", "pccm>=0.4.16", "cumm>=0.7.11"]
+requires = ["setuptools>=41.0", "wheel", "pccm>=0.4.16", "cumm>=0.8.2"]
 # requires = ["setuptools>=41.0", "wheel", "pccm>=0.4.0", "cumm @ file:///io/dist/cumm_cu120-0.4.2-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl"]
 # requires = ["setuptools>=41.0", "wheel", "pccm>=0.4.0", "cumm-cu126 @ file:///io/dist/cumm_cu126-0.7.3-cp313-cp313-manylinux_2_28_x86_64.whl"]
 build-backend = "setuptools.build_meta"
diff --git a/setup.py b/setup.py
@@ -39,9 +39,9 @@
     cuda_ver_str = cuda_ver.replace(".", "") # 10.2 to 102
 
     RELEASE_NAME += "-cu{}".format(cuda_ver_str)
-    deps = ["cumm-cu{}>=0.7.11, <0.8.0".format(cuda_ver_str)]
+    deps = ["cumm-cu{}>=0.8.2".format(cuda_ver_str)]
 else:
-    deps = ["cumm>=0.7.11, <0.8.0"]
+    deps = ["cumm>=0.8.2"]
 
 
 
diff --git a/tools/build-wheels-custom.sh b/tools/build-wheels-custom.sh
@@ -0,0 +1,67 @@
+#!/bin/bash
+# Parameterized spconv wheel builder
+# Accepts SPCONV_PYTHON_LIST env var (e.g., "3.10;3.11;3.12;3.13")
+# Expects cumm wheels at /io/cumm_dist/ for pre-installation
+set -e -u -x
+
+function repair_wheel {
+    wheel="$1"
+    outpath="$2"
+    if ! auditwheel show "$wheel"; then
+        echo "Skipping non-platform wheel $wheel"
+    else
+        auditwheel repair "$wheel" --plat "$PLAT" --only-plat -w "$outpath"
+    fi
+}
+
+gcc -v
+export SPCONV_DISABLE_JIT="1"
+export CUMM_CUDA_ARCH_LIST="all"
+
+# Default to 3.10-3.13 if not specified
+SPCONV_PYTHON_LIST="${SPCONV_PYTHON_LIST:-3.10;3.11;3.12;3.13}"
+
+# Derive CUDA version short string for wheel matching
+CUDA_VER_SHORT=$(echo "${CUMM_CUDA_VERSION}" | sed 's/\.//')
+
+# Clean up previous build artifacts (may be owned by root from prior Docker runs)
+rm -rf /io/build /io/*.egg-info /io/wheelhouse_tmp
+mkdir -p /io/wheelhouse_tmp /io/dist
+
+for PYVER in ${SPCONV_PYTHON_LIST//;/ }; do
+    PYVER2=$(echo "$PYVER" | sed 's/\.//')
+    PYVER_CP="cp${PYVER2}-cp${PYVER2}"
+    PYTHON="/opt/python/${PYVER_CP}/bin/python"
+    PIP="/opt/python/${PYVER_CP}/bin/pip"
+
+    echo "=== Building spconv for Python ${PYVER} ==="
+
+    # Install build dependencies
+    "${PIP}" install pccm pybind11 ccimport
+
+    # Install pre-built cumm wheel
+    if [ -d /io/cumm_dist ]; then
+        CUMM_WHL=$(ls /io/cumm_dist/cumm_cu${CUDA_VER_SHORT}-*-${PYVER_CP}-*.whl 2>/dev/null | head -1)
+        if [ -n "$CUMM_WHL" ]; then
+            echo "Installing cumm from: ${CUMM_WHL}"
+            "${PIP}" install "$CUMM_WHL"
+        else
+            echo "WARNING: No cumm wheel found for cu${CUDA_VER_SHORT} ${PYVER_CP}, trying pip install"
+            "${PIP}" install "cumm-cu${CUDA_VER_SHORT}>=0.8.2"
+        fi
+    else
+        echo "WARNING: /io/cumm_dist not found, trying pip install"
+        "${PIP}" install "cumm-cu${CUDA_VER_SHORT}>=0.8.2"
+    fi
+
+    "${PIP}" wheel /io/ -v --no-deps -w /io/wheelhouse_tmp
+done
+
+# Bundle external shared libraries into the wheels
+for whl in /io/wheelhouse_tmp/*.whl; do
+    repair_wheel "$whl" /io/dist
+done
+
+rm -rf /io/wheelhouse_tmp
+echo "=== spconv wheels built successfully ==="
+ls -la /io/dist/