Upgrade KFP to 2.16.1 and increase pipeline timeout #127
…ages
- Updated kfp dependency from 2.14.6 to 2.16.1 in GitHub Actions workflow
- This resolves compatibility issues with quay.io/aipcc/docling/sdk-cuda-ubi9:3.5.0-ea.2 which only provides kfp 2.16.1 in its Python environment
- Also cleaned up minor whitespace issues in the workflow file
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
The VLM pipeline downloads SmolVLM and smoldocling models from HuggingFace, which include multiple ONNX variants (fp16, int8, q4, bnb4, uint8, etc.). The download was timing out at 15 minutes. Extending the timeout to 30 minutes to allow sufficient time for model downloads, especially for the VLM pipeline with unauthenticated HuggingFace access.
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Here is the successful pipeline run with quay.io/aipcc/docling/sdk-cuda-ubi9:3.5.0-ea.2
Summary
This PR addresses two issues with the local pipeline test runner:
Problem
The test-local-pipelines workflow was failing when using quay.io/aipcc/docling/sdk-cuda-ubi9:3.5.0-ea.2 with two errors:
Error 1: KFP Version Mismatch
The newer Docling CUDA image only provides KFP 2.16.1 in its Python environment.
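A mismatch like this can be caught before the pipeline starts. The following is a minimal sketch, not code from this PR: it compares the kfp version installed in the current Python environment against the pinned one using only the standard library. The names REQUIRED_KFP, installed_version, and versions_match are hypothetical.

```python
from importlib import metadata
from typing import Optional

# Version pinned by this PR; the constant name is hypothetical.
REQUIRED_KFP = "2.16.1"


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def versions_match(installed: Optional[str], required: str) -> bool:
    """True only when a version is installed and equals the required pin exactly."""
    return installed is not None and installed.strip() == required.strip()
```

Calling versions_match(installed_version("kfp"), REQUIRED_KFP) inside the image and in the workflow environment would show whether the two sides agree before any pipeline is compiled.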
Error 2: Timeout During Model Download
The VLM pipeline downloads SmolVLM and smoldocling models from HuggingFace, which include multiple ONNX variants. These downloads were timing out at 15 minutes.
Changes
.github/workflows/execute-kfp-localrunners.yml:
Testing
quay.io/aipcc/docling/sdk-cuda-ubi9:3.5.0-ea.2
Notes
The 30-minute timeout should be sufficient for VLM model downloads even with unauthenticated HuggingFace access. For faster downloads in production, consider setting HF_TOKEN for authenticated access.
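As an illustration of the HF_TOKEN suggestion, here is a hedged sketch of an authenticated download with huggingface_hub. It is an assumption that the pipeline fetches models through snapshot_download; the PR does not say how the download is performed. The helpers download_kwargs and fetch_model are hypothetical, and the optional skip patterns are included only because the PR mentions many ONNX variants. Whether the pipeline can do without any of them is not stated.

```python
import os
from typing import Any, Dict, Mapping, Optional, Sequence


def download_kwargs(
    repo_id: str,
    env: Optional[Mapping[str, str]] = None,
    skip: Optional[Sequence[str]] = None,
) -> Dict[str, Any]:
    """Build keyword arguments for huggingface_hub.snapshot_download.

    Adds `token` only when HF_TOKEN is set, so unauthenticated access still works.
    `skip` is an optional list of glob patterns for files to leave out (assumption).
    """
    env = os.environ if env is None else env
    kwargs: Dict[str, Any] = {"repo_id": repo_id}
    token = env.get("HF_TOKEN")
    if token:
        kwargs["token"] = token
    if skip:
        kwargs["ignore_patterns"] = list(skip)
    return kwargs


def fetch_model(repo_id: str, skip: Optional[Sequence[str]] = None) -> str:
    """Download a model snapshot and return its local path (requires network access)."""
    # Imported lazily so the helper above can be used without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(**download_kwargs(repo_id, skip=skip))
```

Recent huggingface_hub releases also read HF_TOKEN from the environment on their own, so in a GitHub Actions job, exposing the secret as an environment variable may be all that is needed.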