Datashader was consistently slower than matplotlib when rendering points
due to five performance bottlenecks:
1. A Dask DataFrame was passed to cvs.points() instead of a pandas
DataFrame (~137x scheduler overhead on already-computed data)
2. Extent was computed twice (get_extent on dask, then .compute again)
3. _hex_no_alpha() was called per point in an O(n) list comprehension
4. _build_datashader_color_key iterated over all points instead of
exiting early once every category had been found
5. _want_decorations created an O(n) Python set from the color vector
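For illustration only, items 3 and 4 can be sketched in plain Python as
below. This is not the code from this commit: hex_no_alpha,
strip_alpha_cached and build_color_key are hypothetical stand-ins, and
the real helpers may take different arguments or use vectorized
operations instead of Python loops. The sketch only shows the two ideas:
convert each distinct color once, and stop scanning as soon as every
category has a color.

```python
from typing import Dict, Iterable, List


def hex_no_alpha(color: str) -> str:
    """Hypothetical stand-in: turn '#RRGGBBAA' into '#RRGGBB'."""
    if len(color) == 9 and color.startswith("#"):
        return color[:7]
    return color


def strip_alpha_cached(colors: Iterable[str]) -> List[str]:
    """Call hex_no_alpha once per distinct color, not once per point."""
    cache: Dict[str, str] = {}
    out: List[str] = []
    for color in colors:
        converted = cache.get(color)
        if converted is None:
            converted = hex_no_alpha(color)
            cache[color] = converted
        out.append(converted)
    return out


def build_color_key(
    categories: Iterable[str],
    labels: Iterable[str],
    colors: Iterable[str],
) -> Dict[str, str]:
    """Map each category to the first color seen for it.

    The scan stops as soon as every requested category has a color,
    so the remaining points are never visited.
    """
    wanted = set(categories)
    key: Dict[str, str] = {}
    for label, color in zip(labels, colors):
        if label in wanted and label not in key:
            key[label] = hex_no_alpha(color)
            if len(key) == len(wanted):
                break  # early exit: all categories found
    return key
```

With a handful of categories spread over hundreds of thousands of
points, the early exit usually triggers within the first few rows, so
the cost depends on how soon each category first appears rather than on
the total number of points.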
After fixes, datashader is 1.2-1.4x faster than matplotlib for plain
points and up to 1.6x faster for categorical coloring at 500K+ points.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>