Skip to content

celeris-labs/duckdb-bench-damon2026

Repository files navigation

DaMoN'26 Benchmark for "Should I Hide My Duck in the Lake?"

This is the benchmarking code used to get the results presented in the short paper "Should I Hide My Duck in the Lake?". When cloning the repo, you have to initialize the submodules with:

git submodule update --init --recursive

Building the benchmark & extension

To build the benchmark:

mkdir build && cd build
cmake ..
make -j 64
cd ..

To build the DuckDB ViewRewriter extension:

cd extension
make

Getting the data

To generate scale factor 30 as Parquet files in the directory data/tpch-30 with default TPC-H ordering execute:

./generate -b tpch -s 30

To get more options, including how to generate TPC-DS data, execute with -h. The ClickBench Parquet file can be downloaded with ./download_clickbench.sh.

To put data into tmpfs on the HACC cluster (the benchmark were run on hacc-box-01 of the HACC cluster):

sudo tmpfs-create -s 32 -n 0
cp -r data/tpch-30 /mnt/ramdisk

Executing experiments & generating plots

To run experiments and plot the results (this requires some manual moving the measurements to the correct folders for now):

./run_all.sh
python3 plot.py

About

Benchmark for DaMoN'26 workshop that shows the potential of SmartNIC acceleration for file decoding, filtering, and projection

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors