Add CodSpeed CPU, memory, and macro benchmark integration by adriangb · Pull Request #55 · pydantic/datafusion

adriangb · 2026-02-23T05:04:03Z

Summary

This PR integrates CodSpeed for continuous performance benchmarking of DataFusion with three modes:

Changes

Workspace dependency update: Replaced criterion = "0.8" with codspeed-criterion-compat = "4.3.0" using Cargo dependency renaming (package = "codspeed-criterion-compat"). This is a drop-in replacement that preserves all existing use criterion::*; imports across the codebase -- no benchmark source code changes required.
CI workflow (.github/workflows/codspeed.yml) with three jobs:
- Criterion CPU Simulation: Runs all 107 criterion micro-benchmarks with instruction counting (<1% variance)
- Criterion Memory Profiling: Tracks heap allocations across all criterion benchmarks
- Macro Benchmarks: Runs TPC-H SF1 (22 queries) and ClickBench partitioned (43 queries) via codspeed exec for per-query regression tracking
README badge: Added CodSpeed badge linking to the project dashboard.
Macro benchmark script (benchmarks/codspeed_macro.sh): Runs each TPC-H and ClickBench query individually through codspeed exec so each gets tracked as a separate benchmark.

How it works

The codspeed-criterion-compat crate acts as a passthrough when running cargo bench normally, so existing benchmark workflows are unaffected. When benchmarks are run through cargo codspeed in CI, CodSpeed instruments the execution to produce consistent, hardware-agnostic measurements.

Affected crates

All 13 crates with criterion benchmarks are covered automatically via the workspace dependency:

datafusion (core)
datafusion-common
datafusion-datasource
datafusion-datasource-parquet
datafusion-functions
datafusion-functions-aggregate
datafusion-functions-aggregate-common
datafusion-functions-nested
datafusion-functions-window
datafusion-optimizer
datafusion-physical-expr
datafusion-physical-plan
datafusion-spark

Design decisions

Simulation mode for macro benchmarks (not walltime): Works on standard ubuntu-latest runners with <1% variance
codspeed exec per query: Gives per-query regression tracking rather than one aggregate measurement
Iterations set to 1: Under simulation mode, benchmarks are deterministic (instruction-counting)
ClickBench partitioned: ~14GB across 100 parquet files, downloaded with 10 parallel workers

Test plan

cargo bench --no-run verifies all criterion benchmarks compile with codspeed-criterion-compat
CI: Criterion CPU simulation job builds and runs successfully
CI: Criterion memory profiling job builds and runs successfully
CI: Macro benchmark job generates data and runs 65 benchmarks (22 TPC-H + 43 ClickBench)

🤖 Generated with Claude Code

Add three CodSpeed CI jobs: Criterion CPU simulation, Criterion memory profiling, and macro benchmarks (TPC-H SF1 + ClickBench partitioned) via codspeed exec. Swap workspace criterion dependency to codspeed-criterion-compat. Install protobuf-compiler for substrait builds. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

github-actions bot added documentation Improvements or additions to documentation sqllogictest common physical-expr datasource physical-plan development-process ffi functions spark labels Feb 23, 2026

adriangb force-pushed the main branch from 73d682d to 9660c98 Compare February 23, 2026 05:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CodSpeed CPU, memory, and macro benchmark integration#55

Add CodSpeed CPU, memory, and macro benchmark integration#55
adriangb wants to merge 1 commit intomainfrom
codspeed

adriangb commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

adriangb commented Feb 23, 2026

Summary

Changes

How it works

Affected crates

Design decisions

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant