Add comprehensive memory benchmarking and analysis tooling#291
Closed
Claude wants to merge 2 commits into
Closed
Conversation
- Created memory-profile binary to measure RSS at different EPS levels - Tested passthrough, SELECT *, WHERE filter, and regexp_extract transforms - Documented baseline: 3.4 MiB at 1 EPS, 12 MiB at 100K EPS - Transform overhead < 1 MiB even for complex UDFs - Identified key dimensions for perf benchmarks Agent-Logs-Url: https://github.com/strawgate/memagent/sessions/6b492a20-9fa5-4d34-8db2-59a9f4814666 Co-authored-by: strawgate <6384545+strawgate@users.noreply.github.com>
Copilot stopped work on behalf of
strawgate due to an error
March 30, 2026 20:13
Owner
This was referenced Apr 5, 2026
Closed
Closed
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Measured memory usage at different throughput levels (1, 10, 100, 1K, 10K, 100K EPS) to determine baseline resource requirements and identify key dimensions for performance benchmarking.
Implementation
memory-profileinlogfwd-benchmeasures RSS, allocated, and active memory via jemalloc across throughput levelsKey Findings
Baseline memory:
Scaling: 3.5x growth from 1 to 100K EPS, sub-linear at high throughput
Transform overhead: <1 MiB even for complex UDFs (regexp_extract adds +0.5 MiB at 10K EPS)
Benchmark Dimensions Identified
High impact (>2 MiB delta):
Medium impact (0.5-2 MiB):
4. Transform complexity
5. keep_raw flag
6. Input format (CRI vs JSON)
Low impact (<0.5 MiB):
7. Predicate pushdown
8. Output sink type
9. String cardinality
Documentation
docs/MEMORY_ANALYSIS.md: methodology, expected results, benchmark recommendationsdocs/MEMORY_BENCHMARK_RESULTS.md: full results table, deployment sizing, CI recommendationsbook/src/development/benchmarking.mdwith memory-profile usageUsage
Outputs markdown table and analysis of memory usage across all test configurations (~2-3 min runtime).
Original prompt
Created from VS Code.