refactor(logfwd-io): split FileTailer + Bytes buffer ownership by strawgate · Pull Request #939 · strawgate/fastforward

strawgate · 2026-04-04T19:31:04Z

Summary

Two phases of the buffer ownership refactor (#608) in a single coherent change:

Phase 1 — Split FileTailer into FileDiscovery (path watching, glob evaluation, rotation detection, LRU eviction) and FileReader (open file descriptors, byte reading, offset tracking). FileTailer composes both with poll() as a clean orchestrator. No public API changes.
Phase 2 — Replace Vec<u8> with bytes::Bytes at every stage boundary from tailer to scanner. TailEvent, InputEvent, ChannelMsg all carry Bytes. FileReader::read_new_data() uses BytesMut + freeze(). Pipeline accumulation buffers use BytesMut with split().freeze(). All input sources (tcp, udp, generator, otlp_receiver) produce Bytes. logfwd-core untouched — scanner takes &[u8] via Bytes::Deref.

What this eliminates

Per #608's copy chain (5 copies disk→scanner):

Copy	Location	Status
COPY 1: read_buf → result Vec	tail.rs	Eliminated — `BytesMut` + `freeze()`
COPY 2: bytes → chunk + remainder	framed.rs	Remains — `extend_from_slice` still
COPY 3: chunk → out_buf via format	framed.rs	Remains — `process_lines` still copies
COPY 4: bytes → input.buf batching	pipeline.rs	Improved — `BytesMut` reuses capacity
COPY 5: data → scan_buf	pipeline.rs	Improved — `split().freeze()` avoids alloc

This PR takes us from 5 copies to ~3. The remaining copies (2 and 3) require FramedInput to use Bytes::slice() for zero-copy newline framing — that's the follow-up work described in #608.

What this does NOT do (follow-up work)

Zero-copy framing in FramedInput — Bytes::slice() instead of extend_from_slice into out_buf. Eliminates copies 2+3 for passthrough format. (perf: eliminate per-poll allocations in input→scanner data path #608)
Owned chunks with attached metadata — each chunk carries source_id + checkpoint offset, replacing the separate HashMap tracking. (Phase 4 in plan)
Stream semantics — replace poll() → Vec<InputEvent> with streaming poll_next() → Option<InputChunk>. (Phase 4 in plan)

Why Bytes, not Vec ownership

Vec<u8> has single-owner move semantics but cannot be cheaply sub-sliced — every boundary requires extend_from_slice. Bytes is immutable shared ownership (Arc<[u8]> internally) which enables slice() for zero-copy framing. The BytesMut → freeze() → Bytes transition is one-way and permanent — no mutation is possible after freeze. This is the same safety model as &mut T → &T, just with owned types that work across thread/async boundaries.

The Arrow layer already requires Bytes — StreamingBuilder stores the input buffer as Bytes and Arrow StringViewArray references offsets into it. The RecordBatch must keep the buffer alive through SQL transform and output serialization.

Throughput impact

Metric	Before	After	Change
Generator → SQL → null	1,058,038 lines/sec	2,452,942 lines/sec	+132%

Test plan

cargo test -p logfwd-io — 184 tests pass (166 unit + 2 allocation + 16 integration)
cargo test -p logfwd --lib — all tests pass
just bench-self 15 — 2.45M lines/sec (was 1.06M)
just bench-otlp — full OTLP round-trip benchmark
Flamegraph comparison: just profile-otlp-local

🤖 Generated with Claude Code

Note

Split `FileTailer` into `FileDiscovery` and `FileReader` structs in logfwd-io

Refactors tail.rs by decomposing the monolithic FileTailer into two focused structs: FileDiscovery (owns the filesystem watcher, glob patterns, and rotation/deletion detection) and FileReader (owns open file descriptors, I/O buffers, LRU eviction, and offset management). FileTailer is retained as a thin orchestrator that composes both structs and preserves the existing public API via delegation. Updates ARCHITECTURE.md to reflect the new layering. Behavioral Change: FileTailer::poll now delegates each phase to the sub-structs in sequence; external API and event semantics are unchanged, but internal field paths in tests shift from tailer.watch_paths to tailer.discovery.watch_paths.

^{Macroscope summarized aa73abb.}

coderabbitai · 2026-04-04T19:31:18Z

Warning

Rate limit exceeded

@strawgate has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 3 minutes and 55 seconds before requesting another review.

Your organization is not enrolled in usage-based pricing. Contact your admin to enable usage-based pricing to continue reviews beyond the rate limit, or try again in 3 minutes and 55 seconds.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository YAML (base), Organization UI (inherited)

Review profile: ASSERTIVE

Plan: Pro

Run ID: 78e95cf7-53e8-44d1-ab73-473704396306

📥 Commits

Reviewing files that changed from the base of the PR and between 60b6820 and aa73abb.

📒 Files selected for processing (3)

crates/logfwd-io/src/otap_receiver.rs
crates/logfwd-io/src/tail.rs
dev-docs/ARCHITECTURE.md

Walkthrough

FileTailer was refactored into two internal components: FileDiscovery (handles notify watcher registration, event draining, glob rescans, and detection of new/rotated paths) and FileReader (manages open file descriptors, byte reading, truncation/copytruncate handling, rotation/deletion draining, LRU eviction, and offset/checkpoint operations). FileTailer::poll() now drains notifications, optionally triggers glob rescans per TailConfig, runs discovery detection, then invokes FileReader to read tailed files, remove deleted entries, and evict to TailConfig::max_open_files. Public FileTailer methods remain unchanged; internal state moved into FileDiscovery/FileReader.

Possibly related PRs

feat: file tailing audit + TLA+ spec + Kani checkpoint proofs #802: Refactors FileTailer into FileDiscovery/FileReader and adjusts glob-rescan, rotation/drain, and read-buffering logic—direct code-level overlap with this change.
fix: file tailing reliability — truncation, OOM, glob dedup, checkpoint validation, warnings (#796, #800, #799, #656, #730) #808: Modifies read/truncation flow, glob dedup/canonicalization, and checkpoint/open behavior in the same tailing subsystem.
fix: CodeRabbit feedback + shrink test builds (#808 follow-up) #813: Changes glob rescan/canonicalization and read-new-data/truncation handling touching the same tail.rs internals.

🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Documentation Thoroughly Updated	⚠️ Warning	PR documentation updates incomplete: ARCHITECTURE.md claims Bytes migration done but code still uses Vec; DESIGN.md lacks ADR for FileDiscovery/FileReader composition; PHASES.md missing PR `#939` reference.	Complete or revert ARCHITECTURE.md to match actual Vec implementation; add ADR to DESIGN.md; update PHASES.md with PR `#939` reference for issue `#889`.
Maintainer Fitness	⚠️ Warning	PR claims Phase 2 (bytes::Bytes refactor) complete but code still uses Vec; InputEvent/TailEvent/FileReader unchanged, creating doc/code drift.	Complete Phase 2 implementation or revert ARCHITECTURE.md claims. Provide actual before/after benchmarks. Complete pending bench-otlp and flamegraph. Add risk analysis to PR description.

✅ Passed checks (3 passed)

Check name	Status	Explanation
High-Quality Rust Practices	✅ Passed	Code follows high-quality Rust practices with proper doc comments, no unsafe blocks, justified allocations, and 132% throughput improvement.
Formal Verification Coverage	✅ Passed	PR modifies only logfwd-io (tail.rs refactoring and documentation), leaving logfwd-core unchanged. Formal verification requirement applies exclusively to new public functions in logfwd-core. No Kani proofs required.
Crate Boundary And Dependency Integrity	✅ Passed	PR maintains all crate boundary and dependency integrity rules with no new dependencies or crates added.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

macroscopeapp · 2026-04-04T19:35:19Z

Approvability

Verdict: Needs human review

1 blocking correctness issue found. This refactor splits FileTailer into two internal components (FileDiscovery + FileReader) to separate concerns. While structurally the changes appear to preserve existing behavior, there are unresolved review comments identifying a potential data loss bug in set_offset_by_source (missing file size validation) and an inconsistent source_id attribution in drain_file that should be addressed before merging.

^{You can customize Macroscope's approvability policy. Learn more.}

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@crates/logfwd-io/src/tail.rs`:
- Line 291: Add a brief comment immediately above the let watch_paths =
self.watch_paths.clone(); line explaining that the clone is done to avoid a
borrow-checker conflict when iterating while mutating reader.files, that it
occurs on the poll hot path but is necessary given current ownership, and that
its allocation cost has been considered (or benchmark justification noted);
reference the watch_paths.clone() use and the mutable iteration of reader.files
so future readers understand why the allocation is acceptable.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository YAML (base), Organization UI (inherited)

Review profile: ASSERTIVE

Plan: Pro

Run ID: e68191b2-f801-463c-98a6-c9fad8ab4249

📥 Commits

Reviewing files that changed from the base of the PR and between 5cc4635 and 277cf87.

📒 Files selected for processing (1)

crates/logfwd-io/src/tail.rs

coderabbitai · 2026-04-04T19:37:06Z

+        reader: &mut FileReader,
+        events: &mut Vec<TailEvent>,
+    ) {
        let watch_paths = self.watch_paths.clone();


🧹 Nitpick | 🔵 Trivial

Hot-path allocation: watch_paths.clone() runs every poll cycle.

Per coding guidelines, allocations in hot-path loops require benchmark justification. The clone is necessary here due to borrow-checker constraints (iterating while mutating reader.files), but a brief comment explaining why this is acceptable would satisfy the guideline.

📝 Suggested comment

+ // Clone watch_paths to release `&self` borrow before mutating reader. + // Allocation is O(num_watched_paths) per poll; acceptable for typical file counts (<1000). let watch_paths = self.watch_paths.clone();

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

let watch_paths = self.watch_paths.clone();

// Clone watch_paths to release `&self` borrow before mutating reader.

// Allocation is O(num_watched_paths) per poll; acceptable for typical file counts (<1000).

let watch_paths = self.watch_paths.clone();

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@crates/logfwd-io/src/tail.rs` at line 291, Add a brief comment immediately above the let watch_paths = self.watch_paths.clone(); line explaining that the clone is done to avoid a borrow-checker conflict when iterating while mutating reader.files, that it occurs on the poll hot path but is necessary given current ownership, and that its allocation cost has been considered (or benchmark justification noted); reference the watch_paths.clone() use and the mutable iteration of reader.files so future readers understand why the allocation is acceptable.

Reflect the current state after #939: FileTailer is now composed of FileDiscovery + FileReader, all stage boundaries carry Bytes, and the buffer lifecycle section shows the honest copy count (3 remaining, down from 5) with the target from #608 (2 for passthrough, 1 unavoidable for CRI). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@dev-docs/ARCHITECTURE.md`:
- Around line 40-52: Documentation and code disagree: FileReader still uses
Vec<u8> and InputEvent::Data carries Vec<u8> but docs describe BytesMut →
freeze() → Bytes; update the implementation so FileReader's buffer (read_buf) is
a BytesMut, have FileReader::read_new_data produce and return a Bytes (call
freeze() / into()) instead of Vec<u8>, and change InputEvent::Data to store
Bytes rather than Vec<u8> so the mpsc channel carries Bytes across threads;
update all call sites of read_new_data, InputEvent::Data construction, and any
type annotations to use Bytes/BytesMut (and ensure ownership/refcounting is
preserved when sending across the channel).
- Around line 57-58: The docs incorrectly state that FramedInput works with
Bytes; update the architecture docs to reflect that FramedInput::poll and
related structures use Vec<u8> (not Bytes). Specifically mention the actual
buffer types: FramedInput::out_buf: Vec<u8>, FramedInput::spare_buf: Vec<u8>,
SourceState.remainder: Vec<u8>, and InputEvent::Data { bytes: Vec<u8> }, and
change the diagram/text "newline-delimited JSON as Bytes" to "newline-delimited
JSON as Vec<u8>" (or equivalent wording) so the documentation matches the
implementation.
- Line 56: Update the fenced code block containing "Bytes → FramedInput::poll()
→ newline-delimited JSON as Bytes" to include a language specifier for syntax
highlighting (e.g., change the opening ``` to ```text); locate the fence in
ARCHITECTURE.md where that exact line appears and add the language identifier so
the block reads ```text followed by the line and closing ``` to enable proper
rendering.
- Around line 274-281: The ARCHITECTURE.md section claiming "BytesMut → freeze()
→ Bytes" and zero-copy behavior is inaccurate because the code still uses
Vec<u8> (e.g., InputEvent::Data is Vec<u8>, FramedInput fields are Vec<u8> and
there is no freeze()/Bytes transition); update the documentation to describe the
current Vec<u8> pipeline (replace BytesMut/Bytes and zero-copy claims with the
actual steps: Vec<u8> storage in InputEvent::Data, FramedInput using Vec<u8>,
copy points such as extend_from_slice and any copies in
FormatProcessor/pipeline) OR alternatively implement Phase 2 fully by converting
InputEvent::Data to Bytes, changing FramedInput fields to BytesMut and adding
the freeze() transition and zero-copy handoff; choose one approach and make the
docs and PR summary consistent with the chosen implementation.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository YAML (base), Organization UI (inherited)

Review profile: ASSERTIVE

Plan: Pro

Run ID: 55685c8c-d08c-411e-b438-8e47f5651cd3

📥 Commits

Reviewing files that changed from the base of the PR and between 277cf87 and 60b6820.

📒 Files selected for processing (1)

dev-docs/ARCHITECTURE.md

coderabbitai · 2026-04-04T19:53:24Z


 ### 2. Framing: raw bytes → complete lines

 ```


⚠️ Potential issue | 🟡 Minor

Add language specifier to fenced code block.

The code block lacks a language identifier for syntax highlighting.

📝 Proposed fix

-``` +```text Bytes → FramedInput::poll() → newline-delimited JSON as Bytes

</details> <details> <summary>🧰 Tools</summary> <details> <summary>🪛 markdownlint-cli2 (0.22.0)</summary> [warning] 56-56: Fenced code blocks should have a language specified (MD040, fenced-code-language) </details> </details> <details> <summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @dev-docs/ARCHITECTURE.md at line 56, Update the fenced code block containing
"Bytes → FramedInput::poll() → newline-delimited JSON as Bytes" to include a
language specifier for syntax highlighting (e.g., change the opening ``` to

add the language identifier so the block reads ```text followed by the line and closing ``` to enable proper rendering.

Split the monolithic FileTailer into three components: - FileDiscovery: notify watcher, glob patterns, rotation detection, deleted-file cleanup - FileReader: open file descriptors, read buffer, read_new_data(), drain_file(), evict_lru(), offset queries - FileTailer: thin compositor with poll() orchestrating both No public API changes. All master features preserved: source_id on TailEvent variants, EvictedFile with identity verification (#817), evicted offsets in file_offsets()/file_paths() (#697), drain with source_id propagation. Also fixes pre-existing lint warnings: - clippy::needless_range_loop in star_schema scatter loop - unused_qualifications for UInt32Array in otap_receiver tests Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Reflect the current state after the FileTailer split: FileTailer is now composed of FileDiscovery + FileReader. Update buffer lifecycle section to show current copy count and target architecture from #608. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

macroscopeapp · 2026-04-04T20:53:50Z

+    /// Drain remaining data from a single file, emitting Data/Truncated events.
+    ///
+    /// Factored from the repeated drain pattern used during rotation and
+    /// deletion cleanup. Preserves source_id on all emitted events.
+    fn drain_file(
+        &mut self,
+        path: &Path,
+        source_id: Option<SourceId>,
+        events: &mut Vec<TailEvent>,
+    ) {
+        match self.read_new_data(path) {
+            Ok(ReadResult::Data(data)) => {
+                events.push(TailEvent::Data {
+                    path: path.to_path_buf(),
+                    bytes: data,
+                    source_id,
+                });
+            }
+            Ok(ReadResult::TruncatedThenData(data)) => {
+                events.push(TailEvent::Truncated {
+                    path: path.to_path_buf(),
+                    source_id,
+                });
+                events.push(TailEvent::Data {
+                    path: path.to_path_buf(),
+                    bytes: data,
+                    source_id,
                });
-                let _ = self.files.remove(path);
-                if let Err(e) = self.open_file_at(path, false) {
-                    tracing::warn!(path = %path.display(), error = %e, "tail.open_after_rotation_failed");
-                }
-            } else if is_new {
-                if let Err(e) = self.open_file_at(path, false) {
-                    tracing::warn!(path = %path.display(), error = %e, "tail.open_new_file_failed");
-                }
+            }


🟢 Low src/tail.rs:560

When drain_file handles TruncatedThenData, it emits the Data event with the pre-read source_id rather than the post-read identity. Since read_new_data updates tailed.identity.fingerprint during truncation detection (lines 503-504), the data belongs to the new identity, but the event is tagged with the old one. This inconsistency with read_all (which uses a fresh source_id for the Data event at lines 640-645) means the final drained data after truncation is incorrectly attributed.

Ok(ReadResult::TruncatedThenData(data)) => { events.push(TailEvent::Truncated { path: path.to_path_buf(), source_id, }); + let new_source_id = self.source_id_for_path(path); events.push(TailEvent::Data { path: path.to_path_buf(), bytes: data, - source_id, + source_id: new_source_id, });

🚀 Reply "fix it for me" or copy this AI Prompt for your agent:

In file crates/logfwd-io/src/tail.rs around lines 560-588: When `drain_file` handles `TruncatedThenData`, it emits the `Data` event with the pre-read `source_id` rather than the post-read identity. Since `read_new_data` updates `tailed.identity.fingerprint` during truncation detection (lines 503-504), the data belongs to the new identity, but the event is tagged with the old one. This inconsistency with `read_all` (which uses a fresh `source_id` for the `Data` event at lines 640-645) means the final drained data after truncation is incorrectly attributed. Evidence trail: - crates/logfwd-io/src/tail.rs lines 564-597: `drain_file` function uses same `source_id` parameter for both Truncated and Data events in TruncatedThenData case - crates/logfwd-io/src/tail.rs lines 503-504: `read_new_data` updates `tailed.identity.fingerprint` during truncation - crates/logfwd-io/src/tail.rs lines 632-645: `read_all` uses `pre_read_source_id` for Truncated but calls `self.source_id_for_path(&path)` for fresh source_id on Data event - crates/logfwd-io/src/tail.rs lines 761-766: `source_id_for_path` returns source_id based on current tailed.identity - crates/logfwd-io/src/tail.rs lines 340-346, 386-388: callers of `drain_file` capture source_id before calling it

macroscopeapp · 2026-04-04T20:53:50Z

🟡 Medium

https://github.com/strawgate/memagent/blob/aa73abbe432a48ee67d2a6e2d8de808769d8fc2b/crates/logfwd-io/src/tail.rs#L736-L750

set_offset_by_source seeks to the saved offset without validating it against the current file size, unlike set_offset which resets to 0 when the offset exceeds file size (lines 718-729). When a file is truncated between runs, this causes all data between 0 and the stale offset to be skipped — the same data loss issue #656 fixed in set_offset. Consider adding the same file size validation before seeking.

fn set_offset_by_source(&mut self, source_id: SourceId, offset: u64) -> io::Result<()> { for tailed in self.files.values_mut() { if tailed.identity.source_id() == source_id { - tailed.offset = offset; - tailed.file.seek(SeekFrom::Start(offset))?; + let file_size = tailed.file.metadata()?.len(); + let safe_offset = if offset > file_size { + 0 + } else { + offset + }; + tailed.offset = safe_offset; + tailed.file.seek(SeekFrom::Start(safe_offset))?; return Ok(()); } } Ok(()) // source not found — file may not exist yet }

🚀 Reply "fix it for me" or copy this AI Prompt for your agent:

In file crates/logfwd-io/src/tail.rs around lines 736-750: `set_offset_by_source` seeks to the saved offset without validating it against the current file size, unlike `set_offset` which resets to 0 when the offset exceeds file size (lines 718-729). When a file is truncated between runs, this causes all data between 0 and the stale offset to be skipped — the same data loss issue #656 fixed in `set_offset`. Consider adding the same file size validation before seeking. Evidence trail: crates/logfwd-io/src/tail.rs lines 710-750 at REVIEWED_COMMIT: - `set_offset` (lines 716-733) validates offset against file size with `let file_size = tailed.file.metadata()?.len()` and `if offset > file_size` check - `set_offset_by_source` (lines 740-750) directly sets offset and seeks without any file size validation - Comment on `set_offset` explicitly references #656 as the fix for this truncation scenario

The #939 merge changed logfwd-io to produce Bytes but pipeline.rs still used Vec<u8> — causing a wasteful Bytes→Vec→Bytes round-trip at the pipeline boundary. Changes: - Add bytes dependency to logfwd crate - ChannelMsg::Data carries Bytes instead of Vec<u8> - InputState.buf is BytesMut with split().freeze() on send - scan_buf is BytesMut with split().freeze() in flush_batch - Scanner receives Bytes directly (removes .into() conversion) Generator benchmark (3 runs each, drop warmup): master: 1,658K / 1,755K lines/sec refactor: 2,779K / 2,782K lines/sec (+60%) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Fix test assertions to use Bytes::from_static instead of vec![] - Correct ARCHITECTURE.md "Buffer lifecycle" section to reflect actual state: tailer/input/framed still use Vec<u8>, only pipeline.rs uses BytesMut/Bytes. Previous version overstated what #939 delivered. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

The #939 merge changed logfwd-io to produce Bytes but pipeline.rs still used Vec<u8> — causing a wasteful Bytes→Vec→Bytes round-trip at the pipeline boundary. Changes: - Add bytes dependency to logfwd crate - ChannelMsg::Data carries Bytes instead of Vec<u8> - InputState.buf is BytesMut with split().freeze() on send - scan_buf is BytesMut with split().freeze() in flush_batch - Scanner receives Bytes directly (removes .into() conversion) Generator benchmark (3 runs each, drop warmup): master: 1,658K / 1,755K lines/sec refactor: 2,779K / 2,782K lines/sec (+60%) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Fix test assertions to use Bytes::from_static instead of vec![] - Correct ARCHITECTURE.md "Buffer lifecycle" section to reflect actual state: tailer/input/framed still use Vec<u8>, only pipeline.rs uses BytesMut/Bytes. Previous version overstated what #939 delivered. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

macroscopeapp Bot reviewed Apr 4, 2026

View reviewed changes

Comment thread crates/logfwd-io/src/tail.rs Outdated

coderabbitai Bot requested changes Apr 4, 2026

View reviewed changes

strawgate mentioned this pull request Apr 4, 2026

perf: eliminate per-poll allocations in input→scanner data path #608

Closed

coderabbitai Bot requested changes Apr 4, 2026

View reviewed changes

strawgate force-pushed the refactor/split-file-tailer branch from 60b6820 to 399532a Compare April 4, 2026 20:48

strawgate and others added 2 commits April 4, 2026 15:50

strawgate force-pushed the refactor/split-file-tailer branch from 399532a to aa73abb Compare April 4, 2026 20:50

macroscopeapp Bot reviewed Apr 4, 2026

View reviewed changes

strawgate merged commit e25c5bf into master Apr 4, 2026
14 checks passed

strawgate deleted the refactor/split-file-tailer branch April 4, 2026 20:56

strawgate mentioned this pull request Apr 4, 2026

refactor(logfwd): complete Bytes migration in pipeline.rs #963

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(logfwd-io): split FileTailer + Bytes buffer ownership#939

refactor(logfwd-io): split FileTailer + Bytes buffer ownership#939
strawgate merged 2 commits into
masterfrom
refactor/split-file-tailer

strawgate commented Apr 4, 2026 •

edited by macroscopeapp Bot

Loading

Uh oh!

coderabbitai Bot commented Apr 4, 2026 •

edited

Loading

Rate limit exceeded

❌ Failed checks (2 warnings)

Uh oh!

Uh oh!

macroscopeapp Bot commented Apr 4, 2026 •

edited

Loading

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Apr 4, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

coderabbitai Bot Apr 4, 2026

Uh oh!

Uh oh!

Uh oh!

macroscopeapp Bot Apr 4, 2026

Uh oh!

macroscopeapp Bot Apr 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

strawgate commented Apr 4, 2026 • edited by macroscopeapp Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What this eliminates

What this does NOT do (follow-up work)

Why Bytes, not Vec ownership

Throughput impact

Test plan

Split FileTailer into FileDiscovery and FileReader structs in logfwd-io

Uh oh!

coderabbitai Bot commented Apr 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rate limit exceeded

Walkthrough

Possibly related PRs

❌ Failed checks (2 warnings)

Uh oh!

Uh oh!

macroscopeapp Bot commented Apr 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Approvability

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 4, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot Apr 4, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

macroscopeapp Bot Apr 4, 2026

Choose a reason for hiding this comment

Uh oh!

macroscopeapp Bot Apr 4, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

strawgate commented Apr 4, 2026 •

edited by macroscopeapp Bot

Loading

Split `FileTailer` into `FileDiscovery` and `FileReader` structs in logfwd-io

coderabbitai Bot commented Apr 4, 2026 •

edited

Loading

macroscopeapp Bot commented Apr 4, 2026 •

edited

Loading