fix: column indices in FFI partition evaluator by timsaucer · Pull Request #16480 · apache/datafusion

timsaucer · 2025-06-20T14:12:44Z

Which issue does this PR close?

Closes Panic in FFI UDWF when using wrapping lead function datafusion-python#1144

Rationale for this change

There is a bug in how we compute the indices to create a schema for. It misses the last index and it can erroneously create an index 0 when none exists.

What changes are included in this PR?

Properly handle the None case when finding max index.

Are these changes tested?

Unit test will be added before marking this PR ready. Tested in datafusion-python.

Are there any user-facing changes?

No

alamb

Thank you @timsaucer -- this makes sense to me

alamb · 2025-06-26T18:14:17Z

datafusion/ffi/src/udwf/partition_evaluator_args.rs

-                    DataType::Null,
-                    true,
-                ),
+        let max_column = required_columns.keys().max();


I wonder if it is time to simply pass in the real schema to PartitionEvaluatorArgs 🤔

alamb · 2025-06-26T18:19:00Z

datafusion/ffi/src/udwf/mod.rs

+    }
+
+    #[tokio::test]
+    async fn test_lag_udwf() -> datafusion::common::Result<()> {


I verified these tests fail without the code change in this PR

thread 'udwf::tests::test_lag_udwf' panicked at /Users/andrewlamb/.cargo/registry/src/index.crates.io-1949cf8c6b5b557f/arrow-schema-55.1.0/src/schema.rs:382:10: index out of bounds: the len is 0 but the index is 0 note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace thread 'udwf::tests::test_lag_udwf' panicked at library/core/src/panicking.rs:218:5: panic in a function that cannot unwind stack backtrace: 0: 0x108eec990 - std::backtrace_rs::backtrace::libunwind::trace::hd09570c029a6744a at /rustc/17067e9ac6d7ecb70e50f92c1944e545188d2359/library/std/src/../../backtrace/src/backtrace/libunwind.rs:117:9 1: 0x108eec990 - std::backtrace_rs::backtrace::trace_unsynchronized::h8d2fa64833f91cb3

* Column indices were not computed correctly, causing a panic * Add unit tests

* Column indices were not computed correctly, causing a panic * Add unit tests Co-authored-by: Tim Saucer <timsaucer@gmail.com>

Column indices were not computed correctly, causing a panic

c424d44

github-actions bot added the ffi Changes to the ffi crate label Jun 20, 2025

timsaucer mentioned this pull request Jun 20, 2025

Release DataFusion 48.0.1 #16486

Closed

Add unit tests

a707477

timsaucer marked this pull request as ready for review June 26, 2025 12:13

alamb approved these changes Jun 26, 2025

View reviewed changes

alamb merged commit cce3f3f into apache:main Jun 27, 2025
27 checks passed

timsaucer deleted the bugfix/ffi-column-indices branch June 30, 2025 11:28

alamb pushed a commit to alamb/datafusion that referenced this pull request Jul 2, 2025

fix: column indices in FFI partition evaluator (apache#16480)

113843e

* Column indices were not computed correctly, causing a panic * Add unit tests

alamb mentioned this pull request Jul 2, 2025

[branch-48] fix: column indices in FFI partition evaluator (#16480) #16657

Merged

alamb added a commit that referenced this pull request Jul 4, 2025

fix: column indices in FFI partition evaluator (#16480) (#16657)

bcb8dc5

* Column indices were not computed correctly, causing a panic * Add unit tests Co-authored-by: Tim Saucer <timsaucer@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: column indices in FFI partition evaluator#16480

fix: column indices in FFI partition evaluator#16480
alamb merged 2 commits intoapache:mainfrom
timsaucer:bugfix/ffi-column-indices

timsaucer commented Jun 20, 2025 •

edited

Loading

Uh oh!

alamb left a comment

Uh oh!

alamb Jun 26, 2025

Uh oh!

alamb Jun 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

timsaucer commented Jun 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

Uh oh!

alamb Jun 26, 2025

Choose a reason for hiding this comment

Uh oh!

alamb Jun 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

timsaucer commented Jun 20, 2025 •

edited

Loading