[SPARK-27938][SQL] Remove feature flag LEGACY_PASS_PARTITION_BY_AS_OPTIONS#24784
[SPARK-27938][SQL] Remove feature flag LEGACY_PASS_PARTITION_BY_AS_OPTIONS#24784liwensun wants to merge 3 commits into
Conversation
|
Test build #106123 has finished for PR 24784 at commit
|
| "turning the flag on provides a way for these sources to see these partitionBy columns.") | ||
| .booleanConf | ||
| .createWithDefault(false) | ||
| .createWithDefault(true) |
There was a problem hiding this comment.
Actually, why do we need this configuration if it's not invasive?
There was a problem hiding this comment.
I think it was to avoid possible regression for the release 2.4.3, which was reasonable at that time.
There was a problem hiding this comment.
It's not intrusive then should be no regression strictly though. Yea but I got that it was an extra caution. Can we remove it now in master?
There was a problem hiding this comment.
Agree, we should be removing these in 3.0 rather than defaulting back to legacy behavior.
There was a problem hiding this comment.
Sounds good to me. I will go ahead and remove this feature flag altogether if no objections.
|
I have updated the PR to remove the config. Thanks for the feedback! |
|
Test build #106256 has finished for PR 24784 at commit
|
|
LGTM pending Jenkins. |
|
Test build #106258 has finished for PR 24784 at commit
|
| .save() | ||
|
|
||
| val partColumns = LastOptions.parameters(DataSourceUtils.PARTITIONING_COLUMNS_KEY) | ||
| assert(DataSourceUtils.decodePartitioningColumns(partColumns) === Seq("col1", "col2")) |
There was a problem hiding this comment.
decodePartitioningColumns is under execution package that's not supposed to be exposed so users shouldn't use this util directly.
Did we document this option to any public datasource v1 API? We should also say this is a JSON string.
|
LGTM too. strictly #24784 (comment) can be done separately. Merged to master. |
…TIONS ## What changes were proposed in this pull request? In PR apache#24365, we pass in the partitionBy columns as options in `DataFrameWriter`. To make this change less intrusive for a patch release, we added a feature flag `LEGACY_PASS_PARTITION_BY_AS_OPTIONS` with the default to be false. For 3.0, we should just do the correct behavior for DSV1, i.e., always passing partitionBy as options, and remove this legacy feature flag. ## How was this patch tested? Existing tests. Closes apache#24784 from liwensun/SPARK-27453-default. Authored-by: liwensun <liwen.sun@databricks.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>
What changes were proposed in this pull request?
In PR #24365, we pass in the partitionBy columns as options in
DataFrameWriter. To make this change less intrusive for a patch release, we added a feature flagLEGACY_PASS_PARTITION_BY_AS_OPTIONSwith the default to be false.For 3.0, we should just do the correct behavior for DSV1, i.e., always passing partitionBy as options, and remove this legacy feature flag.
How was this patch tested?
Existing tests.