Closed
1178 commits
6f6dcc8
[SPARK-31854][SQL][2.4] Invoke in MapElementsExec should not propagat…
maropu Jun 1, 2020
ac59a8c
[SPARK-31889][BUILD] Docker release script does not allocate enough m…
holdenk Jun 1, 2020
bf01280
[SPARK-31860][BUILD][2.4] only push release tags on success
holdenk Jun 2, 2020
476010a
[SPARK-31903][SQL][PYSPARK][2.4] Fix toPandas with Arrow enabled to s…
ueshin Jun 6, 2020
48017cc
[SPARK-31923][CORE] Ignore internal accumulators that use unrecognize…
zsxwing Jun 8, 2020
2556d32
[SPARK-31935][2.4][SQL] Hadoop file system config should be effective…
gengliangwang Jun 10, 2020
53f1349
[SPARK-31941][CORE] Replace SparkException to NoSuchElementException …
SaurabhChawla100 Jun 10, 2020
e51eb3a
[SPARK-31954][SQL] Delete duplicate testcase in HiveQuerySuite
GuoPhilipse Jun 11, 2020
f191e2b
[SPARK-31967][UI][2.4] Downgrade to vis.js 4.21.0 to fix Jobs UI load…
gengliangwang Jun 13, 2020
5de1929
[SPARK-31632][CORE][WEBUI][FOLLOWUP] Enrich the exception message whe…
sarutak Jun 14, 2020
e44190a
[SPARK-31968][SQL] Duplicate partition columns check when writing data
TJX2014 Jun 14, 2020
90e928c
[SPARK-29152][CORE][2.4] Executor Plugin shutdown when dynamic alloca…
iRakson Jun 14, 2020
a89a674
Revert "[SPARK-29152][CORE][2.4] Executor Plugin shutdown when dynami…
dongjoon-hyun Jun 15, 2020
e1cb384
[SPARK-31997][SQL][TESTS] Drop test_udtf table when SingleSessionSuit…
LuciferYang Jun 16, 2020
23ff9e6
[SPARK-32000][2.4][CORE][TESTS] Fix the flaky test for partially laun…
Ngone51 Jun 17, 2020
9c5c823
[SPARK-31871][CORE][WEBUI][2.4] Display the canvas element icon for s…
liucht-inspur Jun 17, 2020
dee27ee
[SPARK-31980][SQL] Function sequence() fails if start and end of rang…
TJX2014 Jun 20, 2020
16c70cf
Revert "[SPARK-31980][SQL] Function sequence() fails if start and end…
dongjoon-hyun Jun 20, 2020
d0a2d33
[SPARK-31980][SQL][2.4] Function sequence() fails if start and end of…
TJX2014 Jun 20, 2020
e50c1e5
[SPARK-32034][SQL][2.4] Port HIVE-14817: Shutdown the SessionManager …
yaooqinn Jun 22, 2020
9fb3760
[SPARK-32044][SS][2.4] Kakfa continuous processing print mislead init…
Jun 22, 2020
29873c9
[SPARK-31918][R] Ignore S4 generic methods under SparkR namespace in …
HyukjinKwon Jun 24, 2020
77006b2
[SPARK-32073][R] Drop R < 3.5 support
HyukjinKwon Jun 24, 2020
a295003
[SPARK-32098][PYTHON] Use iloc for positional slicing instead of dire…
HyukjinKwon Jun 25, 2020
1eda585
[SPARK-32115][SQL] Fix SUBSTRING to handle integer overflows
xuanyuanking Jun 28, 2020
409930d
[SPARK-32028][WEBUI][2.4] fix app id link for multi attempts app in h…
zhli1142015 Jul 1, 2020
37b32c8
[SPARK-32131][SQL] Fix AnalysisException messages at UNION/EXCEPT/MIN…
GuoPhilipse Jul 1, 2020
bc1acfe
[SPARK-32089][R][BUILD] Upgrade R version to 4.0.2 in the release Doc…
HyukjinKwon Jun 24, 2020
7f4d452
[SPARK-31935][2.4][SQL][FOLLOWUP] Hadoop file system config should be…
cloud-fan Jul 2, 2020
2227a16
[MINOR][TEST][SQL] Make in-limit.sql more robust
cloud-fan Jul 2, 2020
2bb5ced
[SPARK-32214][SQL] The type conversion function generated in makeFrom…
sarutak Jul 8, 2020
eddf40d
[SPARK-32024][WEBUI] Update ApplicationStoreInfo.size during HistoryS…
zhli1142015 Jul 8, 2020
146062d
[SPARK-32167][2.4][SQL] Fix GetArrayStructFields to respect inner fie…
cloud-fan Jul 8, 2020
49c6877
[SPARK-32024][WEBUI][FOLLOWUP] Quick fix on test failure on missing w…
HeartSaVioR Jul 9, 2020
d5b903e
[SPARK-32035][DOCS][EXAMPLES] Fixed typos involving AWS Access, Secre…
kaxio Jul 9, 2020
4c82ae8
[SPARK-32238][SQL] Use Utils.getSimpleName to avoid hitting Malformed…
Ngone51 Jul 11, 2020
a4854d6
[MINOR][DOCS] Fix typo in PySpark example in ml-datasource.md
chx5 Jul 12, 2020
5084c71
[SPARK-32300][PYTHON][2.4] toPandas should work from a Spark DataFram…
HyukjinKwon Jul 14, 2020
9aeeb0f
[SPARK-32318][SQL][TESTS] Add a test case to EliminateSortsSuite for …
dongjoon-hyun Jul 15, 2020
afdad0e
[SPARK-32018][SQL][2.4] UnsafeRow.setDecimal should set null with ove…
cloud-fan Jul 17, 2020
58c637a
[SPARK-32344][SQL][2.4] Unevaluable expr is set to FIRST/LAST ignoreN…
maropu Jul 20, 2020
c9c187a
[SPARK-32367][K8S][TESTS] Correct the spelling of parameter in Kubern…
merrily01 Jul 20, 2020
e37b67a
[MINOR][DOCS] add link for Debugging your Application in running-on-y…
brandonJY Jul 21, 2020
822cc34
[SPARK-32379][BUILD] docker based spark release script should use cor…
ScrapCodes Jul 21, 2020
df930e1
[SPARK-32377][SQL][2.4] CaseInsensitiveMap should be deterministic fo…
dongjoon-hyun Jul 21, 2020
6a653a2
[SPARK-32364][SQL][2.4] Use CaseInsensitiveMap for DataFrameReader/Wr…
dongjoon-hyun Jul 23, 2020
6ee0eb4
[SPARK-32280][SPARK-32372][2.4][SQL] ResolveReferences.dedupRight sho…
Ngone51 Jul 24, 2020
62671af
[SPARK-32428][EXAMPLES] Make BinaryClassificationMetricsExample cons…
titsuki Jul 26, 2020
c1421d0
[MINOR][PYTHON] Fix spacing in error message
hauntsaninja Jul 28, 2020
4a8f692
[SPARK-32397][BUILD] Allow specifying of time for build to keep time …
holdenk Jul 29, 2020
91f2a25
[SPARK-28818][SQL][2.4] Respect source column nullability in the arra…
Aug 2, 2020
39d31dc
[SPARK-32003][CORE][2.4] When external shuffle service is used, unreg…
wypoon Aug 4, 2020
dc04bf5
Preparing Spark release v2.4.7-rc1
ScrapCodes Aug 6, 2020
7dc17cd
Preparing development version 2.4.8-SNAPSHOT
ScrapCodes Aug 6, 2020
701c44d
[SPARK-32560][SQL] Improve exception message at InsertIntoHiveTable.p…
GuoPhilipse Aug 7, 2020
a693960
[SPARK-32556][INFRA][2.4] Fix release script to URL encode the user p…
ScrapCodes Aug 7, 2020
f1f6fc3
[MINOR][DOCS] Fix typos at ExecutorAllocationManager.scala
kimminw00 Aug 8, 2020
b9595c3
[MINOR] Update URL of the parquet project in code comment
izchen Aug 13, 2020
e5bef51
[SPARK-31703][SQL] Parquet RLE float/double are read incorrectly on b…
tinhto-000 Aug 13, 2020
75d45fa
[SPARK-32625][SQL] Log error message when falling back to interpreter…
wangyum Aug 15, 2020
e3ec2d7
Revert "[SPARK-32018][SQL][2.4] UnsafeRow.setDecimal should set null …
cloud-fan Aug 17, 2020
34544d6
[SPARK-32609] Incorrect exchange reuse with DataSourceV2
mingjialiu Aug 17, 2020
ff8a663
[SPARK-32647][INFRA] Report SparkR test results with JUnit reporter
HyukjinKwon Aug 18, 2020
7c65f76
[SPARK-32249][INFRA][2.4] Run Github Actions builds in branch-2.4
HyukjinKwon Aug 20, 2020
d846476
[SPARK-31966][ML][TESTS][PYTHON][2.4] Increase the timeout for Stream…
HyukjinKwon Aug 20, 2020
4344a69
[SPARK-32674][DOC] Add suggestion for parallel directory listing in t…
sunchao Aug 21, 2020
55efce1
Preparing Spark release v2.4.7-rc2
ScrapCodes Aug 21, 2020
1b42be4
Preparing development version 2.4.8-SNAPSHOT
ScrapCodes Aug 21, 2020
b3f7989
[SPARK-32672][SQL] Fix data corruption in boolean bit set compression
revans2 Aug 22, 2020
d7e1746
[MINOR][SQL] Add missing documentation for LongType mapping
yeshengm Aug 25, 2020
242c01d
[SPARK-32695][INFRA] Explicitly cache and hash 'build' directly in Gi…
HyukjinKwon Aug 26, 2020
ef42225
[SPARK-32693][SQL][2.4] Compare two dataframes with same schema excep…
viirya Aug 29, 2020
081c12b
[SPARK-32771][DOCS] The example of expressions.Aggregator in Javadoc …
sarutak Sep 2, 2020
bc471f3
[SPARK-32810][SQL][2.4] CSV/JSON data sources should avoid globbing p…
MaxGekk Sep 8, 2020
f2bcc93
[SPARK-32812][PYTHON][TESTS] Avoid initiating a process during the ma…
itholic Sep 8, 2020
277ccba
[SPARK-31511][SQL][2.4] Make BytesToBytesMap iterators thread-safe
cxzl25 Sep 8, 2020
14211a1
Preparing Spark release v2.4.7-rc3
ScrapCodes Sep 8, 2020
3980c66
Preparing development version 2.4.8-SNAPSHOT
ScrapCodes Sep 8, 2020
ef24542
[SPARK-32815][ML][2.4] Fix LibSVM data source loading error on file p…
MaxGekk Sep 8, 2020
c4590b6
[SPARK-32836][SS][TESTS] Fix DataStreamReaderWriterSuite to check wri…
dongjoon-hyun Sep 10, 2020
ee6b7d4
[SPARK-32832][SS][2.4] Use CaseInsensitiveMap for DataStreamReader/Wr…
dongjoon-hyun Sep 11, 2020
c82b6e4
[SPARK-32794][SS] Fixed rare corner case error in micro-batch engine …
tdas Sep 11, 2020
be76ee9
[SPARK-32845][SS][TESTS] Add sinkParameter to check sink options robu…
dongjoon-hyun Sep 11, 2020
4072665
[SPARK-32865][DOC] python section in quickstart page doesn't display …
bowenli86 Sep 13, 2020
d33052b
[SPARK-32708] Query optimization fails to reuse exchange with DataSou…
mingjialiu Sep 14, 2020
ad61af0
[SPARK-32872][CORE] Prevent BytesToBytesMap at MAX_CAPACITY from exce…
ankurdave Sep 14, 2020
94b9d6f
Revert "[SPARK-32872][CORE] Prevent BytesToBytesMap at MAX_CAPACITY f…
dongjoon-hyun Sep 14, 2020
5543f98
[SPARK-32872][CORE][2.4] Prevent BytesToBytesMap at MAX_CAPACITY from…
ankurdave Sep 15, 2020
cf0c907
[SPARK-32715][CORE] Fix memory leak when failed to store pieces of br…
LantaoJin Sep 15, 2020
ce545d1
[SPARK-32888][DOCS] Add user document about header flag and RDD as pa…
viirya Sep 16, 2020
2fa68a6
[SPARK-32900][CORE] Allow UnsafeExternalSorter to spill when there ar…
tomvanbussel Sep 17, 2020
eed7c62
[SPARK-32908][SQL][2.4] Fix target error calculation in `percentile_a…
MaxGekk Sep 18, 2020
62708db
[SPARK-32635][SQL][2.4] Fix foldable propagation
peter-toth Sep 18, 2020
b8e6fa7
[SPARK-32738][CORE][2.4] Should reduce the number of active threads i…
wzhfy Sep 18, 2020
2516128
[SPARK-32886][WEBUI] fix 'undefined' link in event timeline view
zhli1142015 Sep 21, 2020
7d935ff
[HOTFIX][2.4] Revert SPARK-32886
sarutak Sep 21, 2020
e1e94ed
[SPARK-32898][2.4][CORE] Fix wrong executorRunTime when task killed b…
Ngone51 Sep 22, 2020
d204795
[SPARK-32306][SQL][DOCS][2.4] Clarify the result of `percentile_appro…
MaxGekk Sep 23, 2020
1366443
[MINOR][SQL][2.4] Improve examples for `percentile_approx()`
MaxGekk Sep 23, 2020
cd3caab
[SPARK-32886][SPARK-31882][WEBUI][2.4] fix 'undefined' link in event …
zhli1142015 Sep 25, 2020
bf32ac8
[SPARK-27872][K8S][2.4] Fix executor service account inconsistency
Sep 25, 2020
32a28ff
Revert "[SPARK-27872][K8S][2.4] Fix executor service account inconsis…
dongjoon-hyun Sep 25, 2020
1973515
[SPARK-32999][SQL][2.4] Use Utils.getSimpleName to avoid hitting Malf…
Sep 29, 2020
326cdff
[SPARK-33051][INFRA][R] Uses setup-r to install R in GitHub Actions b…
HyukjinKwon Oct 2, 2020
e2e60e7
[SPARK-33069][INFRA] Skip test result report if no JUnit XML files ar…
HyukjinKwon Oct 6, 2020
1a77846
[SPARK-30201][SQL][2.4] HiveOutputWriter standardOI should use Object…
ulysses-you Oct 6, 2020
45a8b89
[SPARK-32901][CORE][2.4] Do not allocate memory while spilling Unsafe…
tomvanbussel Oct 8, 2020
3e28f49
[SPARK-33096][K8S] Use LinkedHashMap instead of Map for newlyCreatedE…
dongjoon-hyun Oct 8, 2020
27b75af
[SPARK-33101][ML][3.0] Make LibSVM format propagate Hadoop config fro…
MaxGekk Oct 9, 2020
e21c2d3
[SPARK-33094][SQL][2.4] Make ORC format propagate Hadoop config from …
MaxGekk Oct 10, 2020
5a50e30
[SPARK-27872][K8S][2.4] Fix executor service account inconsistency
Oct 12, 2020
652e574
[SPARK-25271][SQL][2.4] Hive ctas commands should use data source if …
viirya Oct 12, 2020
816ffd7
[SPARK-33136][SQL][2.4] Fix mistakenly swapped parameter in V2WriteCo…
HeartSaVioR Oct 15, 2020
4353f7d
[SPARK-30894][SQL][2.4] Make Size's nullable independent from SQL con…
MaxGekk Oct 15, 2020
2e72b01
[SPARK-26646][TEST][PYSPARK][2.4] Fix flaky test: pyspark.mllib.tests…
viirya Oct 17, 2020
1d9ffad
[SPARK-33131][SQL][2.4] Fix grouping sets with having clause can not …
ulysses-you Oct 18, 2020
4ebe947
[MINOR][DOCS][EXAMPLE] Fix the Python manual_load_options_csv example
kjmrknsn Oct 18, 2020
15dc67e
[SPARK-33123][INFRA] Ignore GitHub only changes in Amplab Jenkins build
williamhyun Oct 19, 2020
ad5d9fc
Revert "[SPARK-33069][INFRA] Skip test result report if no JUnit XML …
HyukjinKwon Oct 19, 2020
623d84c
[SPARK-33190][INFRA][TESTS] Set upper bound of PyArrow version in Git…
HyukjinKwon Oct 20, 2020
0a6dedd
[SPARK-33189][PYTHON][TESTS] Add env var to tests for legacy nested t…
BryanCutler Oct 21, 2020
9634da8
[SPARK-33189][FOLLOWUP][2.4] Fix syntax error in python/run-tests.py
dongjoon-hyun Oct 22, 2020
a39a096
Revert "[SPARK-33189][FOLLOWUP][2.4] Fix syntax error in python/run-t…
HyukjinKwon Oct 22, 2020
88a3110
Revert "[SPARK-33189][PYTHON][TESTS] Add env var to tests for legacy …
HyukjinKwon Oct 22, 2020
970a602
Revert "[SPARK-33190][INFRA][TESTS] Set upper bound of PyArrow versio…
HyukjinKwon Oct 22, 2020
da18b51
[SPARK-32247][INFRA] Install and test scipy with PyPy in GitHub Actions
HyukjinKwon Oct 15, 2020
8b9036f
[SPARK-33217][INFRA][PYTHON][2.4] Set upper bound of Pandas and PyArr…
HyukjinKwon Oct 22, 2020
a85d690
[SPARK-33228][SQL] Don't uncache data when replacing a view having th…
maropu Oct 25, 2020
5001349
[SPARK-33230][SQL] Hadoop committers to get unique job ID in "spark.s…
steveloughran Oct 26, 2020
e73acc5
Revert "[SPARK-33228][SQL] Don't uncache data when replacing a view h…
viirya Oct 26, 2020
ffda450
[SPARK-32090][SQL] Improve UserDefinedType.equal() to make it be symm…
Ngone51 Jun 29, 2020
d1f4a34
[SPARK-33228][SQL] Don't uncache data when replacing a view having th…
maropu Oct 25, 2020
7a3be12
[SPARK-33292][SQL] Make Literal ArrayBasedMapData string representati…
dongjoon-hyun Oct 30, 2020
6ae6999
Revert "[SPARK-33292][SQL] Make Literal ArrayBasedMapData string repr…
dongjoon-hyun Oct 30, 2020
bf4d7a6
[SPARK-33268][SQL][PYTHON][2.4] Fix bugs for casting data from/to Pyt…
maropu Oct 30, 2020
f1c3041
[SPARK-33292][SQL][2.4] Make Literal ArrayBasedMapData string represe…
dongjoon-hyun Oct 30, 2020
779ac0f
[SPARK-33183][SQL][2.4] Fix Optimizer rule EliminateSorts and add a p…
allisonwang-db Oct 30, 2020
64a21ea
[SPARK-29574][K8S][2.4] Add SPARK_DIST_CLASSPATH to the executor clas…
Oct 31, 2020
a32178c
[SPARK-33290][SQL][2.4] REFRESH TABLE should invalidate cache even th…
sunchao Oct 31, 2020
cabf957
[SPARK-33277][PYSPARK][SQL][2.4] Use ContextAwareIterator to stop con…
ueshin Nov 2, 2020
1bbba60
[SPARK-33313][TESTS][R][3.0][2.4] Add testthat 3.x support
HyukjinKwon Nov 2, 2020
6c5d008
[SPARK-33156][INFRA][2.4] Upgrade GithubAction image from 18.04 to 20.04
dongjoon-hyun Nov 3, 2020
2bde026
[SPARK-33338][SQL] GROUP BY using literal map should not fail
dongjoon-hyun Nov 4, 2020
c342bcd
Revert "[SPARK-33277][PYSPARK][SQL][2.4] Use ContextAwareIterator to …
HyukjinKwon Nov 5, 2020
8684720
[MINOR][SS][DOCS] Update join type in stream static joins code examples
sarveshdave1 Nov 5, 2020
fa1b476
[SPARK-33333][BUILD][2.4] Upgrade Jetty to 9.4.28.v20200408
dongjoon-hyun Nov 6, 2020
bfeaef1
[SPARK-33405][BUILD][2.4] Upgrade commons-compress to 1.20
dongjoon-hyun Nov 10, 2020
efceeee
[SPARK-33372][SQL][2.4] Fix InSet bucket pruning
wangyum Nov 10, 2020
fece4a3
[SPARK-33417][SQL][TEST] Correct the behaviour of query filters in TP…
maropu Nov 11, 2020
1e177c7
[MINOR][GRAPHX][2.4] Correct typos in the sub-modules: graphx, extern…
jsoref Nov 12, 2020
0297d54
[SPARK-33339][PYTHON][2.4] Pyspark application will hang due to non E…
Nov 13, 2020
27c0ccc
[SPARK-33464][INFRA][2.4] Add/remove (un)necessary cache and restruct…
HyukjinKwon Nov 19, 2020
351aef1
[SPARK-33483][INFRA][TESTS][2.4] Fix rat exclusion patterns and add a…
dongjoon-hyun Nov 19, 2020
3772bfa
[SPARK-27421][SQL][2.4] Fix filter for int column and value class jav…
wangyum Nov 19, 2020
6705912
[SPARK-33472][SQL][2.4] Adjust RemoveRedundantSorts rule order
allisonwang-db Nov 20, 2020
7ac44da
[MINOR][INFRA] Suppress warning in check-license
williamhyun Nov 23, 2020
a3d1604
[SPARK-33535][INFRA][TESTS] Export LANG to en_US.UTF-8 in run-tests-j…
LuciferYang Nov 24, 2020
ef1441b
[SPARK-24266][K8S][2.4] Restart the watcher when we receive a version…
Nov 24, 2020
3eb6e38
[SPARK-26645][PYTHON][2.4] Support decimals with negative scale when …
mgaido91 Nov 25, 2020
1a0283b
[SPARK-33585][SQL][DOCS] Fix the comment for `SQLContext.tables()` an…
MaxGekk Nov 29, 2020
7b68757
[SPARK-33588][SQL][2.4] Respect the `spark.sql.caseSensitive` config …
MaxGekk Nov 30, 2020
7a8af18
[SPARK-33631][DOCS][TEST] Clean up spark.core.connection.ack.wait.tim…
LuciferYang Dec 2, 2020
3ab2936
[SPARK-33667][SQL][2.4] Respect the `spark.sql.caseSensitive` config …
MaxGekk Dec 6, 2020
955033a
[SPARK-33675][INFRA][2.4] Add GitHub Action job to publish snapshot
dongjoon-hyun Dec 7, 2020
61a6b47
[SPARK-33681][K8S][TESTS][3.0] Increase K8s IT timeout to 3 minutes
dongjoon-hyun Dec 7, 2020
9ca324a
[SPARK-33675][INFRA][2.4][FOLLOWUP] Set GIT_REF to branch-2.4
dongjoon-hyun Dec 7, 2020
78fade3
[SPARK-33670][SQL][2.4] Verify the partition provider is Hive in v1 S…
MaxGekk Dec 7, 2020
e961cb7
[SPARK-33725][BUILD][2.4] Upgrade snappy-java to 1.1.8.2
viirya Dec 10, 2020
7f2320e
[SPARK-33732][K8S][TESTS][2.4] Kubernetes integration tests doesn't w…
sarutak Dec 10, 2020
1261e13
[SPARK-33749][BUILD][PYTHON] Exclude target directory in pycodestyle …
HyukjinKwon Dec 11, 2020
d34bc2a
[SPARK-33757][INFRA][R] Fix the R dependencies build error on GitHub …
sarutak Dec 11, 2020
3e64795
[SPARK-33742][SQL][2.4] Throw PartitionsAlreadyExistException from Hi…
MaxGekk Dec 11, 2020
abded89
[MINOR][UI] Correct JobPage's skipped/pending tableHeaderId
linzebing Dec 13, 2020
88c3649
[SPARK-33757][INFRA][R][FOLLOWUP] Provide more simple solution
sarutak Dec 14, 2020
2964626
[SPARK-33770][SQL][TESTS][2.4] Fix the `ALTER TABLE .. DROP PARTITION…
MaxGekk Dec 14, 2020
029d577
[SPARK-33788][SQL][3.1][3.0][2.4] Throw NoSuchPartitionsException fro…
MaxGekk Dec 16, 2020
399b2ca
[SPARK-33733][SQL][2.4] PullOutNondeterministic should check and coll…
ulysses-you Dec 17, 2020
7e98ade
[SPARK-33831][UI] Update to jetty 9.4.34
srowen Dec 18, 2020
9b8c193
[SPARK-33756][SQL] Make BytesToBytesMap's MapIterator idempotent
advancedxy Dec 20, 2020
59cee43
[SPARK-33593][SQL][2.4] Vector reader got incorrect data with binary …
AngersZhuuuu Dec 21, 2020
b25f208
[SPARK-33869][PYTHON][SQL][TESTS] Have a separate metastore directory…
HyukjinKwon Dec 21, 2020
377ad9a
[BUILD][MINOR] Do not publish snapshots from forks
EnricoMi Dec 22, 2020
a21625f
[SPARK-33831][UI][FOLLOWUP] Update spark-deps-hadoop-3.1 for jetty 9.…
dongjoon-hyun Dec 22, 2020
1a153da
[SPARK-27575][CORE][YARN][2.4] Yarn file-related confs should merge n…
HeartSaVioR Dec 23, 2020
afd219f
[SPARK-33277][PYSPARK][SQL][2.4] Use ContextAwareIterator to stop con…
ueshin Dec 24, 2020
45e19bb
[SPARK-33911][SQL][DOCS][2.4] Update the SQL migration guide about ch…
MaxGekk Dec 27, 2020
3e6a6b7
[SPARK-33935][SQL][2.4] Fix CBO cost function
Jan 6, 2021
d442146
[SPARK-34012][SQL][2.4] Keep behavior consistent when conf `spark.sql…
AngersZhuuuu Jan 6, 2021
63e93a5
[SPARK-34059][SQL][CORE][2.4] Use for/foreach rather than map to make…
HyukjinKwon Jan 12, 2021
7ae6c8d
[SPARK-34118][CORE][SQL][2.4] Replaces filter and check for emptiness…
LuciferYang Jan 15, 2021
e0e1e21
[SPARK-34125][CORE][2.4] Make EventLoggingListener.codecMap thread-safe
cxzl25 Jan 18, 2021
96d2533
[SPARK-34153][SQL][2.4] Remove unused `getRawTable()` from `HiveExter…
MaxGekk Jan 19, 2021
d910643
[SPARK-34187][SS][2.4] Use available offset range obtained during pol…
viirya Jan 25, 2021
a194eab
[SPARK-34229][SQL] Avro should read decimal values with the file schema
dongjoon-hyun Jan 26, 2021
a5f844b
[SPARK-33726][SQL][2.4] Fix for Duplicate field names during Aggregation
yliou Jan 26, 2021
6519a7e
[SPARK-34212][SQL] Fix incorrect decimal reading from Parquet files
dongjoon-hyun Jan 26, 2021
33bdbf8
[SPARK-34231][AVRO][TEST] Make proper use of resource file within Avr…
xkrogen Jan 27, 2021
6bc088f
[SPARK-34212][SQL][FOLLOWUP] Refine the behavior of reading parquet n…
cloud-fan Jan 27, 2021
86eb199
[SPARK-34268][SQL][DOCS] Correct the documentation of the concat_ws f…
wangyum Jan 28, 2021
78bf448
[SPARK-34275][CORE][SQL][MLLIB][2.4] Replaces filter and size with count
LuciferYang Jan 28, 2021
953cc1c
[SPARK-34273][CORE] Do not reregister BlockManager when SparkContext …
dongjoon-hyun Jan 28, 2021
6c8bc7c
[SPARK-34260][SQL][2.4] Fix UnresolvedException when creating temp vi…
linhongliu-db Jan 29, 2021
da3ccab
[SPARK-34270][SS] Combine StateStoreMetrics should not override State…
viirya Jan 30, 2021
3696ba8
[SPARK-34310][CORE][SQL][2.4] Replaces map and flatten with flatMap
LuciferYang Feb 2, 2021
5f4e9ea
[SPARK-34212][SQL][FOLLOWUP] Parquet vectorized reader can read decim…
cloud-fan Feb 3, 2021
90db0ab
[SPARK-34327][BUILD] Strip passwords from inlining into build informa…
ScrapCodes Feb 3, 2021
e7acca2
[SPARK-34318][SQL][2.4] Dataset.colRegex should work with column name…
sarutak Feb 4, 2021
fa78e68
[SPARK-34407][K8S] KubernetesClusterSchedulerBackend.stop should clea…
dongjoon-hyun Feb 9, 2021
c9683be
[MINOR][SQL][DOCS] Fix the comments in the example at window function
yzjg Feb 19, 2021
9807250
[SPARK-34449][BUILD][2.4] Upgrade Jetty to fix CVE-2020-27218
sarutak Feb 20, 2021
96f5137
[SPARK-30228][BUILD][2.4] Update zstd-jni to 1.4.4-3
Feb 25, 2021
eb4601e
[SPARK-32924][WEBUI] Make duration column in master UI sorted in the …
Mar 4, 2021
191b24c
[SPARK-34672][BUILD][2.4] Fix docker file for creating release
viirya Mar 9, 2021
7985360
[SPARK-34507][BUILD] Update scala.version in parent POM when changing…
srowen Mar 11, 2021
906df15
[SPARK-34703][PYSPARK][2.4] Fix pyspark test when using sort_values o…
viirya Mar 11, 2021
6ec74e2
[SPARK-34696][SQL][TESTS] Fix CodegenInterpretedPlanTest to generate …
dongjoon-hyun Mar 11, 2021
7e0fbe0
[SPARK-31807][INFRA] Use python 3 style in release-build.sh
williamhyun May 25, 2020
41f46cf
[SPARK-34724][SQL] Fix Interpreted evaluation by using getMethod inst…
dongjoon-hyun Mar 12, 2021
7b7a8fe
[SPARK-34743][SQL][TESTS] ExpressionEncoderSuite should use deepEqual…
dongjoon-hyun Mar 15, 2021
3c627ad
[MINOR][SQL] Remove unused variable in NewInstance.constructor
dongjoon-hyun Mar 16, 2021
c5d81cb
[SPARK-34774][BUILD][2.4] Ensure change-scala-version.sh update scala…
LuciferYang Mar 19, 2021
59e4ae4
[SPARK-34776][SQL][3.0][2.4] Window class should override producedAtt…
viirya Mar 20, 2021
7879a0c
[SPARK-26625] Add oauthToken to spark.redaction.regex
Jan 16, 2019
29b981b
[SPARK-34811][CORE] Redact fs.s3a.access.key like secret and token
dongjoon-hyun Mar 21, 2021
ce58e05
[SPARK-34719][SQL][2.4] Correctly resolve the view query with duplica…
cloud-fan Mar 20, 2021
5685d84
[SPARK-34726][SQL][2.4] Fix collectToPython timeouts
peter-toth Mar 22, 2021
e756130
[MINOR][DOCS] Updating the link for Azure Data Lake Gen 2 in docs
lenadroid Mar 23, 2021
6ee1c08
[SPARK-34596][SQL][2.4] Use Utils.getSimpleName to avoid hitting Malf…
maropu Mar 24, 2021
615dbe1
[SPARK-34607][SQL][2.4] Add `Utils.isMemberClass` to fix a malformed …
maropu Mar 26, 2021
8062ab0
[SPARK-34874][INFRA] Recover test reports for failed GA builds
HyukjinKwon Mar 26, 2021
b83ab63
[SPARK-34876][SQL] Fill defaultResult of non-nullable aggregates
tanelk Mar 29, 2021
102b723
Revert "[SPARK-34876][SQL] Fill defaultResult of non-nullable aggrega…
HyukjinKwon Mar 29, 2021
3e65ba9
[SPARK-34855][CORE] Avoid local lazy variable in SparkContext.getCall…
viirya Mar 29, 2021
38238d5
[SPARK-34876][SQL][2.4] Fill defaultResult of non-nullable aggregates
tanelk Mar 29, 2021
f2ddbab
[SPARK-34909][SQL] Fix conversion of negative to unsigned in conv()
timarmstrong Mar 31, 2021
58a859a
Revert "[SPARK-33935][SQL][2.4] Fix CBO cost function"
viirya Apr 1, 2021
04485fe
[SPARK-24931][INFRA][2.4] Fix the GA failure related to R linter
sarutak Apr 1, 2021
30436b5
[SPARK-34939][CORE][2.4] Throw fetch failure exception when unable to…
viirya Apr 4, 2021
53d37e4
Preparing Spark release v2.4.8-rc1
viirya Apr 7, 2021
c36cea9
Preparing development version 2.4.9-SNAPSHOT
viirya Apr 7, 2021
f7ac0db
[SPARK-34988][CORE][2.4] Upgrade Jetty for CVE-2021-28165
sarutak Apr 8, 2021
b4d9d4a
[SPARK-34994][BUILD][2.4] Fix git error when pushing the tag after re…
viirya Apr 8, 2021
ae5568e
[SPARK-34963][SQL][2.4] Fix nested column pruning for extracting case…
viirya Apr 10, 2021
a0ab27c
Preparing Spark release v2.4.8-rc2
viirya Apr 11, 2021
63ebabb
[SPARK-34212][SQL][FOLLOWUP] Move the added test to ParquetQuerySuite
wangyum Apr 13, 2021
d4b9719
[SPARK-34834][NETWORK] Fix a potential Netty memory leak in Transport…
weixiuli Apr 14, 2021
c438f5f
[SPARK-35080][SQL] Only allow a subset of correlated equality predica…
allisonwang-db Apr 20, 2021
1630d64
[SPARK-31225][SQL][2.4] Override sql method of OuterReference
yaooqinn Apr 21, 2021
bec7389
Preparing development version 2.4.9-SNAPSHOT
viirya Apr 22, 2021
67aad59
[SPARK-35210][BUILD][2.4] Upgrade Jetty to 9.4.40 to fix ERR_CONNECTI…
sarutak Apr 25, 2021
252 changes: 252 additions & 0 deletions .github/workflows/build_and_test.yml
@@ -0,0 +1,252 @@
name: Build and test

on:
  push:
    branches:
    - branch-2.4
  pull_request:
    branches:
    - branch-2.4

jobs:
  # Build: build Spark and run the tests for specified modules.
  build:
    name: "Build modules: ${{ matrix.modules }} ${{ matrix.comment }} (JDK ${{ matrix.java }}, ${{ matrix.hadoop }})"
    # Ubuntu 20.04 is the latest LTS. The next LTS is 22.04.
    runs-on: ubuntu-20.04
    strategy:
      fail-fast: false
      matrix:
        java:
          - 8
        hadoop:
          - hadoop2.6
        # TODO(SPARK-32246): We don't test 'streaming-kinesis-asl' for now.
        # Kinesis tests depends on external Amazon kinesis service.
        # Note that the modules below are from sparktestsupport/modules.py.
        modules:
          - >-
            core, unsafe, kvstore, avro,
            network-common, network-shuffle, repl, launcher,
            examples, sketch, graphx
          - >-
            catalyst, hive-thriftserver
          - >-
            streaming, sql-kafka-0-10, streaming-kafka-0-10,
            mllib-local, mllib,
            yarn, mesos, kubernetes, hadoop-cloud, spark-ganglia-lgpl,
            streaming-flume, streaming-flume-sink, streaming-kafka-0-8
          - >-
            pyspark-sql, pyspark-mllib
          - >-
            pyspark-core, pyspark-streaming, pyspark-ml
          - >-
            sparkr
          - >-
            sql
        # Here, we split Hive and SQL tests into some of slow ones and the rest of them.
        included-tags: [""]
        excluded-tags: [""]
        comment: [""]
        include:
          # Hive tests
          - modules: hive
            java: 8
            hadoop: hadoop2.6
            included-tags: org.apache.spark.tags.SlowHiveTest
            comment: "- slow tests"
          - modules: hive
            java: 8
            hadoop: hadoop2.6
            excluded-tags: org.apache.spark.tags.SlowHiveTest
            comment: "- other tests"
    env:
      MODULES_TO_TEST: ${{ matrix.modules }}
      EXCLUDED_TAGS: ${{ matrix.excluded-tags }}
      INCLUDED_TAGS: ${{ matrix.included-tags }}
      HADOOP_PROFILE: ${{ matrix.hadoop }}
      # GitHub Actions' default miniconda to use in pip packaging test.
      CONDA_PREFIX: /usr/share/miniconda
      GITHUB_PREV_SHA: ${{ github.event.before }}
      ARROW_PRE_0_15_IPC_FORMAT: 1
    steps:
    - name: Checkout Spark repository
      uses: actions/checkout@v2
      # In order to fetch changed files
      with:
        fetch-depth: 0
    # Cache local repositories. Note that GitHub Actions cache has a 2G limit.
    - name: Cache Scala, SBT, Maven and Zinc
      uses: actions/cache@v2
      with:
        path: |
          build/apache-maven-*
          build/zinc-*
          build/scala-*
          build/*.jar
          ~/.sbt
        key: build-${{ hashFiles('**/pom.xml', 'project/build.properties', 'build/mvn', 'build/sbt', 'build/sbt-launch-lib.bash', 'build/spark-build-info') }}
        restore-keys: |
          build-
    - name: Cache Ivy local repository
      uses: actions/cache@v2
      with:
        path: ~/.ivy2/cache
        key: ${{ matrix.java }}-${{ matrix.hadoop }}-ivy-${{ hashFiles('**/pom.xml', '**/plugins.sbt') }}
        restore-keys: |
          ${{ matrix.java }}-${{ matrix.hadoop }}-ivy-
    - name: Install Java ${{ matrix.java }}
      uses: actions/setup-java@v1
      with:
        java-version: ${{ matrix.java }}
    # PySpark
    - name: Install PyPy3
      # Note that order of Python installations here matters because default python is
      # overridden.
      uses: actions/setup-python@v2
      if: contains(matrix.modules, 'pyspark')
      with:
        python-version: pypy3
        architecture: x64
    - name: Install Python 3.6
      uses: actions/setup-python@v2
      if: contains(matrix.modules, 'pyspark')
      with:
        python-version: 3.6
        architecture: x64
    - name: Install Python 2.7
      uses: actions/setup-python@v2
      # Yarn has a Python specific test too, for example, YarnClusterSuite.
      with:
        python-version: 2.7
        architecture: x64
    - name: Install Python packages (Python 3.6 and PyPy3)
      if: contains(matrix.modules, 'pyspark')
      # PyArrow is not supported in PyPy yet, see ARROW-2651.
      run: |
        python3.6 -m pip install numpy 'pyarrow<0.12.0' 'pandas<0.24.0' scipy xmlrunner
        python3.6 -m pip list
        # PyPy does not have xmlrunner, and pandas<0.24.0 installation fails in PyPy3, just skipping.
        pypy3 -m pip install numpy scipy
        pypy3 -m pip list
    - name: Install Python packages (Python 2.7)
      if: contains(matrix.modules, 'pyspark') || (contains(matrix.modules, 'sql') && !contains(matrix.modules, 'sql-'))
      run: |
        # Some tests do not pass in PySpark with PyArrow, for example, pyspark.sql.tests.ArrowTests.
        python2.7 -m pip install numpy 'pandas<0.24.0' scipy xmlrunner
        python2.7 -m pip list
    # SparkR
    - name: Install R 4.0
      uses: r-lib/actions/setup-r@v1
      if: contains(matrix.modules, 'sparkr')
      with:
        r-version: 4.0
    - name: Install R packages
      if: contains(matrix.modules, 'sparkr')
      run: |
        # qpdf is required to reduce the size of PDFs to make CRAN check pass. See SPARK-32497.
        sudo apt-get install -y libcurl4-openssl-dev qpdf
        sudo Rscript -e "install.packages(c('knitr', 'rmarkdown', 'testthat', 'devtools', 'e1071', 'survival', 'arrow', 'roxygen2'), repos='https://cloud.r-project.org/')"
        # Show installed packages in R.
        sudo Rscript -e 'pkg_list <- as.data.frame(installed.packages()[, c(1,3:4)]); pkg_list[is.na(pkg_list$Priority), 1:2, drop = FALSE]'
    # Run the tests.
    - name: Run tests
      run: |
        # Hive tests become flaky when running in parallel as it's too intensive.
        if [[ "$MODULES_TO_TEST" == "hive" ]]; then export SERIAL_SBT_TESTS=1; fi
        ./dev/run-tests --parallelism 2 --modules "$MODULES_TO_TEST" --included-tags "$INCLUDED_TAGS" --excluded-tags "$EXCLUDED_TAGS"
    - name: Upload test results to report
      if: always()
      uses: actions/upload-artifact@v2
      with:
        name: test-results-${{ matrix.modules }}-${{ matrix.comment }}-${{ matrix.java }}-${{ matrix.hadoop }}
        path: "**/target/test-reports/*.xml"
    - name: Upload unit tests log files
      if: failure()
      uses: actions/upload-artifact@v2
      with:
        name: unit-tests-log-${{ matrix.modules }}-${{ matrix.comment }}-${{ matrix.java }}-${{ matrix.hadoop }}-${{ matrix.hive }}
        path: "**/target/unit-tests.log"

  # Static analysis, and documentation build
  lint:
    name: Linters, licenses, dependencies and documentation generation
    runs-on: ubuntu-20.04
    steps:
    - name: Checkout Spark repository
      uses: actions/checkout@v2
    # Cache local repositories. Note that GitHub Actions cache has a 2G limit.
    - name: Cache Scala, SBT, Maven and Zinc
      uses: actions/cache@v2
      with:
        path: |
          build/apache-maven-*
          build/zinc-*
          build/scala-*
          build/*.jar
          ~/.sbt
        key: build-${{ hashFiles('**/pom.xml', 'project/build.properties', 'build/mvn', 'build/sbt', 'build/sbt-launch-lib.bash', 'build/spark-build-info') }}
        restore-keys: |
          build-
    - name: Cache Ivy local repository
      uses: actions/cache@v2
      with:
        path: ~/.ivy2/cache
        key: docs-ivy-${{ hashFiles('**/pom.xml', '**/plugins.sbt') }}
        restore-keys: |
          docs-ivy-
    - name: Cache Maven local repository
      uses: actions/cache@v2
      with:
        path: ~/.m2/repository
        key: docs-maven-${{ hashFiles('**/pom.xml') }}
        restore-keys: |
          docs-maven-
    - name: Install Java 8
      uses: actions/setup-java@v1
      with:
        java-version: 8
    - name: Install Python 3.6
      uses: actions/setup-python@v2
      with:
        python-version: 3.6
        architecture: x64
    - name: Install Python linter dependencies
      run: |
        pip3 install flake8 sphinx numpy
    - name: Install R 4.0
      uses: r-lib/actions/setup-r@v1
      with:
        r-version: 4.0
    - name: Install R linter dependencies and SparkR
      run: |
        sudo apt-get install -y libcurl4-openssl-dev libgit2-dev libssl-dev libxml2-dev
        sudo Rscript -e "install.packages(c('devtools'), repos='https://cloud.r-project.org/')"
        sudo Rscript -e "devtools::install_github('jimhester/lintr@v2.0.1')"
        ./R/install-dev.sh
    - name: Install Ruby 2.7 for documentation generation
      uses: actions/setup-ruby@v1
      with:
        ruby-version: 2.7
    - name: Install dependencies for documentation generation
      run: |
        sudo apt-get install -y libcurl4-openssl-dev pandoc
        pip install sphinx mkdocs numpy
        gem install jekyll jekyll-redirect-from pygments.rb
        sudo Rscript -e "install.packages(c('devtools', 'testthat', 'knitr', 'rmarkdown', 'roxygen2'), repos='https://cloud.r-project.org/')"
    - name: Scala linter
      run: ./dev/lint-scala
    - name: Java linter
      run: ./dev/lint-java
    - name: Python linter
      run: ./dev/lint-python
    - name: R linter
      run: ./dev/lint-r
    - name: License test
      run: ./dev/check-license
    - name: Dependencies test
      run: ./dev/test-dependencies.sh
    - name: Run documentation build
      run: |
        cd docs
        jekyll build
33 changes: 33 additions & 0 deletions .github/workflows/publish_snapshot.yml
@@ -0,0 +1,33 @@
name: Publish Snapshot

on:
  push:
    branches:
    - branch-2.4

jobs:
  publish-snapshot:
    if: github.repository == 'apache/spark'
    runs-on: ubuntu-latest
    steps:
    - name: Checkout Spark repository
      uses: actions/checkout@master
    - name: Cache Maven local repository
      uses: actions/cache@v2
      with:
        path: ~/.m2/repository
        key: snapshot-maven-${{ hashFiles('**/pom.xml') }}
        restore-keys: |
          snapshot-maven-
    - name: Install Java 8
      uses: actions/setup-java@v1
      with:
        java-version: 8
    - name: Publish snapshot
      env:
        ASF_USERNAME: ${{ secrets.NEXUS_USER }}
        ASF_PASSWORD: ${{ secrets.NEXUS_PW }}
        GPG_KEY: "not_used"
        GPG_PASSPHRASE: "not_used"
        GIT_REF: "branch-2.4"
      run: ./dev/create-release/release-build.sh publish-snapshot
25 changes: 25 additions & 0 deletions .github/workflows/test_report.yml
@@ -0,0 +1,25 @@
name: Report test results
on:
  workflow_run:
    workflows: ["Build and test"]
    types:
      - completed

jobs:
  test_report:
    runs-on: ubuntu-latest
    steps:
    - name: Download test results to report
      uses: dawidd6/action-download-artifact@v2
      with:
        github_token: ${{ secrets.GITHUB_TOKEN }}
        workflow: ${{ github.event.workflow_run.workflow_id }}
        commit: ${{ github.event.workflow_run.head_commit.id }}
        workflow_conclusion: completed
    - name: Publish test report
      uses: scacap/action-surefire-report@v1
      with:
        check_name: Report test results
        github_token: ${{ secrets.GITHUB_TOKEN }}
        report_paths: "**/target/test-reports/*.xml"
        commit: ${{ github.event.workflow_run.head_commit.id }}
1 change: 1 addition & 0 deletions .gitignore
@@ -61,6 +61,7 @@ project/plugins/project/build.properties
project/plugins/src_managed/
project/plugins/target/
python/lib/pyspark.zip
python/.eggs/
python/deps
python/test_coverage/coverage_data
python/test_coverage/htmlcov
50 changes: 0 additions & 50 deletions .travis.yml

This file was deleted.

2 changes: 1 addition & 1 deletion LICENSE
@@ -243,7 +243,7 @@ MIT License
core/src/main/resources/org/apache/spark/ui/static/dagre-d3.min.js
core/src/main/resources/org/apache/spark/ui/static/*dataTables*
core/src/main/resources/org/apache/spark/ui/static/graphlib-dot.min.js
-ore/src/main/resources/org/apache/spark/ui/static/jquery*
+core/src/main/resources/org/apache/spark/ui/static/jquery*
core/src/main/resources/org/apache/spark/ui/static/sorttable.js
docs/js/vendor/anchor.min.js
docs/js/vendor/jquery*
2 changes: 1 addition & 1 deletion LICENSE-binary
@@ -305,7 +305,6 @@ com.google.code.gson:gson
com.google.inject:guice
com.google.inject.extensions:guice-servlet
com.twitter:parquet-hadoop-bundle
commons-beanutils:commons-beanutils-core
commons-cli:commons-cli
commons-dbcp:commons-dbcp
commons-io:commons-io
@@ -468,6 +467,7 @@ Common Development and Distribution License (CDDL) 1.1
------------------------------------------------------

javax.annotation:javax.annotation-api https://jcp.org/en/jsr/detail?id=250
javax.el:javax.el-api https://javaee.github.io/uel-ri/
javax.servlet:javax.servlet-api https://javaee.github.io/servlet-spec/
javax.transaction:jta http://www.oracle.com/technetwork/java/index.html
javax.ws.rs:javax.ws.rs-api https://github.com/jax-rs