[SPARK-12122][STREAMING] Prevent batches from being submitted twice after recovering StreamingContext from checkpoint #10127
tdas wants to merge 2 commits
Test build #2161 has finished for PR 10127 at commit
Test build #47134 has finished for PR 10127 at commit
Test build #2162 has finished for PR 10127 at commit
Could you clarify why pendingTimes may contain restartTime?
Explained offline:
The restart time is always checkpointTime+1 (assuming batch duration = 1). However, pendingTimes can already contain batches >= checkpointTime+1. As a result, timesToReschedule can include batches >= checkpointTime+1, which are submitted explicitly on restart and then submitted again by the timer.
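To make the overlap concrete, here is a minimal sketch in Python rather than the actual Spark code. Batch times are plain integers with batch duration 1, and every name in it (`times_to_reschedule`, `timer_times`, `drop_from_restart`, the sample values) is hypothetical and chosen only for illustration. It assumes the remedy is to keep only batches strictly before the restart time in the explicitly rescheduled set, leaving everything from the restart time onward to the timer.

```python
def times_to_reschedule(pending_times, down_times, restart_time,
                        drop_from_restart=True):
    """Return the sorted, de-duplicated batch times to submit explicitly.

    Hypothetical helper: with drop_from_restart=True, batches at or after
    restart_time are left out because the timer will generate them anyway.
    """
    candidates = set(pending_times) | set(down_times)
    if drop_from_restart:
        candidates = {t for t in candidates if t < restart_time}
    return sorted(candidates)


def timer_times(restart_time, num_batches):
    """Batch times the restarted timer would fire for (batch duration = 1)."""
    return [restart_time + i for i in range(num_batches)]


# Illustrative values only, not taken from a real checkpoint.
checkpoint_time = 10
restart_time = checkpoint_time + 1   # as described in the comment above
pending_times = [10, 11, 12]         # already contains batches >= restart_time
down_times = [10]

unfiltered = times_to_reschedule(pending_times, down_times, restart_time,
                                 drop_from_restart=False)
explicit = times_to_reschedule(pending_times, down_times, restart_time)
from_timer = timer_times(restart_time, 3)

# Batches that would be submitted twice without and with the filter.
duplicates_before = sorted(set(unfiltered) & set(from_timer))
duplicates_after = sorted(set(explicit) & set(from_timer))
```

In this sketch the unfiltered list shares batches with the timer, while the filtered list does not, which is the double submission the comment describes.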
LGTM
Test build #2164 has finished for PR 10127 at commit
Test build #2165 has finished for PR 10127 at commit
Test build #47163 has finished for PR 10127 at commit
…fter recovering StreamingContext from checkpoint

Author: Tathagata Das <tathagata.das1565@gmail.com>

Closes #10127 from tdas/SPARK-12122.

(cherry picked from commit 4106d80)
Signed-off-by: Tathagata Das <tathagata.das1565@gmail.com>