-
Notifications
You must be signed in to change notification settings - Fork 417
[CELEBORN-2264] Support cancel shuffle when write bytes exceeds threshold #3601
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #3601 +/- ##
==========================================
- Coverage 67.13% 66.91% -0.22%
==========================================
Files 357 357
Lines 21860 21932 +72
Branches 1943 1949 +6
==========================================
Hits 14674 14674
- Misses 6166 6244 +78
+ Partials 1020 1014 -6 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
d15feca to
2182ddb
Compare
2182ddb to
9522ca4
Compare
9522ca4 to
65ee9d9
Compare
7ebf5d1 to
d7c7080
Compare
d7c7080 to
4a513c7
Compare
eolivelli
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have left one question about protocol compatibility
Can you please add some unit tests ?
| bytesWrittenPerPartition: Array[Long], | ||
| serdeVersion: SerdeVersion) | ||
| serdeVersion: SerdeVersion, | ||
| bytesWritten: Long) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
IIUC this patch is changing the protocol
how is this handled ?
will the new client be compatible with the new server version and viceversa ?
What changes were proposed in this pull request?
This patch adds configurable threshold check for shuffle write bytes.
Why are the changes needed?
Shuffle will be canceled automatically if write bytes exceed the threshold to avoid cluster resource exhaustion.
Does this PR resolve a correctness bug?
No
Does this PR introduce any user-facing change?
No
How was this patch tested?
CI and Manual testing