Equality delete support - 1 #1
return matching_files

def _can_contain_eq_deletes_for_file(self, data_file: DataFile, delete_wrapper: EqualityDeleteFileWrapper) -> bool:
Can you compare this to the Java implementation again? I see that Java uses contains-null and all-null checks together with range-check helpers.
I have added the same checks, but inline in _can_contain_eq_deletes_for_file(). Should I extract them into helpers and call those from _can_contain_eq_deletes_for_file()?
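For discussion, here is a rough sketch of what the helper-based version could look like. It is not the code in this PR and not a copy of the Java code: `FileStats`, `contains_null`, `all_null`, `all_non_null`, `ranges_overlap`, and `can_contain_eq_deletes` are hypothetical names, the stats are plain dicts keyed by field ID, and missing stats are assumed to mean "cannot prune".

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Iterable


@dataclass
class FileStats:
    """Hypothetical stand-in for the column stats of a data or delete file."""
    value_counts: Dict[int, int] = field(default_factory=dict)
    null_value_counts: Dict[int, int] = field(default_factory=dict)
    lower_bounds: Dict[int, Any] = field(default_factory=dict)
    upper_bounds: Dict[int, Any] = field(default_factory=dict)


def contains_null(stats: FileStats, field_id: int) -> bool:
    # Assumption: with no null count recorded, nulls may be present.
    count = stats.null_value_counts.get(field_id)
    return count is None or count > 0


def all_null(stats: FileStats, field_id: int) -> bool:
    nulls = stats.null_value_counts.get(field_id)
    values = stats.value_counts.get(field_id)
    # Only claim "all null" when both counts are known.
    return nulls is not None and values is not None and nulls == values


def all_non_null(stats: FileStats, field_id: int) -> bool:
    nulls = stats.null_value_counts.get(field_id)
    return nulls is not None and nulls == 0


def ranges_overlap(data: FileStats, delete: FileStats, field_id: int) -> bool:
    bounds = (
        data.lower_bounds.get(field_id),
        data.upper_bounds.get(field_id),
        delete.lower_bounds.get(field_id),
        delete.upper_bounds.get(field_id),
    )
    if any(b is None for b in bounds):
        return True  # missing bounds: cannot rule out a match
    data_lo, data_hi, del_lo, del_hi = bounds
    return data_lo <= del_hi and del_lo <= data_hi


def can_contain_eq_deletes(data: FileStats, delete: FileStats, equality_ids: Iterable[int]) -> bool:
    """Return False only when the stats prove the delete file cannot match any row."""
    for fid in equality_ids:
        if contains_null(data, fid) and contains_null(delete, fid):
            continue  # null rows may match null deletes; this column cannot prune
        if all_null(data, fid) and all_non_null(delete, fid):
            return False
        if all_null(delete, fid) and all_non_null(data, fid):
            return False
        if not ranges_overlap(data, delete, fid):
            return False
    return True
```

Keeping each check in its own function makes it easier to compare line by line with the Java side and to unit test the null handling separately from the range check.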
Equality Delete Support - 1
This PR implements the first part of equality delete support for the Python Iceberg library.
Changes
-> Added functionality to match equality deletes to data files
-> Enhanced PyArrow IO implementation to handle equality delete files
-> Added test cases that verify the new equality delete functions
-> Added test fixtures for equality delete testing
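
To illustrate the first change, here is a minimal sketch of the matching rule from the Iceberg spec: an equality delete file applies to a data file when the data file's data sequence number is strictly lower than the delete file's, and either the partitions (spec ID and values) are equal or the delete file's spec is unpartitioned. `FileEntry`, `eq_delete_applies`, and `matching_eq_deletes` are hypothetical names for this example only, not the classes or functions added in this PR.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class FileEntry:
    """Hypothetical, simplified view of a manifest entry."""
    path: str
    sequence_number: int
    spec_id: int
    partition: Tuple = ()
    unpartitioned: bool = False  # True when the file's partition spec has no fields


def eq_delete_applies(data_file: FileEntry, delete_file: FileEntry) -> bool:
    # Equality deletes only affect data written in strictly earlier sequence numbers.
    if data_file.sequence_number >= delete_file.sequence_number:
        return False
    # An unpartitioned delete file is global and applies across partitions.
    if delete_file.unpartitioned:
        return True
    return (
        data_file.spec_id == delete_file.spec_id
        and data_file.partition == delete_file.partition
    )


def matching_eq_deletes(data_file: FileEntry, delete_files: List[FileEntry]) -> List[FileEntry]:
    """Return the equality delete files that must be applied to data_file."""
    return [d for d in delete_files if eq_delete_applies(data_file, d)]


data = FileEntry("data-1.parquet", sequence_number=3, spec_id=0, partition=("2024-01-01",))
deletes = [
    FileEntry("eq-del-a.parquet", 4, 0, ("2024-01-01",)),   # newer, same partition
    FileEntry("eq-del-b.parquet", 3, 0, ("2024-01-01",)),   # same sequence number
    FileEntry("eq-del-c.parquet", 5, 0, ("2024-01-02",)),   # different partition
    FileEntry("eq-del-d.parquet", 6, 1, (), unpartitioned=True),  # global delete
]
print([d.path for d in matching_eq_deletes(data, deletes)])
```

A stats-based check on column bounds and null counts can then prune this candidate list further before any delete file is read.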
Testing
All tests pass, including new tests for equality delete functionality.
Related Issues
This is part of the project to support equality deletes in the Python Iceberg implementation, aligning with the Iceberg spec.