Skip to content

Conversation

@rutb327
Copy link
Owner

@rutb327 rutb327 commented Jun 18, 2025

Equality Delete Support - 1

This PR implements equality delete support -1 for the Python Iceberg library.

Changes
-> Added functionality to match equality deletes to data files
-> Enhanced PyArrow IO implementation to handle equality delete files
-> Added some test cases to verify functions added for equality delete support
-> Added test fixtures for equality delete testing

Testing
All tests pass, including new tests for equality delete functionality.

Related Issues
This is part of the project to support equality deletes in the Python Iceberg implementation, aligning with the Iceberg spec.


return matching_files

def _can_contain_eq_deletes_for_file(self, data_file: DataFile, delete_wrapper: EqualityDeleteFileWrapper) -> bool:
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you compare this to java implementaion again i see java uses contains null and all null with range check helpers.

Copy link
Owner Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have added the same checks just doing them inline directly in _can_contain_eq_deletes_for_file(). Should I create helpers and call them in _can_contain_eq_deletes_for_file() ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants