[Append Scan] Introduce `IncrementalAppendScan` class (without integration tests) by smaheshwar-pltr · Pull Request #2234 · apache/iceberg-python

smaheshwar-pltr · 2025-07-22T12:54:52Z

Note: Contains changes from

Smaller diff from those changes: smaheshwar-pltr#5.

Rationale for this change

Split out from the incremental append scan work - see #2031 (comment). PyIceberg doesn't yet support incremental reads of data appended between two snapshots, as Spark does.

This PR adds the `IncrementalAppendScan` class and the API for constructing it on `pyiceberg.Table`.

Are these changes tested?

Integration tests are separated into a different PR - #2235, to keep this one small.

Are there any user-facing changes?

Ignoring the other PRs, there's a new scan class and method on Table.

smaheshwar-pltr · 2025-07-22T13:17:09Z

pyiceberg/table/__init__.py

+
+        append_snapshot_ids: Set[int] = {snapshot.snapshot_id for snapshot in append_snapshots}
+
+        manifests = {


#2031 (comment)

smaheshwar-pltr · 2025-07-22T13:17:22Z

pyiceberg/table/__init__.py

            limit=limit,
        )

+    def incremental_append_scan(


#2031 (comment)

smaheshwar-pltr · 2025-07-22T13:17:57Z

pyiceberg/table/__init__.py

+                Optional ID of the "from" snapshot, to start the incremental scan from, exclusively. This can be set
+                on the IncrementalAppendScan object returned, but ultimately must not be None.


#2031 (comment)
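For readers following the docstring above, here is a minimal sketch of what an exclusive "from" snapshot could mean when resolving the snapshots to scan. It is not the PR's code: `Snap`, `append_snapshots_between`, and the `history` data are hypothetical stand-ins, and treating the "to" snapshot as inclusive is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Snap:
    """Hypothetical stand-in for a table snapshot (not PyIceberg's Snapshot class)."""

    snapshot_id: int
    parent_id: Optional[int]
    operation: str  # e.g. "append", "overwrite", "delete"


def append_snapshots_between(
    snapshots: Dict[int, Snap], from_id_exclusive: int, to_id_inclusive: int
) -> List[Snap]:
    """Walk the parent chain from `to` back to `from`, keeping only append snapshots.

    The `from` snapshot itself is excluded; the `to` snapshot is included (assumption).
    """
    selected: List[Snap] = []
    current: Optional[int] = to_id_inclusive
    while current is not None and current != from_id_exclusive:
        snap = snapshots[current]
        if snap.operation == "append":
            selected.append(snap)
        current = snap.parent_id
    if current is None:
        # Reached the root without meeting `from`, so it is not an ancestor of `to`.
        raise ValueError("from snapshot is not an ancestor of the to snapshot")
    return selected


# Illustrative history: 1 <- 2 <- 3 <- 4, where snapshot 3 is an overwrite.
history = {
    1: Snap(1, None, "append"),
    2: Snap(2, 1, "append"),
    3: Snap(3, 2, "overwrite"),
    4: Snap(4, 3, "append"),
}
ids = [s.snapshot_id for s in append_snapshots_between(history, 1, 4)]
```

In this sketch snapshot 1 is skipped because the lower bound is exclusive, and snapshot 3 is skipped because it is not an append.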

smaheshwar-pltr · 2025-07-22T13:20:00Z

pyiceberg/table/__init__.py

+        return current_schema.select(*self.selected_fields, case_sensitive=self.case_sensitive)
+
+    def plan_files(self) -> Iterable[FileScanTask]:
+        from_snapshot_id, to_snapshot_id = self._validate_and_resolve_snapshots()


#2031 (comment)

smaheshwar-pltr · 2025-07-22T13:20:22Z

pyiceberg/table/__init__.py

+        ).plan_files(
+            manifests=list(manifests),
+            manifest_entry_filter=lambda manifest_entry: manifest_entry.snapshot_id in append_snapshot_ids
+            and manifest_entry.status == ManifestEntryStatus.ADDED,


#2031 (comment)
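The `manifest_entry_filter` in the snippet above keeps an entry only if it was written by one of the selected append snapshots and its status is ADDED. A self-contained sketch of that predicate, using hypothetical `Status` and `Entry` types in place of PyIceberg's `ManifestEntryStatus` and manifest entries:

```python
from dataclasses import dataclass
from enum import Enum
from typing import Iterable, List, Set


class Status(Enum):
    """Hypothetical stand-in for ManifestEntryStatus."""

    EXISTING = 0
    ADDED = 1
    DELETED = 2


@dataclass(frozen=True)
class Entry:
    """Hypothetical stand-in for a manifest entry."""

    snapshot_id: int
    status: Status
    file_path: str


def appended_entries(entries: Iterable[Entry], append_snapshot_ids: Set[int]) -> List[Entry]:
    """Keep entries added by one of the given append snapshots."""
    return [
        entry
        for entry in entries
        if entry.snapshot_id in append_snapshot_ids and entry.status == Status.ADDED
    ]


entries = [
    Entry(1, Status.ADDED, "a.parquet"),     # added before the scanned range
    Entry(2, Status.ADDED, "b.parquet"),     # added by an append in range
    Entry(2, Status.EXISTING, "a.parquet"),  # carried over, not newly added
    Entry(4, Status.DELETED, "c.parquet"),   # in range but not an addition
]
paths = [entry.file_path for entry in appended_entries(entries, {2, 4})]
```

Checking both conditions matters because a manifest written by an in-range snapshot can also carry EXISTING or DELETED entries, which are not newly appended data.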

github-actions · 2026-03-17T00:28:55Z

This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that's incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@iceberg.apache.org list. Thank you for your contributions.

github-actions · 2026-03-25T00:30:42Z

This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.

Sreesh Maheshwar added 4 commits July 22, 2025 10:51

Introduce an AbstractTableScan with default methods

90a861f

Extract manifest group planning into separate class

6d561f8

Add __eq__ and __hash__ methods to ManifestFile

1eeedb8

Introduce IncrementalAppendScan class

17a6865

smaheshwar-pltr mentioned this pull request Jul 22, 2025

[Append Scan] Integration tests for IncrementalAppendScan #2235

Closed


smaheshwar-pltr mentioned this pull request Jul 22, 2025

Incremental Append Scan #2031

Closed

smaheshwar-pltr marked this pull request as ready for review July 22, 2025 13:36

smaheshwar-pltr mentioned this pull request Jul 22, 2025

[Append Scan] Extract manifest group planning into separate class #2232

Closed

smaheshwar-pltr mentioned this pull request Oct 19, 2025

Incremental Append Scan #2634

Open

github-actions bot added the stale label Mar 17, 2026

github-actions bot closed this Mar 25, 2026

smaheshwar-pltr wants to merge 4 commits into apache:main from smaheshwar-pltr:sm/append-scan-no-tests
