DAO Pagination (GSI-2412) by TheByronHimes · Pull Request #231 · ghga-de/hexkit

TheByronHimes · 2026-06-10T14:15:43Z

The DAO protocol and providers now expose pagination parameters in find_all(): sort, skip, and limit. Additionally, the result of find_all() exposes a .total_count() method that returns the total number of items matching the filter regardless of skip/limit. Together, these should enable both offset-based and cursor-based pagination.

Bumps the version to 8.3.0

coveralls · 2026-06-10T14:53:09Z

Coverage Report for CI Build 27284487376

Coverage increased (+0.009%) to 93.308%

Details

Coverage increased (+0.009%) from the base build.
Patch coverage: 4 uncovered changes across 1 file (87 of 91 lines covered, 95.6%).
No coverage regressions found.

Uncovered Changes

File	Changed	Covered	%
src/hexkit/providers/mongokafka/provider/daopub.py	26	22	84.62%
Total (4 files)	91	87	95.6%

Coverage Regressions

No coverage regressions found.

Coverage Stats


Relevant Lines:	3915
Covered Lines:	3653
Line Coverage:	93.31%
Coverage Strength:	0.93 hits per line

💛 - Coveralls

Cito

Thank you, that looks useful.

I have pointed out two edge cases that need some refinement, and made one suggestions regarding the sort spec.

Cito · 2026-06-15T14:32:09Z

+            A FindResult that is async-iterable and also provides total_count().
        """
+        skip = skip or 0
+        limit = limit or 0


maybe add a comment that limit=0 means unbounded in mongo (not all db engines do this)

Cito · 2026-06-15T14:38:34Z

+                if sort:
+                    cursor = cursor.sort(sort)
+                async for document in cursor.skip(skip).limit(limit):
+                    if document.get("__metadata__", {}).get("deleted", False):


here you filter after skipping and limiting - it should be the other way round

i guess the most performant way is to add a filter for (metadata missing or metadata.deleted false) upfront.

Ugh, yes, I don't know how I didn't catch that. Good spot

Cito · 2026-06-15T14:53:06Z

+                Number of matching resources to skip before yielding results.
+                Defaults to None (no skipping).
+            limit:
+                Maximum number of resources to yield. Defaults to None (no limit).


The docstring should also define what limit=0 means - unbounded (as in mongo) or literally 0 (which I prefer since it#s much more intuitive). The implementation is currently for the former, the latter would need slight adjustment for mongo.

What is the use case for having a limit of literally 0?

Passing a value of 0 is not really useful (if you want unbound you can already just pass nothing/None). My main point is that the behavior should be well specified in the docstring. Options are 1) raise error, 2) do exactly what was requested (return nothing) or 3) re-interpret as "infinity". I think 2) is the most obvious one - Django ORM, Python slicing, Postgres and MySQL all work that way.

Cito · 2026-06-15T14:55:53Z

+    results = [x.title async for x in dao.find_all(mapping={}, skip=10, sort=asc_sort)]
+    assert results == []
+
+    # skip=0, limit=0 behaves like no pagination (limit=0 means no limit)


we should test both, limit=0 and limit=None and check that they do what we say in the docstring (for all providers)

Cito · 2026-06-15T15:00:11Z


        with pytest.raises(UniqueConstraintViolationError):
            await dao.upsert(dto5)
+


Here we should also add some pagination tests after adding raw documents with no metadata at all, and metadata.deleted = True

Cito · 2026-06-15T15:07:14Z

 ]

+SortSpec = list[tuple[str, Literal[1, -1]]]
+


We could use simple simple strings with a "-" prefix for descending order - that's how the Django ORM does it, so it's a well-established pattern in Python.

TheByronHimes added 7 commits June 9, 2026 13:57

Add pagination to find_all

7f45812

Add tests

edd049b

Add sort parameter

fa336b8

Add FindResult class for pagination and total count in DAO methods

51e9dde

Bump version to 8.3.0

f89c82e

Simplify double negative

4c893c0

Add test to make sure deleted outbox docs are excluded in total_count

e161b40

TheByronHimes marked this pull request as ready for review June 10, 2026 14:52

TheByronHimes requested a review from Cito June 10, 2026 14:52

Cito requested changes Jun 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DAO Pagination (GSI-2412)#231

DAO Pagination (GSI-2412)#231
TheByronHimes wants to merge 7 commits into
mainfrom
feature/dao_pagination

TheByronHimes commented Jun 10, 2026 •

edited

Loading

Uh oh!

coveralls commented Jun 10, 2026

Uh oh!

Cito left a comment

Uh oh!

Cito Jun 15, 2026

Uh oh!

Cito Jun 15, 2026

Uh oh!

TheByronHimes Jun 15, 2026

Uh oh!

Cito Jun 15, 2026

Uh oh!

TheByronHimes Jun 15, 2026

Uh oh!

Cito Jun 15, 2026

Uh oh!

Cito Jun 15, 2026

Uh oh!

Cito Jun 15, 2026

Uh oh!

Cito Jun 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		with pytest.raises(UniqueConstraintViolationError):
		await dao.upsert(dto5)

Conversation

TheByronHimes commented Jun 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coveralls commented Jun 10, 2026

Coverage Report for CI Build 27284487376

Coverage increased (+0.009%) to 93.308%

Details

Uncovered Changes

Coverage Regressions

Coverage Stats

💛 - Coveralls

Uh oh!

Cito left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

TheByronHimes commented Jun 10, 2026 •

edited

Loading