
feat: enhance rollup aggregation with dedup logic#1843

Merged
gustavobtflores merged 2 commits into kernelci:main from gustavobtflores:feat/tree-tests-rollup-deduplication
Apr 7, 2026

Conversation

Contributor

@gustavobtflores commented Apr 6, 2026

Description

Adds deduplication logic to the tree_tests_rollup aggregation and handles cases where a test's status is updated from null to a concrete value (pass/fail/etc.). Previously, these updates would incorrectly increment counters without adjusting the null count, leading to inflated totals.

Changes

  • Implemented deduplication logic in _process_tree_tests_rollup() to detect and handle existing processed entries with null statuses
  • Added get_rollup_key() helper function for consistent rollup key generation
  • Added comprehensive unit tests for rollup entry correction behavior
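The PR only names the get_rollup_key() helper without showing its body, so here is a minimal sketch, assuming the key simply namespaces a test under its checkout (the real helper's fields may differ):

```python
def get_rollup_key(checkout_id: str, test_id: str) -> str:
    """Build a stable ProcessedListingItems key for rollup dedup.

    Sketch only: the actual key fields are not shown in this PR
    summary; this assumes a simple checkout/test composite.
    """
    return f"rollup:{checkout_id}:{test_id}"

print(get_rollup_key("checkout-1", "test-42"))
```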

How to test

  1. Run the unit tests: python -m pytest backend/kernelCI_app/tests/unitTests/commands/process_pending_helpers_test.py -v
  2. Verify the deduplication logic handles null-to-status transitions correctly by checking test cases covering:
    • New tests with null status
    • New tests with concrete status
    • Existing null tests transitioning to pass/fail/skip/error statuses
    • Reprocessing with unchanged status (should be skipped)

Part of #1801

@gustavobtflores gustavobtflores self-assigned this Apr 6, 2026
@gustavobtflores added the Backend (Most or all of the changes for this issue will be in the backend code.) and Ingester (The issue relates to the ingester tool, including the command itself and related functions.) labels Apr 6, 2026
@gustavobtflores force-pushed the feat/tree-tests-rollup-deduplication branch from fe923af to de1d3d5 on April 7, 2026

Copilot AI left a comment


Pull request overview

This PR improves the tree_tests_rollup denormalized aggregation to correctly handle reprocessing when a test transitions from a NULL status to a concrete status, avoiding inflated totals by applying a “correction” delta (decrement null_tests, increment the concrete bucket, keep total_tests unchanged).

Changes:

  • Added a rollup-specific processed-item key (get_rollup_key) and used ProcessedListingItems to deduplicate rollup processing and detect NULL -> non-NULL transitions.
  • Extended rollup aggregation helpers to support correction deltas via is_correction / reprocess_test_ids.
  • Added unit tests validating correction behavior and the reprocess_test_ids pathway.
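The dedup decision described above can be sketched as a three-way classification; the MISSING sentinel and the names here are illustrative, not from the PR:

```python
# Sentinel for "no ProcessedListingItems row exists for this key yet".
MISSING = object()

def classify(stored_status, new_status) -> str:
    """Decide how a rollup entry is treated on (re-)ingestion (sketch)."""
    if stored_status is MISSING:
        return "new"          # first ingestion: count normally
    if stored_status is None and new_status is not None:
        return "correction"   # NULL -> concrete: apply a correction delta
    return "skip"             # already processed, nothing to change

print(classify(MISSING, "pass"), classify(None, "pass"), classify("pass", "pass"))
```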

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

  • backend/kernelCI_app/management/commands/process_pending_aggregations.py — Adds rollup key generation and dedup/correction detection using ProcessedListingItems during rollup batching.
  • backend/kernelCI_app/management/commands/helpers/process_pending_helpers.py — Implements correction delta behavior in rollup accumulation and threads reprocess_test_ids through aggregation.
  • backend/kernelCI_app/tests/unitTests/commands/process_pending_helpers_test.py — Adds unit tests covering null -> non-null correction behavior and reprocess_test_ids handling.


```python
            continue
        else:
            # null -> non-null: This is a correction (reprocess)
            reprocess_test_ids.add(test.test_id)
```

Copilot AI Apr 7, 2026


In the null->non-null correction path (stored_status is None and new test.status is not None), the code marks the test for reprocessing but does not add an updated ProcessedListingItems entry to set status to the new non-null value. This leaves the processed row with status=None permanently, so any future re-ingestion of the same test_id will be treated as a correction again, repeatedly decrementing null_tests and skewing rollup totals. Add/update a ProcessedListingItems entry for the correction case (same listing_item_key) so bulk_create(update_conflicts=True) updates status away from None once corrected.

Suggested change:

```diff
 reprocess_test_ids.add(test.test_id)
+new_processed_entries.add(
+    ProcessedListingItems(
+        listing_item_key=rollup_key,
+        checkout_id=checkout_id,
+        status=test.status,
+    )
+)
```

Comment on lines 105 to 110
```python
def aggregate_tests_rollup(
    ready_tests: Sequence[PendingTest],
    test_builds_by_id: dict[str, Builds],
    issues_map: dict[str, dict],
    reprocess_test_ids: set[str] = set(),
) -> dict[tuple, dict]:
```

Copilot AI Apr 7, 2026


reprocess_test_ids uses a mutable default (set()). Even if this function doesn't mutate it today, mutable defaults are error-prone and make future edits risky. Prefer reprocess_test_ids: Optional[set[str]] = None and normalize to an empty set inside the function.
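A minimal sketch of the suggested fix; the body here is purely illustrative (the real function's parameters and logic differ):

```python
from typing import Optional

def aggregate_tests_rollup(
    ready_tests: list,
    reprocess_test_ids: Optional[set[str]] = None,
) -> set[str]:
    # Normalize inside the function: a `set()` default is created once at
    # definition time and shared across every call, so any mutation would
    # leak between invocations.
    if reprocess_test_ids is None:
        reprocess_test_ids = set()
    reprocess_test_ids.add("t1")  # illustrative mutation only
    return reprocess_test_ids

# Each call now gets its own independent set:
print(aggregate_tests_rollup([]) is aggregate_tests_rollup([]))  # False
```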

@gustavobtflores force-pushed the feat/tree-tests-rollup-deduplication branch from de1d3d5 to 8aaa613 on April 7, 2026
```python
new_processed_entries: set[ProcessedListingItems] = set()

for test in ready_tests:
    rollup_key = rollup_keys_by_test_id[test.test_id]
```
Contributor


Do we really need to precompute the rollup keys?
I believe we could compute them on demand (it would even avoid computing keys where the build id is not found).

Contributor Author


Theoretically we should only receive tests here that have a build; the check below is mostly for the checkout. We can move this after the check for sure, anyway.

```python
        record[counter] += 1
    else:
        record[counter] += 1
        record["total_tests"] += 1
```
Contributor


Wouldn't it be safer (less error-prone) to make total_tests a derived value?
I believe it should be the sum of a small number of fields, and it wouldn't have a significant impact on performance.

Contributor Author


We could make it computed in the database; since we process tests individually and within batches, it would be kinda bad to have this computation here.

maybe something like: https://www.postgresql.org/docs/current/ddl-generated-columns.html

Contributor


Yeah, I second that
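A runnable sketch of the generated-column idea discussed above, using SQLite's generated columns (3.31+) in place of PostgreSQL so it works in-memory; the column names are assumed, not taken from the actual schema:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE tree_tests_rollup (
        pass_tests  INTEGER NOT NULL DEFAULT 0,
        fail_tests  INTEGER NOT NULL DEFAULT 0,
        null_tests  INTEGER NOT NULL DEFAULT 0,
        -- total_tests is derived by the database; application code
        -- never writes it, so it can never drift out of sync.
        total_tests INTEGER GENERATED ALWAYS AS
            (pass_tests + fail_tests + null_tests) VIRTUAL
    )
    """
)
conn.execute(
    "INSERT INTO tree_tests_rollup (pass_tests, fail_tests, null_tests)"
    " VALUES (3, 1, 2)"
)
total = conn.execute("SELECT total_tests FROM tree_tests_rollup").fetchone()[0]
print(total)
```

In PostgreSQL the equivalent is `GENERATED ALWAYS AS (...) STORED`, per the documentation page linked in the comment above.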

Collaborator

@MarceloRobert left a comment


LGTM. These things happen when we end up ingesting the same file multiple times, such as when the ingester breaks for some reason.

@gustavobtflores gustavobtflores added this pull request to the merge queue Apr 7, 2026
Merged via the queue into kernelci:main with commit 1e925d5 Apr 7, 2026
7 checks passed

Labels

  • Backend — Most or all of the changes for this issue will be in the backend code.
  • Ingester — The issue relates to the ingester tool, including the command itself and related functions.

4 participants