Implement cross-table segment pruning for logical table #17868
yashmayya merged 3 commits into apache:master from
Conversation
Pull request overview
Adds cross-table segment pruning for logical tables by pruning the combined segment set once, and then resolving segment contexts per physical table while preserving prune order.
Changes:
- Collect all segments across physical tables and run `SegmentPrunerService.prune(...)` once (cross-table) in `LogicalTableExecutionInfo`.
- Add a constant-false filter/having short-circuit to skip pruning and return no selected segments.
- Add new unit tests covering cross-table pruning behavior, constant-false shortcut, provided segment contexts, and order preservation.
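The flow these changes describe (combine segments across physical tables, prune once, then regroup the selected segments per table while preserving prune order) can be sketched with simplified stand-ins. Plain strings replace `IndexSegment`, table names replace `SingleTableExecutionInfo`, and a predicate stands in for `SegmentPrunerService.prune`; this is an illustration of the idea, not the actual Pinot code.

```java
import java.util.*;
import java.util.function.Predicate;

// Hypothetical sketch of cross-table segment pruning for a logical table.
class CrossTablePruneSketch {

  static Map<String, List<String>> pruneAcrossTables(
      Map<String, List<String>> segmentsByTable, Predicate<String> keep) {
    // 1. Combine all segments and remember which table each came from.
    List<String> allSegments = new ArrayList<>();
    Map<String, String> segmentToTable = new HashMap<>();
    for (Map.Entry<String, List<String>> e : segmentsByTable.entrySet()) {
      for (String segment : e.getValue()) {
        allSegments.add(segment);
        segmentToTable.put(segment, e.getKey());
      }
    }
    // 2. Prune once over the combined list (stand-in for SegmentPrunerService.prune).
    List<String> selected = new ArrayList<>();
    for (String segment : allSegments) {
      if (keep.test(segment)) {
        selected.add(segment);
      }
    }
    // 3. Group selected segments back per table, preserving prune order.
    Map<String, List<String>> tableToSelected = new LinkedHashMap<>();
    for (String segment : selected) {
      tableToSelected.computeIfAbsent(segmentToTable.get(segment), t -> new ArrayList<>())
          .add(segment);
    }
    return tableToSelected;
  }

  public static void main(String[] args) {
    Map<String, List<String>> input = new LinkedHashMap<>();
    input.put("tableA", List.of("a1", "a2"));
    input.put("tableB", List.of("b1", "b2"));
    // Keep only segments not ending in "2".
    System.out.println(pruneAcrossTables(input, s -> !s.endsWith("2")));
    // prints {tableA=[a1], tableB=[b1]}
  }
}
```

The key point is that step 2 sees the whole logical table at once, so a pruner that ranks segments globally (e.g. by min/max statistics) can drop segments from any physical table.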
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| pinot-core/src/main/java/org/apache/pinot/core/query/executor/LogicalTableExecutionInfo.java | Implements cross-table pruning and builds per-table SegmentContext mapping while preserving prune order. |
| pinot-core/src/test/java/org/apache/pinot/core/query/executor/LogicalTableExecutionInfoTest.java | Adds tests validating cross-table pruning correctness and key edge cases. |
```java
for (IndexSegment segment : segmentsForTable) {
  segmentToContext.put(segment, providedContexts.get(segment));
}
```
If providedContexts is non-null but does not contain an entry for a selected segment, providedContexts.get(segment) will return null, and the code will later add a null SegmentContext into selectedSegmentContexts (or potentially trigger null-related failures downstream). Consider validating presence and either (a) falling back to getSegmentContexts(...) for missing segments, or (b) throwing an explicit exception with a clear message when a selected segment lacks a provided context.
Suggested change:

```java
List<IndexSegment> missingSegments = null;
for (IndexSegment segment : segmentsForTable) {
  SegmentContext context = providedContexts.get(segment);
  if (context != null) {
    segmentToContext.put(segment, context);
  } else {
    if (missingSegments == null) {
      missingSegments = new ArrayList<>();
    }
    missingSegments.add(segment);
  }
}
if (missingSegments != null && !missingSegments.isEmpty()) {
  List<SegmentContext> contexts =
      tableExecutionInfo.getSegmentContexts(missingSegments, queryContext.getQueryOptions());
  for (int i = 0; i < missingSegments.size(); i++) {
    segmentToContext.put(missingSegments.get(i), contexts.get(i));
  }
}
```
providedContexts is created at the beginning for the same indexSegments, and only a (possibly smaller) subset of them is processed here after pruning. So it should never be null.
```java
selectedSegments = segmentPrunerService.prune(allSegments, queryContext, prunerStats, executorService);
segmentPruneTimer.stopAndRecord();
```
segmentPruneTimer.stopAndRecord() is not guaranteed to run if segmentPrunerService.prune(...) throws, which can skew phase timing metrics and leave timers unrecorded. Wrap the prune call in a try/finally so the timer is always stopped/recorded (or use a closeable/auto-close pattern if available in TimerContext.Timer).
Suggested change:

```java
try {
  selectedSegments = segmentPrunerService.prune(allSegments, queryContext, prunerStats, executorService);
} finally {
  segmentPruneTimer.stopAndRecord();
}
```
This uses the same pattern from SingleTableExecutionInfo (no try/finally). We can add it, but I think it is not required.
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## master #17868 +/- ##
============================================
+ Coverage 63.23% 63.31% +0.08%
- Complexity 1480 1481 +1
============================================
Files 3190 3190
Lines 192283 192314 +31
Branches 29470 29477 +7
============================================
+ Hits 121589 121773 +184
+ Misses 61158 60991 -167
- Partials 9536 9550 +14
LGTM. @abhishekbafna could you please address the Copilot suggestions? I can approve it then.
@shauryachats I think we do not need further changes. The code changes are in line with the existing approach and code structure from the SingleTableExecutionInfo code. Let me know if you have any further thoughts. Thanks for the review.
```java
    .anyMatch(tableExecutionInfo -> tableExecutionInfo.getTableDataManager() instanceof RealtimeTableDataManager);
}

/**
```
Can we write an end-to-end integration test that verifies, via the query stats, the number of segments queried for an ORDER BY + LIMIT query?
I could not get an integration test working. Added a unit test.
```java
int numTotalSegments = allSegments.size();

// Constant false shortcut: skip pruning
List<IndexSegment> selectedSegments;
```
Let's move selectSegments() from SingleTableExecutionInfo to TableExecutionInfo so that it can be reused here without duplicating the logic.
I think we should leave that as-is.
selectSegments() is currently a private method; to make it accessible we would need to make it static with a public/default access modifier. That seems more like a hack than a refactoring. We also do not have access to the SingleTableExecutionInfo object here and may have to create one.
I will try to push selectSegments() up to the top-level class so that it is accessible in both.
```java
}
Map<IndexSegment, SegmentContext> segmentToContext = new HashMap<>();
for (Map.Entry<SingleTableExecutionInfo, List<IndexSegment>> entry : tableToSelected.entrySet()) {
  SingleTableExecutionInfo tableExecutionInfo = entry.getKey();
```
Let's add a new API in SingleTableExecutionInfo:

```java
getSelectedSegmentsInfo(List<IndexSegment> selectedSegments, QueryContext queryContext, TimerContext timerContext,
    ExecutorService executorService, SegmentPrunerService segmentPrunerService)
```

which is called here and from the current SingleTableExecutionInfo.getSelectedSegmentsInfo() after it prunes segments. That will avoid duplicating the code that runs after pruning.
The new API does not make sense to me, as it has no standalone functional use case.
Also, should it be added to SingleTableExecutionInfo or to the super-interface TableExecutionInfo? The implementation would follow from that choice. We would also need an instance of SingleTableExecutionInfo to access it.
This API belongs to SingleTableExecutionInfo because it is per-table.
You have access to a SingleTableExecutionInfo here, right?
I have moved the method to TableExecutionInfo as a default method.
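The resolution above (sharing the logic via a default method on the interface) can be sketched as follows. The types here are simplified stand-ins for Pinot's actual interfaces, with the constant-false shortcut folded in; this is an illustration of the pattern, not the real implementation.

```java
import java.util.*;

// Simplified stand-in for Pinot's TableExecutionInfo hierarchy (not the real classes).
interface TableExecutionInfo {
  // Shared default method: constant-false shortcut plus dispatch to per-table pruning.
  default List<String> selectSegments(List<String> segments, boolean filterIsAlwaysFalse) {
    if (filterIsAlwaysFalse) {
      return List.of(); // constant-false filter: no segment can match, skip pruning
    }
    return prune(segments);
  }

  // Each implementation supplies its own pruning strategy.
  List<String> prune(List<String> segments);
}

class SingleTableExecutionInfo implements TableExecutionInfo {
  @Override
  public List<String> prune(List<String> segments) {
    return segments; // per-table pruning would run here; pass-through for the sketch
  }
}

class DefaultMethodDemo {
  public static void main(String[] args) {
    TableExecutionInfo info = new SingleTableExecutionInfo();
    System.out.println(info.selectSegments(List.of("s1", "s2"), true));  // prints []
    System.out.println(info.selectSegments(List.of("s1", "s2"), false)); // prints [s1, s2]
  }
}
```

A default method keeps the shared code in one place while letting both the single-table and logical-table implementations reuse it without any static helper or artificial object creation.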
```java
/**
 * Verifies that for a logical table, all segments from all physical tables are collected and
 * prune is invoked once (cross-table). With LIMIT 5 and segments of 10 docs each, only 1 segment
```
I think we should have a more realistic test (here or in the integration tests) where we prune based on min/max statistics across tables, so the query should have both ORDER BY and LIMIT clauses.
Even if the limit is 5 and the number of docs per segment is 10, set up the segment metadata such that the pruning returns multiple segments.
That will be a more comprehensive test for the use case.
Added a test.
```java
assertEquals(selectedSegmentsInfo.getNumSelectedSegments(), 2,
    "ORDER BY DESC LIMIT 10 with overlapping ranges should select exactly 2 segments");
List<SegmentContext> contexts = selectedSegmentsInfo.getSelectedSegmentContexts();
// Seg1: [100, 100] 10 docs - first in DESC order, covers LIMIT 10
// Seg2: [90, 101] 10 docs - overlaps (max 101 > 100), kept
// Seg3, Seg4, Seg5: [1, 50] 10 docs each - max 50 < 100, pruned
```
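The min/max-statistics selection the test exercises can be illustrated with a standalone sketch. This is not Pinot's actual SelectionQuerySegmentPruner, just the idea: for ORDER BY col DESC LIMIT n, visit segments in descending order of their max value, and once the limit is covered stop at the first segment whose range can no longer contribute to the top n. The class, record, and boundary heuristic below are all hypothetical.

```java
import java.util.*;

// Hypothetical sketch of min/max-statistics pruning for ORDER BY col DESC LIMIT n.
class MinMaxPruneSketch {

  // Per-segment column statistics: value range [min, max] and document count.
  record SegmentStats(String name, int min, int max, int numDocs) {}

  static List<String> pruneForOrderByDescLimit(List<SegmentStats> segments, int limit) {
    // Segments with larger max values come first in DESC order.
    List<SegmentStats> sorted = new ArrayList<>(segments);
    sorted.sort(Comparator.comparingInt(SegmentStats::max).reversed());
    List<String> selected = new ArrayList<>();
    int docsCovered = 0;
    int boundary = Integer.MIN_VALUE; // largest min among segments taken so far
    for (SegmentStats s : sorted) {
      // Once the limit is covered, a segment whose max falls below the boundary
      // cannot overlap the selected ranges and is pruned.
      if (docsCovered >= limit && s.max() < boundary) {
        break;
      }
      selected.add(s.name());
      docsCovered += s.numDocs();
      boundary = Math.max(boundary, s.min());
    }
    return selected;
  }

  public static void main(String[] args) {
    // Same shape as the scenario in the test above: with LIMIT 10, the segment
    // with max 101 covers the limit, the [100, 100] segment still overlaps it,
    // and the [1, 50] segments are pruned.
    List<SegmentStats> segments = List.of(
        new SegmentStats("seg1", 100, 100, 10),
        new SegmentStats("seg2", 90, 101, 10),
        new SegmentStats("seg3", 1, 50, 10),
        new SegmentStats("seg4", 1, 50, 10),
        new SegmentStats("seg5", 1, 50, 10));
    System.out.println(pruneForOrderByDescLimit(segments, 10)); // prints [seg2, seg1]
  }
}
```

Running this prune over the combined segment list of a logical table is exactly what lets two overlapping segments survive while the three low-range segments are dropped, regardless of which physical table each segment lives in.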
Implement cross-table segment pruning for logical table

Summary

For logical tables, segment pruning is now performed once across all segments from every physical table (cross-table prune) instead of once per physical table. This allows pruners such as SelectionQuerySegmentPruner (ORDER BY + LIMIT) to prune effectively across the logical table, reducing the number of segments processed and aligning behavior with a single physical table holding the same data.

Problem

LogicalTableExecutionInfo.getSelectedSegmentsInfo previously called SingleTableExecutionInfo.getSelectedSegmentsInfo (and thus segmentPrunerService.prune) per physical table.

Solution

- Collect segments from all SingleTableExecutionInfo instances into one list and maintain a segmentToTable map.
- Call segmentPrunerService.prune(allSegments, queryContext, prunerStats, executorService) on the combined list.

Single-table execution is unchanged; only the logical-table path in LogicalTableExecutionInfo is modified.

Backward compatibility

SingleTableExecutionInfo behavior is unchanged; only the logical-table branch in LogicalTableExecutionInfo.getSelectedSegmentsInfo is new.