Add query optimization and explain API #1136

bplatz · 2025-10-13T17:31:14Z

Summary

Implements query pattern optimization based on property and class statistics. Adds explain API to show optimization decisions without executing queries.

Key Features

Query Optimization

Reorders WHERE clause patterns based on selectivity scores
Lower selectivity = more selective = execute first
Respects optimization boundaries (filters, binds, etc.)
Statistics-driven: uses property/class counts from index

Explain API

New fluree.db.api/explain function
Shows original and optimized pattern order
Includes selectivity scores for each pattern
User-friendly output with decoded IRIs

Selectivity Scoring

Specific value lookups: 0 (most selective)
ID patterns: 0 (single entity)
Property scans: property count from stats
Class patterns: class count from stats
Full scans: ∞ (least selective)

Implementation

Protocol-based Design

Optimizable protocol for FlakeDB, AsyncDB, Dataset
Pattern segmentation preserves optimization boundaries
Independent segment optimization

Query Integration

Optimization runs automatically before query execution
No breaking changes to existing query API
Federated queries (DataSets) are not optimized

Test Coverage

315 new assertions across 5 integration tests:

No-optimization scenarios (equal selectivity)
Value lookup optimization (specific value → class)
Property count optimization (rare property → common class)
Optimization boundaries (filters separate segments)
Multiple segment optimization

All tests pass (290 tests, 2142 assertions).

- Implemented `explain` function to return query execution plans. - Added `optimize-query` function to reorder query patterns based on selectivity. - Introduced `Optimizable` protocol for query optimization. - Created integration tests for explain functionality and optimization behavior. - Added unit tests for pattern recognition and boundary splitting in optimization.

dpetran

It would be nice to see tests with more unoptimizable patterns with nested clauses: optional, union, subquery, etc.

src/fluree/db/flake/flake_db.cljc

src/fluree/db/query/optimize.cljc

src/fluree/db/flake/flake_db.cljc

dpetran · 2025-10-23T21:06:47Z

I was looking at how to reconcile this explain api and the one I put together earlier this summer: #1030

I think they're quite complementary - if you're familiar with Postgres, this work corresponds to the EXPLAIN statement, reporting information about the query plan, while the other PR more closely corresponds with EXPLAIN ANALYZE, where it actually runs the query and reports true flake counts and other execution metrics.

This one doesn't yet have support for nested clauses, and I think we could integrate the two approaches without too much trouble. And I'd be happy to pick this up and finish it, depending on your availability.

…n for improved readability

…r improved clarity in user-value conversion

… reporting for unexpected types

…ved clarity and consistency

…rove selectivity calculation logic

bplatz · 2025-10-30T19:39:24Z

I was looking at how to reconcile this explain api and the one I put together earlier this summer: #1030

I think they're quite complementary - if you're familiar with Postgres, this work corresponds to the EXPLAIN statement, reporting information about the query plan, while the other PR more closely corresponds with EXPLAIN ANALYZE, where it actually runs the query and reports true flake counts and other execution metrics.

This one doesn't yet have support for nested clauses, and I think we could integrate the two approaches without too much trouble. And I'd be happy to pick this up and finish it, depending on your availability.

Please do! The main purpose of including this is to see how the query got reordered for an end-user, but I'm sure there is lots more value we can bring. The upstream branch includes detailed statistics on each property to explain the state of the data and why it was reordered, so you should at least use that as the baseline for any future work here.

bplatz · 2025-10-30T19:40:06Z

Closing because all work was done on upstream branch which is based off this branch.

bplatz requested a review from a team October 13, 2025 17:31

dpetran reviewed Oct 16, 2025

View reviewed changes

bplatz added 6 commits October 24, 2025 12:25

Merge branch 'feature/data-stats' into feature/query-optimize1

801ca9f

Refactor component value conversion to use JSON-LD compacting functio…

865fcb2

…n for improved readability

Refactor component value handling: remove unnecessary db parameter fo…

701e62c

…r improved clarity in user-value conversion

Enhance component conversion: handle nil components and improve error…

2505113

… reporting for unexpected types

Refactor explain patterns: update type from tuple to triple for impro…

7555eae

…ved clarity and consistency

Refactor query optimization: streamline where clause handling and imp…

3abcd39

…rove selectivity calculation logic

Base automatically changed from feature/data-stats to main October 26, 2025 12:03

Merge remote-tracking branch 'origin/main' into feature/query-optimize1

3c9dff6

bplatz closed this Oct 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add query optimization and explain API #1136

Add query optimization and explain API #1136

Uh oh!

bplatz commented Oct 13, 2025

Uh oh!

dpetran left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dpetran commented Oct 23, 2025

Uh oh!

bplatz commented Oct 30, 2025

Uh oh!

bplatz commented Oct 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add query optimization and explain API #1136

Add query optimization and explain API #1136

Uh oh!

Conversation

bplatz commented Oct 13, 2025

Summary

Key Features

Implementation

Test Coverage

Uh oh!

dpetran left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dpetran commented Oct 23, 2025

Uh oh!

bplatz commented Oct 30, 2025

Uh oh!

bplatz commented Oct 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants