docs: add RFC for Query Results Cache #60
Conversation
c73508e to be988ea
be988ea to bf5ec2e
> 1. Compute canonical plan hash.
> 2. Check `TempStorage.exists()` on the metadata handle for the key.
> 3. On exists: read and deserialize `QueryResultsCacheEntry` from metadata file. Validate: schema match (column names + types vs current `OutputNode`), expiration check, whole-result `resultHash` check.
Does output column order participate in cache validation? The RFC mentions validating column names + types, but it may help to explicitly mention column ordering as well.
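A schema check that makes ordering part of the cache-hit criteria could be as simple as a positional comparison. This is a minimal sketch only; the class and record names are hypothetical, not the RFC's API:

```java
import java.util.List;
import java.util.Objects;

// Hypothetical sketch: schema validation that is sensitive to column order.
// Comparing the two column lists positionally (rather than as sets) means a
// reordered projection is treated as a cache miss.
public final class SchemaValidator
{
    public record Column(String name, String type) {}

    public static boolean schemasMatch(List<Column> cached, List<Column> current)
    {
        if (cached.size() != current.size()) {
            return false;
        }
        for (int i = 0; i < cached.size(); i++) {
            if (!Objects.equals(cached.get(i), current.get(i))) {
                return false; // name or type differs at position i
            }
        }
        return true;
    }
}
```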
> - A running byte counter tracks total result size. If it exceeds `max-result-size`, the tee-write is abandoned and partial files are cleaned up.
> - On successful `SELECT` completion, the `QueryResultsCacheWriter` collects the accumulated `TempStorageHandle` references, builds a `QueryResultsCacheEntry`, and stores metadata in the cache provider.
> - If the query fails or is cancelled, partial cache writes are discarded.
How are orphaned/partial cache objects cleaned up? Or cache write starts only after query is completely FINISHED?
For example, if page uploads succeed but metadata.json write fails, or the coordinator crashes during cache population. Is cleanup purely TTL-based?
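The size-capped tee-write described in the quoted hunk can be sketched as a stream wrapper that abandons the side copy once the running byte count exceeds the limit, leaving the client-bound path untouched. Class and method names here are illustrative, assuming a byte-stream view of the output:

```java
import java.io.IOException;
import java.io.OutputStream;

// Hypothetical sketch of the size-capped tee-write: bytes are copied to the
// cache as a side effect of normal output, and the copy is abandoned once
// the running byte count exceeds the configured max-result-size.
public final class CappedTeeOutputStream extends OutputStream
{
    private final OutputStream primary; // normal client-bound output
    private OutputStream cacheCopy;     // side copy; null once abandoned
    private final long maxResultSize;
    private long bytesWritten;

    public CappedTeeOutputStream(OutputStream primary, OutputStream cacheCopy, long maxResultSize)
    {
        this.primary = primary;
        this.cacheCopy = cacheCopy;
        this.maxResultSize = maxResultSize;
    }

    @Override
    public void write(int b) throws IOException
    {
        primary.write(b); // the client path is never affected by the cache
        bytesWritten++;
        if (cacheCopy != null) {
            if (bytesWritten > maxResultSize) {
                cacheCopy.close(); // a real impl would also delete partial files
                cacheCopy = null;  // abandon the tee
            }
            else {
                cacheCopy.write(b);
            }
        }
    }

    public boolean cacheAbandoned()
    {
        return cacheCopy == null;
    }
}
```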
> 3. On exists: read and deserialize `QueryResultsCacheEntry` from metadata file. Validate: schema match (column names + types vs current `OutputNode`), expiration check, whole-result `resultHash` check.
> 4. On hit: `serveCachedResult()` — read `SerializedPage` objects from `TempStorage`, verify per-page CRC, enqueue into `OutputBuffer`, transition to finishing. Client sees no difference from normal execution.
> 5. On miss: register query ID + cache key for population on completion, proceed with normal scheduling.
How is resource usage accounted for on cache hits? Since cached queries bypass tasks/operators/exchanges:
- Are they still counted against Resource Groups/concurrency limits?
- How are coordinator CPU/network/memory costs tracked?
On similar lines, what are the operator-level metrics that will show up for such a query?
To me, a cached query result is akin to rewriting the query to an MV or a (materialized) `CTEProducerNode`, so I would expect to see the plan + metrics related to reading data from such a proxy source node. Maybe we can call it a `CachedQueryResults` plan node?
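The lookup and validation steps quoted above can be sketched end to end. This uses an in-memory map as a stand-in for `TempStorage`, and `Arrays.hashCode` as a stand-in for the real whole-result hash; all names are illustrative, not the RFC's API:

```java
import java.util.Arrays;
import java.util.Map;
import java.util.Optional;

// Minimal sketch of steps 1-5: existence check, validation, hit/miss decision.
public final class CacheLookup
{
    public record Entry(String schemaSignature, long expiresAtMillis, int resultHash, byte[] pages) {}

    public static Optional<byte[]> lookup(
            Map<String, Entry> cache,
            String planHash,               // step 1: canonical plan hash (computed upstream)
            String currentSchemaSignature,
            long nowMillis)
    {
        Entry entry = cache.get(planHash); // step 2: existence check
        if (entry == null) {
            return Optional.empty();       // step 5: miss -> register for population, schedule normally
        }
        // step 3: validate schema, expiration, and whole-result hash
        if (!entry.schemaSignature().equals(currentSchemaSignature)
                || nowMillis >= entry.expiresAtMillis()
                || entry.resultHash() != Arrays.hashCode(entry.pages())) {
            return Optional.empty();       // any validation failure is treated as a miss
        }
        return Optional.of(entry.pages()); // step 4: hit -> serve cached pages
    }
}
```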
> CRCs are computed over the **encrypted** page bytes, not the plaintext. Integrity and confidentiality are therefore independent: corruption is detectable without decrypting the page first, and the CRC does not leak plaintext structure.
> Any integrity check failure (per-page or whole-result) is treated identically to a cache miss. The query proceeds to normal execution and the corrupted entry is overwritten on completion via the normal write path. Failures are emitted as a metric (`cache.integrity_failure_count`) for visibility.
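The CRC-over-ciphertext property quoted above can be illustrated with a small sketch. The AES-GCM choice and all names here are illustrative assumptions, not mandated by the RFC:

```java
import java.security.SecureRandom;
import java.util.zip.CRC32;
import javax.crypto.Cipher;
import javax.crypto.SecretKey;
import javax.crypto.spec.GCMParameterSpec;

// Sketch: per-page CRC computed over the *encrypted* bytes. Corruption is
// detectable without decrypting, and the CRC reveals nothing about plaintext.
public final class PageChecksum
{
    public record StoredPage(byte[] ciphertext, long crc) {}

    public static StoredPage encryptAndChecksum(byte[] plaintextPage, SecretKey key) throws Exception
    {
        byte[] iv = new byte[12];
        new SecureRandom().nextBytes(iv);
        Cipher cipher = Cipher.getInstance("AES/GCM/NoPadding");
        cipher.init(Cipher.ENCRYPT_MODE, key, new GCMParameterSpec(128, iv));
        byte[] ciphertext = cipher.doFinal(plaintextPage);

        CRC32 crc = new CRC32();
        crc.update(ciphertext); // checksum the ciphertext, not the plaintext
        return new StoredPage(ciphertext, crc.getValue());
    }

    public static boolean verify(StoredPage page)
    {
        CRC32 crc = new CRC32();
        crc.update(page.ciphertext()); // no key needed to detect corruption
        return crc.getValue() == page.crc();
    }
}
```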
Do we plan to expose cache-hit observability/metrics?
For example:
- servedFromResultCache
- EventListener visibility
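One hypothetical shape for the observability asked about here: a per-query flag plus counters that an `EventListener` could surface. None of these names exist in Presto today; only `cache.integrity_failure_count` is mentioned by the RFC:

```java
import java.util.concurrent.atomic.AtomicLong;

// Illustrative counters for cache-hit observability (names are assumptions).
public final class ResultCacheStats
{
    private final AtomicLong hitCount = new AtomicLong();
    private final AtomicLong missCount = new AtomicLong();
    private final AtomicLong integrityFailureCount = new AtomicLong();

    public void recordHit() { hitCount.incrementAndGet(); }

    public void recordMiss() { missCount.incrementAndGet(); }

    public void recordIntegrityFailure()
    {
        integrityFailureCount.incrementAndGet(); // cache.integrity_failure_count
        missCount.incrementAndGet();             // integrity failure is treated as a miss
    }

    public double hitRate()
    {
        long total = hitCount.get() + missCount.get();
        return total == 0 ? 0.0 : (double) hitCount.get() / total;
    }
}
```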
> 1. Compute canonical plan hash.
> 2. Check `TempStorage.exists()` on the metadata handle for the key.
> 3. On exists: read and deserialize `QueryResultsCacheEntry` from metadata file. Validate: schema match (column names + types vs current `OutputNode`), expiration check, whole-result `resultHash` check.
> 4. On hit: `serveCachedResult()` — read `SerializedPage` objects from `TempStorage`, verify per-page CRC, enqueue into `OutputBuffer`, transition to finishing. Client sees no difference from normal execution.
What query states will cache-hit queries go through? Since execution is bypassed, will queries still transition through RUNNING as expected by the UI/client protocol?
> ## Future Work
> - **Predicate stitching**: Partial cache reuse for partition-decomposable queries when only some partitions change. Requires per-partition stats.
+1. I think we could extend the idea to do sub-plan matching, similar to how we're exploring MV matching
For purging the cache, I'm wondering whether we have an approach in mind or will just delete the cache directory on restart. Also wondering what happens if local temp storage runs out of capacity, since many query results will be stored there.
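One possible answer to the purge question is a periodic TTL sweep; a size-based eviction pass could follow the same shape when local temp storage nears capacity. This is a sketch over an in-memory map, with all names illustrative:

```java
import java.util.Iterator;
import java.util.Map;

// Hypothetical TTL-based purge: drop entries past their expiration.
public final class CachePurger
{
    public record Entry(long expiresAtMillis, long sizeBytes) {}

    // Removes expired entries; returns the number of bytes reclaimed.
    public static long purgeExpired(Map<String, Entry> cache, long nowMillis)
    {
        long reclaimed = 0;
        Iterator<Map.Entry<String, Entry>> it = cache.entrySet().iterator();
        while (it.hasNext()) {
            Entry e = it.next().getValue();
            if (nowMillis >= e.expiresAtMillis()) {
                reclaimed += e.sizeBytes();
                it.remove(); // a real impl would also delete the TempStorage files
            }
        }
        return reclaimed;
    }
}
```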
Abstract
This RFC proposes a Query Results Cache for Presto that caches completed `SELECT` query results so semantically equivalent future queries skip execution entirely. The cache intercepts at `SqlQueryExecution.start()` after optimization, before scheduling. On a cache hit, pages are read from `TempStorage` and fed directly to the client via the existing `OutputBuffer → ExchangeClient → HTTP` response pipeline, bypassing scheduling and execution.