Skip to content

[#11737] docs(core): add multi-node entity store cache invalidation design#11738

Draft
yuqi1129 wants to merge 3 commits into
apache:mainfrom
yuqi1129:docs/entity-cache-multinode-design
Draft

[#11737] docs(core): add multi-node entity store cache invalidation design#11738
yuqi1129 wants to merge 3 commits into
apache:mainfrom
yuqi1129:docs/entity-cache-multinode-design

Conversation

@yuqi1129

Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

Adds a design doc, design-docs/entity-cache-multinode-changelog-design.md, proposing how to make the entity store cache (gravitino.cache.enabled) usable in multi-node deployments by invalidating it across nodes via the existing entity_change_log and EntityChangeLogPoller. It presents two options (coarse by-metalake vs. precise per-endpoint) with a recommendation, plus a consistency model and a coverage audit.

Why are the changes needed?

Today the entity store cache only invalidates locally, so multi-node deployments must disable it, which hurts performance (Iceberg especially). This is design-only; no code behavior changes yet.

Fix: #11737

Does this PR introduce any user-facing change?

No. Documentation and design only.

How was this patch tested?

No code changes; not applicable. The doc passes the repo docs-style audit scripts.

@github-actions

github-actions Bot commented Jun 23, 2026

Copy link
Copy Markdown

Code Coverage Report

Overall Project 67.23% 🟢
Files changed No Java source files changed -

Module Coverage
aliyun 1.72% 🔴
api 46.82% 🟢
authorization-common 85.96% 🟢
aws 26.5% 🔴
azure 2.47% 🔴
catalog-common 10.4% 🔴
catalog-fileset 80.23% 🟢
catalog-glue 66.91% 🟢
catalog-hive 79.42% 🟢
catalog-jdbc-clickhouse 80.02% 🟢
catalog-jdbc-common 44.22% 🟢
catalog-jdbc-doris 80.28% 🟢
catalog-jdbc-hologres 54.03% 🟢
catalog-jdbc-mysql 79.23% 🟢
catalog-jdbc-oceanbase 80.91% 🟢
catalog-jdbc-postgresql 82.29% 🟢
catalog-jdbc-starrocks 78.51% 🟢
catalog-kafka 77.01% 🟢
catalog-lakehouse-generic 58.53% 🟢
catalog-lakehouse-hudi 79.1% 🟢
catalog-lakehouse-iceberg 85.86% 🟢
catalog-lakehouse-paimon 82.14% 🟢
catalog-model 77.72% 🟢
cli 44.51% 🟢
client-java 78.01% 🟢
common 50.17% 🟢
core 82.58% 🟢
filesystem-hadoop3 77.27% 🟢
flink 0.0% 🔴
flink-common 47.12% 🟢
flink-runtime 0.0% 🔴
gcp 14.12% 🔴
hadoop-common 10.88% 🔴
hive-metastore-common 53.77% 🟢
iceberg-common 58.15% 🟢
iceberg-rest-server 73.9% 🟢
idp-basic 85.71% 🟢
integration-test-common 0.0% 🔴
jobs 66.17% 🟢
lance-common 20.83% 🔴
lance-rest-server 60.13% 🟢
lineage 53.02% 🟢
optimizer 82.95% 🟢
optimizer-api 21.95% 🔴
server 85.96% 🟢
server-common 74.18% 🟢
spark 28.57% 🔴
spark-common 41.66% 🟢
trino-connector 40.25% 🟢

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Improvement] Make entity store cache usable in multi-node deployments

1 participant