BlazeDB

BlazeDB is a high-performance, specialized storage engine for Minecraft Bedrock Edition worlds, designed essentially as a drop-in replacement for LevelDB.

It is engineered to overcome specific bottlenecks found in general-purpose KV stores like LevelDB when applied to Minecraft's spatial data patterns. BlazeDB leverages Z-Order (Morton) Curves for spatial locality, an append-only architecture for high write throughput, and a sharded O(1) LRU cache for low-latency reads.

Features

Unchanged Save Elision: Repeated saves of byte-identical hot chunks skip disk append and fsync after an exact encoded-record comparison.
Workload APIs: Overlay providers, explicit block-delta records, pinned hot-area caches, and optional segmented providers for reset-heavy or mutation-heavy workloads.
🚀 High Performance: Optimized specifically for chunk data (reads/writes).
🗺️ Spatial Locality: Uses Z-Order (Morton) curves to keep creating/loading nearby chunks physically close on disk, effectively acting as a hardware-level prefetch.
💾 Append-Only Storage: Writes are always appended to the end of the file, eliminating random write seeks and "compaction stalls" during gameplay.
🧠 Smart Caching: Sharded, thread-safe LRU cache with O(1) operations and auto-tuning capabilities.
⚡ Async Writes: Non-blocking background write worker with buffering to prevent frame drops during heavy chunk generation.
📦 Compression: Supports LZ4 (default, ultra-fast) and Snappy, and builds encoded records directly to minimize copying.
🛡️ Crash Safety: Atomic offset tracking and index rebuilding capabilities.

Architecture & How It Works

Data Storage Schematic

BlazeDB avoids the complexity of LSM Trees (used by LevelDB) in favor of a simpler, faster append-only model paired with an in-memory spatial index.

Minecraft Server
|-- Put Chunk
|   `-- Write Buffer (map)
|       |-- async flush -> chunks.dat (append-only data)
|       `-- index update -> Spatial Index (Z-order map)
|
`-- Get Chunk
    `-- Sharded LRU Cache
        `-- miss -> Spatial Index (Z-order map)
            `-- lookup offset -> chunks.dat
                `-- read + decompress -> Sharded LRU Cache

Repeated unchanged saves compare the new encoded record with the cached
encoded record for that chunk. If the bytes are identical, BlazeDB returns
without appending another record or syncing the data file.

Disk files:
|-- chunks.dat  append-only full chunk data
|-- index.dat   persisted spatial index
`-- deltas.dat  optional explicit block-delta records

1. Z-Order Spatial Indexing

Minecraft accesses data spatially (e.g., "load all chunks around player X,Z"). Standard KV stores like LevelDB just see keys as strings and may scatter nearby chunks across different SSTables.

BlazeDB interleaves the bits of the chunk X and Z coordinates to create a Morton Code. This ensures that chunks which are close to each other in the game world are mathematically close in the index.

Key: {MortonCode(x, z), DimensionID}
Value: {FileOffset, Size, DimensionID}

2. Append-Only Data File (`chunks.dat`)

When a chunk is saved, it is effectively serialized, compressed (LZ4), and appended to the end of chunks.dat. The index is updated to point to this new location.

Pros: Zero seek time for writes; extremely fast.
Cons: Old versions of chunks remain in the file until a "compaction" process runs (similar to garbage collection).

3. Chunk Format

Each entry in chunks.dat follows this binary format:

[Header: "BLAZ"]      4 bytes
[Size]                4 bytes (Uint32)
[CRC32]               4 bytes
[ChunkX, ChunkZ]      8 bytes (Int32, Int32)
[DimensionID]         4 bytes (Int32)
[CompressionType]     1 byte
[SubChunkCount]       1 byte
[Biomes Data]         Length + Data
[SubChunks Data...]   Length + Data (Repeated)
[Entities Data]       Length + NBT
[BlockEntities Data]  Length + NBT

4. Sharded LRU Cache

To handle the high concurrency of a multi-player server, the in-memory cache is partitioned into 16 shards. This reduces lock contention, allowing multiple routines to read/write cache entries simultaneously without waiting on a single global lock.

BlazeDB vs. LevelDB

Feature	BlazeDB	LevelDB (mcdb)
Write Strategy	Append-Only (Log Structured)	LSM Tree (Sorted Strings Table)
Write Performance	Extremely High (Sequential I/O)	High (but suffers from compaction stalls)
Read Performance	High (Spatial Indexing + Caching)	High (Bloom Filters + Block Cache)
Spatial Locality	Excellent (Z-Order Curve)	Poor/Random (Lexicographical)
Concurrency	Sharded Locks (Low contention)	Global/Level Implementation dependent
Disk Usage	Higher (Requires periodic compaction)	Lower (Aggressive auto-compaction)
Compression	LZ4 (Fast)	Snappy / Zlib
Stalls	Rare (Background compaction optional)	Occasional (LevelDB write stalls)

Benchmarks

Lower is better. These numbers are from the committed benchmark suite in tests/benchmark_test.go, run locally on Windows/amd64 with an AMD Ryzen 7 7730U and Go 1.26.3. The benchmark columns use synthetic Dragonfly chunk.Column values plus a generated dense Dragonfly-style chunk fixture; default BlazeDB rows use LZ4 compression unless noted. Allocation columns are -benchmem per-operation allocation, not total process RSS.

Workload	BlazeDB	LevelDB	Speedup	BlazeDB Alloc/op	LevelDB Alloc/op
Hot cached chunk load	~82.9 ns/op	~43.8 us/op	~529x	0 B	~15.6 KB
Store chunk, default LZ4	~4.2 us/op	~74.8 us/op	~18x	~2.8 KB	~15.6 KB
Store chunk, no compression	~3.57 us/op	~74.8 us/op	~21x	~2.2 KB	~15.6 KB
Repeated unchanged chunk save, safe mode	~5.47 us/op	~55.8 us/op	~10x	~2.1 KB	~17.3 KB
Reopened tiny-cache chunk load	~26.9 us/op	~1.93 ms/op	~72x	~12.2 KB	~734 KB
Reopened 11x11 area load	~3.21 ms/op	~236.6 ms/op	~74x	~1.48 MB	~86.2 MB
Generated dense chunk store	~11.2 us/op	~166.4 us/op	~15x	~15.4 KB	~48.0 KB
Generated dense chunk reopened load	~84.0 us/op	~1.48 ms/op	~18x	~28.1 KB	~637 KB

BlazeDB also caches negative area misses. In sparse repeated area scans, the benchmarked path drops from about 190 us/op cold to about 11.1 us/op once misses are cached.

For repeated same-position saves of unchanged chunks, BlazeDB keeps the last encoded hot record in the chunk cache and performs an exact byte comparison before committing a new record. In the targeted benchmark, DurabilitySafe unchanged saves dropped from about 548 us/op before this optimization to about 5.47 us/op, while DurabilitySafeBatch unchanged saves measured about 6.75 us/op. Dense unchanged safe saves still pay Dragonfly encode cost and measured about 31.2 us/op, but skip the expensive append/fsync step.

Provider workflow benchmarks from tests/provider_workflow_benchmark_test.go:

Workload	Cost	Alloc/op	Notes
Explicit one-block delta record	~14-19 us/op	~166-183 B	Avoids full chunk-column encoding for event-layer block edits.
Overlay edit/reset mix	~398-456 us/op	~62 KB	Writes arena mutations to a disposable overlay and periodically resets it.
Segmented provider store	~42-51 us/op	~4.7-5.3 KB	Routes chunks into per-region DBs for independent deletion/compaction.

Cache-size tuning benchmarks showed that 128MB is the smallest tested cache setting that keeps a generated dense radius-32 working set hot after warm-up. In that benchmark, 64MB was enough for radius-16, while radius-32 needed 128MB to reach 100% cache hits and roughly 150 ns/op cached loads. Larger caches are still useful for multiple players in different areas.

Safety mode store costs from the same benchmark run:

Mode	Store Cost	vs LevelDB Store	Recommended Use
`DurabilityFast`	~4.1 us/op	~18x faster	Benchmarks, bulk conversion, local testing, and worlds where losing recent buffered writes is acceptable.
`DurabilityBalanced`	~3.6 us/op	~21x faster	Recommended server mode: keeps buffered throughput while syncing completed flush batches.
`DurabilitySafe`	~558 us/op	~7.4x slower	Critical single-threaded writes where `StoreColumn` must not return until chunk bytes are synced to disk.
`DurabilitySafeBatch`	~551 us/op sequential; ~285 us/op parallel	~7.3x slower sequential; ~3.8x slower parallel	Concurrent critical writes that can share synced flush batches while preserving per-call durability.

Safety Modes

BlazeDB exposes four durability modes so you can choose the safety/performance tradeoff explicitly:

Mode	What It Guarantees	Performance Cost
`DurabilityFast`	Recent writes may be lost if the process or machine dies before buffered data reaches disk. Existing synced chunk records can still be recovered by index rebuild.	Fastest write path; no fsync on store/flush.
`DurabilityBalanced`	Once a flush batch completes, its chunk bytes are fsynced and recoverable even if `index.dat` is stale or corrupted. Writes still return when queued, so the latest queued writes can be lost before the next flush.	Near-fast-mode caller latency; fsync cost is paid per flush batch.
`DurabilitySafe`	`StoreColumn` writes the chunk record and fsyncs `chunks.dat` before returning. If the newest record is later corrupt, rebuild can fall back to the previous valid record.	Slowest mode because every store pays storage sync latency.
`DurabilitySafeBatch`	`StoreColumn` returns only after the caller's write has been included in a synced flush. Concurrent callers may share the same fsync.	Similar to `DurabilitySafe` for single writes, faster under concurrent critical write bursts.

opts := blazedb.BalancedOptions() // recommended for production servers
db, err := blazedb.Config{Options: opts}.Open("world_blazedb")

Use SafeOptions() when per-call durability matters more than throughput:

opts := blazedb.SafeOptions()
db, err := blazedb.Config{Options: opts}.Open("world_blazedb")

Use SafeBatchOptions() when multiple goroutines may save critical chunks at the same time and each caller still needs to wait for synced bytes:

opts := blazedb.SafeBatchOptions()
db, err := blazedb.Config{Options: opts}.Open("world_blazedb")

Recovery and Durability

BlazeDB treats index.dat as a persisted acceleration index. If it is missing, corrupted, or stale compared with chunks.dat, BlazeDB rebuilds it on open by scanning valid records in chunks.dat, then saves the rebuilt index atomically. During rebuild, incomplete tail records from interrupted appends are truncated, complete records with bad CRCs are skipped, and older valid chunk versions remain usable.

Old chunk versions are removed only when Compact() is called.

Delta records are optional and are only used when Options.EnableDeltaRecords is true and callers use StoreBlockDeltas. A later full StoreColumn snapshots the chunk and supersedes older deltas. Compact() folds active deltas back into full chunk records and truncates deltas.dat.

Pros of BlazeDB

No Write Stalls: Since it just appends to a file, you don't get the "hiccups" caused by LevelDB compacting generic string keys in the background.
Faster Chunk Loading: The use of Z-order curves means that when the OS reads pages from the disk, it's more likely to grab specifically the chunks relevant to the player's area.
Modern Compression: LZ4 is significantly faster than standard Snappy/Zlib implementations for this use case.

Cons of BlazeDB

Disk Space: Because it is append-only, if a chunk is saved 100 times, it exists 100 times in the file until compacted. LevelDB handles this cleanup automatically and continuously.
Memory Usage: The in-memory index scales with world size. For massive worlds (terabytes), the index RAM usage needs to be considered (though it is quite compact).

Usage

package main

import (
    "log/slog"

    "github.com/cqdetdev/blazedb"
    "github.com/df-mc/dragonfly/server"
)

func main() {
    // Open BlazeDB provider
    p, err := blazedb.Config{Options: blazedb.DefaultOptions()}.Open("path/to/world/db")
    if err != nil {
        panic(err)
    }
    
    // Use it in Dragonfly
    userConf := server.DefaultConfig()
    userConf.World.SaveData = false
    conf, err := userConf.Config(slog.Default())
    if err != nil {
        panic(err)
    }
    conf.WorldProvider = p
    
    srv := conf.New()
    srv.CloseOnProgramEnd()
    srv.Listen()
    
    for range srv.Accept() {
    }
}

High-Churn and Ephemeral Worlds

For long-running worlds, disposable arenas, minigames, and generated maps, BlazeDB exposes opt-in APIs that keep the default provider fast while allowing server-specific storage strategies.

Presets

active := blazedb.ActiveWorldOptions()       // good starting point for factions/warzones
ephemeral := blazedb.EphemeralWorldOptions() // good starting point for practice/duel overlays

Read-only template plus disposable overlay

Use an overlay provider for temporary maps. The base provider is the clean template; the overlay receives all session mutations. Resetting the map discards the overlay instead of copying the whole map. This fits practice and duel arenas well, but the API itself is generic.

template, err := blazedb.Open("templates/nodebuff")
if err != nil {
    panic(err)
}

arena, err := blazedb.OpenOverlayProvider(template, "arenas/match-123", blazedb.EphemeralWorldOptions(), nil)
if err != nil {
    panic(err)
}

// Use arena as the Dragonfly world provider.
// After the match:
_ = arena.ResetOverlay()

Overlay fallback clones template chunks by default, so mutating a loaded arena chunk does not mutate the shared template cache.

Explicit block-delta records

When the server event layer knows exactly which blocks changed, call StoreBlockDeltas instead of saving a full chunk column for every small edit:

opts := blazedb.ActiveWorldOptions()
db, err := blazedb.Config{Options: opts}.Open("world_blazedb")
if err != nil {
    panic(err)
}

err = db.StoreBlockDeltas(chunkPos, world.Overworld, []blazedb.BlockDelta{{
    Pos:       blockPos, // absolute block position is fine
    Layer:     0,
    RuntimeID: runtimeID,
}})

Delta records are not automatic dirty tracking. Dragonfly still calls StoreColumn for normal provider saves. Use deltas from block-place/break/explosion handlers when your server already has exact mutation events.

Pinned hot areas

Pin known hot arenas or spawn/warzone areas so cache auto-tuning and LRU eviction do not push them out:

pinned, err := db.PinArea(world.ChunkPos{0, 0}, 8, world.Overworld)
_ = pinned
_ = err
defer db.UnpinArea(world.ChunkPos{0, 0}, 8, world.Overworld)

Pinned entries may let the cache exceed its configured limit if you pin more chunks than the cache can hold, so pin deliberately.

Optional segmented provider

Use segmentation when you want independent per-region storage files, for example deleting a finished arena shard or compacting a hot factions region without touching the full world:

provider, err := blazedb.OpenSegmentedProvider("world_segmented", &blazedb.SegmentOptions{
    SegmentSize: 32,
    Options:     blazedb.ActiveWorldOptions(),
})
if err != nil {
    panic(err)
}

// Delete the segment containing a disposable arena.
_ = provider.DeleteSegmentFor(world.ChunkPos{128, 0}, world.Overworld)

Segmentation adds routing/open-file overhead, so use it for reset/operations benefits rather than for the fastest single-chunk store path.

Predictive Prefetching

BlazeDB includes a smart, threaded prefetcher that analyzes player movement to predict and load future chunks before they are requested.

Velocity-Based: Calculates player direction and velocity to determine where they are heading.
Look-Ahead: Loads 1-3 chunks ahead in the movement direction (including diagonals).
Background Workers: runs on dedicated goroutines to avoid blocking the main server thread.

// The prefetcher is automatically initialized with the DB.
// You just need to update it with player positions.
prefetcher.UpdatePlayerPosition(playerUUID, currentChunkPos, dimension)

Iterator

BlazeDB provides a ColumnIterator for efficient scanning of the world. It uses the spatial index to quickly identify relevant chunks without reading them all from disk.

iter := db.NewColumnIterator(&blazedb.IteratorRange{
    Min: world.ChunkPos{0, 0},
    Max: world.ChunkPos{100, 100},
    Dimension: world.Overworld,
})
defer iter.Release()

for iter.Next() {
    pos := iter.Position()
    col := iter.Column()
    // Process chunk...
}
if err := iter.Error(); err != nil {
    // Handle error
}

Advanced Configuration

Option	Default	Description
`CacheSize`	`128MB`	Maximum memory usage for the chunk cache.
`Compression`	`CompressionLZ4`	Compression algorithm (`LZ4`, `Snappy`, `None`).
`WriteBufferSize`	`4MB`	Size of the in-memory write buffer before forcing a flush.
`FlushInterval`	`1000ms`	Simple periodic background flushes.
`VerifyChecksums`	`false`	Verify CRC32 checksums on read (costs CPU).
`Durability`	`DurabilityFast`	Fsync/buffering policy (`Fast`, `Balanced`, `Safe`, `SafeBatch`).
`EnableDeltaRecords`	`false`	Enables the explicit `StoreBlockDeltas` API and `deltas.dat`.
`Log`	`slog.Default()`	Logger for debug/error messages.

TurboOptions() uses a 512MB cache, CompressionNone, a 16MB write buffer, and a 5000ms flush interval for maximum write throughput when crash-window tradeoffs are acceptable.

BalancedOptions() enables DurabilityBalanced, checksum verification, Snappy compression, and smaller flush batches. SafeOptions() enables DurabilitySafe, checksum verification, immediate writes, and no background write buffer. SafeBatchOptions() enables DurabilitySafeBatch for synced batch commits across concurrent callers.

ActiveWorldOptions() starts from BalancedOptions(), uses a 256MB cache, and enables explicit delta records. Use it as a starting point for factions, warzones, and other long-running mutation-heavy worlds. EphemeralWorldOptions() starts from TurboOptions(), uses a smaller 64MB cache, and enables explicit delta records for disposable overlays, duel arenas, minigame rounds, or generated temporary worlds.

API Ideas Still Worth Exploring

Dirty-chunk adapters for Dragonfly handlers so clean chunks never reach StoreColumn.
Arena lease management around OverlayProvider, including automatic cleanup of old overlay directories.
Region-level compaction scheduling for SegmentedProvider.
Metrics hooks for overlay hit rate, delta record count, pinned cache size, and per-segment write pressure.
Optional "snapshot overlay into base" tooling for promoting an edited arena template.

Credits

Antigravity and Claude Code

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
cmd		cmd
tests		tests
.gitignore		.gitignore
README.md		README.md
cache.go		cache.go
column_clone.go		column_clone.go
compression.go		compression.go
conf.go		conf.go
db.go		db.go
delta.go		delta.go
doc.go		doc.go
go.mod		go.mod
go.sum		go.sum
index.go		index.go
iterator.go		iterator.go
negative_cache.go		negative_cache.go
options.go		options.go
overlay.go		overlay.go
prefetch.go		prefetch.go
segmented.go		segmented.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BlazeDB

Features

Architecture & How It Works

Data Storage Schematic

1. Z-Order Spatial Indexing

2. Append-Only Data File (`chunks.dat`)

3. Chunk Format

4. Sharded LRU Cache

BlazeDB vs. LevelDB

Benchmarks

Safety Modes

Recovery and Durability

Pros of BlazeDB

Cons of BlazeDB

Usage

High-Churn and Ephemeral Worlds

Presets

Read-only template plus disposable overlay

Explicit block-delta records

Pinned hot areas

Optional segmented provider

Predictive Prefetching

Iterator

Advanced Configuration

API Ideas Still Worth Exploring

Credits

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

BlazeDB

Features

Architecture & How It Works

Data Storage Schematic

1. Z-Order Spatial Indexing

2. Append-Only Data File (chunks.dat)

3. Chunk Format

4. Sharded LRU Cache

BlazeDB vs. LevelDB

Benchmarks

Safety Modes

Recovery and Durability

Pros of BlazeDB

Cons of BlazeDB

Usage

High-Churn and Ephemeral Worlds

Presets

Read-only template plus disposable overlay

Explicit block-delta records

Pinned hot areas

Optional segmented provider

Predictive Prefetching

Iterator

Advanced Configuration

API Ideas Still Worth Exploring

Credits

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

2. Append-Only Data File (`chunks.dat`)

Packages