Zue Storage Engine

Zue is a distributed, replicated log-structured storage engine written in Zig, providing total ordering and strong consistency guarantees through Raft-like replication.

Core Design

The design is based on the following principles:

Append-Only Log: All writes are sequential appends to a log file, which provides O(1) write complexity and is optimal for most storage hardware.
Segmentation: The log is partitioned into segments, each consisting of a .log file for data and a .index file. This allows for efficient, file-level data retention and cleanup.
Sparse Indexing: To avoid the overhead of indexing every record, an index entry is created only at configurable intervals (e.g., every 4KB). This minimizes the index's memory footprint while still allowing for fast lookups.
Data Integrity: All records and index entries are protected by CRC32 checksums to prevent data corruption.

For a more detailed breakdown, see the wiki.

Getting Started

Prerequisites

Zig 0.15.1 or later
Python 3.7+ (for plotting benchmarks)

Build and Test

To run all unit tests:

zig build test --summary all

To run integration tests:

zig build test-integration --summary all

To run replication tests:

zig build test-replication --summary all

Usage

(This section is under development. See the source code in src/log/log.zig for the public API or see the wiki.)

Benchmarking

The project includes a benchmark suite. A convenience script is provided to run the benchmarks and generate plots:

./run_benchmarks_and_plot.sh

Benchmark data is stored in /tmp/zue_bench_results/, and plots are in benchmark_plots/.

Completed Features

Core Log-Structured Storage Engine: Append-only log with segmentation and sparse indexing
Leader-Follower Replication: Raft-like consensus with parallel, non-blocking I/O
- Quorum-based commits with In-Sync Replica (ISR) tracking
- Leader-driven log repair for consistency
- Hybrid inline recovery for minimal latency (≤10 entry lag recovers inline)
- Heartbeat monitoring and automatic failure detection

In Progress / Future Work

Memory-Mapped I/O Integration: mmap implementations exist but not yet integrated
- MmapSegment, MmapIndex, and MmapLogReader/Writer implementations complete
- Expected 10-100x faster index lookups, 5-50x faster log reads
- Needs integration into main Log and replication system
Log Compaction: Automatic cleanup of old/duplicate entries
Leader Election: Automatic failover on leader failure (currently static leader)
Cluster Status API: Programmatic monitoring of cluster state, ISR, and follower lag

Contributing

Contributions are welcome. Please open an issue to discuss your proposed changes.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build.zig		build.zig
build.zig.zon		build.zig.zon
clean_bench.sh		clean_bench.sh
plot_benchmarks.py		plot_benchmarks.py
requirements.txt		requirements.txt
run_benchmarks_and_plot.sh		run_benchmarks_and_plot.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Zue Storage Engine

Core Design

Getting Started

Prerequisites

Build and Test

Usage

Benchmarking

Completed Features

In Progress / Future Work

Contributing

License

About

Uh oh!

Releases

Packages

Languages

License

lostcache/zue

Folders and files

Latest commit

History

Repository files navigation

Zue Storage Engine

Core Design

Getting Started

Prerequisites

Build and Test

Usage

Benchmarking

Completed Features

In Progress / Future Work

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages