High-performance binary format for compressed nucleic acid sequences
ZNA (compressed nucleic acid) is a specialized binary format for storing DNA/RNA sequences, designed for high compression and fast I/O.
Features:

- 135 MB/s roundtrip throughput (9.5x faster than the Python baseline)
- 2.8+ GB/s encoding/decoding for long reads
- 3.7-4.0x compression ratio with Zstd
- C++ acceleration with a pure Python fallback
- Block-based architecture for memory efficiency
- Supports single-end, paired-end, and interleaved reads
- Supports strand-specific protocols
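
To give a feel for why a block-based layout helps with memory use, here is a minimal sketch of the general idea. It is not ZNA's actual on-disk format or API: the function names, the `BLOCK_SIZE` value, and the length-prefixed layout are all assumptions made for illustration, and `zlib` stands in for Zstd so the example runs with only the standard library.

```python
import io
import struct
import zlib  # stand-in for Zstd (assumption); chosen only for portability

BLOCK_SIZE = 4  # hypothetical number of sequences per block


def write_blocks(sequences, out, block_size=BLOCK_SIZE):
    """Write sequences as independently compressed, length-prefixed blocks."""
    for start in range(0, len(sequences), block_size):
        block = sequences[start:start + block_size]
        payload = "\n".join(block).encode("ascii")
        compressed = zlib.compress(payload)
        # 4-byte little-endian length prefix, then the compressed payload.
        out.write(struct.pack("<I", len(compressed)))
        out.write(compressed)


def read_blocks(inp):
    """Yield sequences block by block; only one block is decompressed at a time."""
    while True:
        header = inp.read(4)
        if not header:
            break
        (size,) = struct.unpack("<I", header)
        payload = zlib.decompress(inp.read(size))
        yield from payload.decode("ascii").split("\n")


def roundtrip(sequences):
    """Write to an in-memory buffer and read the sequences back."""
    buf = io.BytesIO()
    write_blocks(sequences, buf)
    buf.seek(0)
    return list(read_blocks(buf))


if __name__ == "__main__":
    reads = ["ACGT", "GGCC", "TTAA", "ACGN", "CCCC"]
    print(roundtrip(reads) == reads)
```

Because each block is compressed on its own, a reader can stream through a large file while holding only one decompressed block in memory, at the cost of a slightly lower compression ratio than compressing the whole file at once.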