Genozip Standard
Objective
​
Genozip Standard was created with the objective of revolutionizing compression of genomic data. In the vast majority of cases it achieves better compression than any other compressor avaiable, commercial or open source, and often by a large margin.
​
How good is Genozip?
​
Below are some benchmarks across a wide range of data. Note that the compression results may vary considerably and depend on the specific details of the data - you are encouraged to test Genozip on your data (free 30 day evaluation).
FASTQ
Genozip compression of .fastq.gz files generated by a variety of sequencers - showing their relative size before and after Genozip compression. Genozip can compress with or without a reference, however using --reference is always advisable for FASTQ files. --optimize is avaiable with Genozip Enterprise. The files tested were obtained from (in order): 1. (unpublished) 2. here 3. here
BAM (or CRAM or SAM)
Genozip compression of .bam files generated by a variety of sequencers and aligners - showing their relative size before and after Genozip compression. . Genozip can compress with or without a reference, however using --reference improves compression of BAM fifiles. --optimize is avaiable with Genozip Enterprise. The files tested were obtained from (in order): 1. here 2. (unpublished) 3. here 4. here
VCF
Genozip compression of a variety of .vcf.gz files - showing their relative size before and after Genozip compression. Compression was done using the --best command line option, and for the two GVCF files, using the --reference options as well. The files tested were obtained from (in order): 1. (unpublished) 2. here 3. here 4. here 5. here 6. here.
Losslessness
​
Genozip is a lossless compressor, meaning that when a file is decompressed, it is exactly identical to the source data, verifiable by MD5.
​
Features
​
⦾ Compresses (almost) all file types: while the vast majority of data that is compressed with Genozip consists of FASTQ, BAM or VCF files, Genozip is designed to compress many other genomic formats. In fact, Genozip can compress any file (with some exceptions), usually better and faster than common general purpose compressors.
​
⦾ Archive entire directories with the --tar option: you can compress entire directories, including all subdirectories and files (genomic or not) directly into a standard tar file, in preparation for delivering it to a customer or archiving it.
​
⦾ Partial decompression ("random access"): It is possible to access parts of a compressed file without decompressing the entire file.
​
⦾ Many more features.
​
​
Licensing
​
Genozip is licensed by a flat annual license fee for unlimited usage: unlimited data, unlimited machines, unlimited users - as long as they all belong to the licensing organization. See pricing.
​
​
Free decompression forever
​
The payment of a Genozip license fee covers compression of files. In contrast, decompression does not require paying a license fee. This means that you are never locked-in: when you eventually decide it is time to part ways with us, you can still (of course!) access your compressed data.
This also means that you can share compressed files with external parties, which they would be able to decompress without needing a paid license.​
​
Questions? support@genozip.com