Core Features
See also: advanced features, premium features
Summary
​
Genozip was created with the objective of revolutionizing compression of genomic data. In the vast majority of cases it achieves better compression than any other compressor available, commercial or open source, and often by a large margin.
​
How good is Genozip?
​
Below are some benchmarks across a wide range of data. Note that compression results may vary considerably depending on the specific details of the data - you are encouraged to test Genozip on your own data (free 30-day evaluation).
FASTQ
Genozip compression of .fastq.gz files generated by a variety of sequencers - showing their relative size before and after Genozip compression. Genozip can compress with or without a reference; however, using --reference is always advisable for FASTQ files. --optimize is available with Genozip Enterprise. The files tested were obtained from (in order): 1. (unpublished) 2. here 3. here
BAM (or CRAM or SAM)
Genozip compression of .bam files generated by a variety of sequencers and aligners - showing their relative size before and after Genozip compression. Genozip can compress with or without a reference; however, using --reference improves compression of BAM files. --optimize is available with Genozip Enterprise. The files tested were obtained from (in order): 1. here 2. (unpublished) 3. here 4. here
VCF
Genozip compression of a variety of .vcf.gz files - showing their relative size before and after Genozip compression. Compression was done using the --best command line option, and for the two GVCF files, using the --reference option as well. The files tested were obtained from (in order): 1. (unpublished) 2. here 3. here 4. here 5. here 6. here.
Losslessness
​
Genozip is a lossless compressor, meaning that when a file is decompressed, it is exactly identical to the source data, verifiable by MD5.
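The MD5 check mentioned above can be reproduced independently of any compressor. Below is a minimal sketch in Python, assuming you have both the original file and a copy restored by decompression on disk. The function names and file paths are hypothetical and are not part of Genozip; how Genozip itself computes or stores its digest is not described here.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that large genomic files do not need to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps calling read() until it returns b"" (EOF)
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_lossless_round_trip(original_path, restored_path):
    """True if the restored file is byte-for-byte consistent with the
    original, as judged by comparing MD5 digests."""
    return md5_of_file(original_path) == md5_of_file(restored_path)
```

For example, after compressing and then decompressing a file, `is_lossless_round_trip("sample.vcf", "sample.restored.vcf")` (hypothetical file names) would be expected to return `True` for a lossless round trip.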
​
Features
​
⦾ Compresses (almost) all file types: while the vast majority of data that is compressed with Genozip consists of FASTQ, BAM or VCF files, Genozip is designed to compress many other genomic formats. In fact, Genozip can compress any file (with some exceptions), usually better and faster than common general purpose compressors.
​
⦾ Archive entire directories with the --tar option: you can compress entire directories, including all subdirectories and files (genomic or not) directly into a standard tar file, in preparation for delivering it to a customer or archiving it.
​
⦾ Partial decompression ("random access"): It is possible to access parts of a compressed file without decompressing the entire file.
​
⦾ This is just a partial list - see detailed core feature set here.
​
⦾ You might also want to check out our advanced features and premium features.
​
Free uncompressing forever
​​
Uncompressing Genozip-compressed files is free forever. This means that you are never locked in: if you eventually decide it is time to part ways with us, you can still (of course!) access your compressed data.
​
This also means that you can share compressed files with external parties, who will be able to uncompress them without needing a paid license.
​
Questions? support@genozip.com