Cost & Time Savings

Storage costs and data transfer times reduced by 60-90%


Lossless Streaming Compression

Full validation and MD5 matching


FIPS 140-2 compliant AES-256 regional encryption


Transparent Usage

Access compressed files in their original format. Your tools and pipelines won’t even know that anything has changed.


Speeds up Analysis

Reduces I/O, which dominates performance

Audit data use

Searchable cryptographic ledger of how the data is accessed and used.


No lock-in

Free updates and decompression tools. Distribute your compressed data to others.


Easy IT Deployment

Software and tools run in user mode. No security issues. No sysadmin headaches.

Want to know how much you can save?

Award Winning Innovation

PetaGene has won “Best of Show” at Bio-IT World, the premier conference for IT in the Life Sciences, three times. In 2016, PetaSuite won against 46 competitors in the category for optimising speed and storage. In 2018, PetaSuite Cloud Edition (CE) won in the infrastructure and hardware category, and in 2019 PetaGene’s latest security innovation, PetaSuite Protect, won the ‘Nailed It’ award against 30 competing products.

“The judges chose a new product that could give you millions of dollars worth of storage savings right now, a product that several of our judges wanted to go buy immediately after lunch.”

Allison Proffitt

Boston, USA
Editorial Director of Bio-IT World

Bio-IT World Best of Show Winner logos


“By using PetaSuite compression software for our data we have achieved our primary aim of dramatically increasing our storage capacity. This means that we do not need to spend precious resources on replacing or adding to it. The PetaGene team were responsive to our needs, including managing the demands of using IGV to efficiently access the compressed data via an Apache server without decompressing the data first.”

Per Sikora

Gothenburg, Sweden
Head of Facility, Clinical Genomics Gothenburg

“We were looking for a reliable NGS compression solution that we could quickly deploy at scale on our large cluster and would allow us to reduce our tier 1 storage needs. We were under time pressure to decide on a solution for funding reasons and PetaGene was willing to go the extra mile to help us. We decided to go with PetaGene as they offer transparent on-the-fly decompression and we estimated that there would be an overall cost saving compared to other solutions.”

Dr Christophe Trefois

Belvaux, Luxembourg
Technical Specialist at Luxembourg Centre for Systems Biomedicine, University of Luxembourg

“Handling the enormous amount of data we receive from genome sequencing is a huge challenge in our group as we analyse data from more than 10,000 human genomes... PetaGene’s solutions allow us to easily store, use and visualise the sequencing data at a fraction of the cost.”

Dr Chris Penkett

Cambridge, United Kingdom
Head of Pipelines for the 10K NIHR Rare Disease Genomes Project NHS Blood and Transplant & University of Cambridge

Customers and Collaborators

Table showing size of files created using FASTQ.gz, BAM, CRAM and PetaGene compression

How Does it Work?

  • PetaGene supplies multi-threaded Linux software (PetaSuite) that you use to losslessly compress your BAM and FASTQ.gz files for savings of between 60% and 90%, whether on-premises or in the cloud.
  • You never need to decompress the files: our software comes with a user-mode shim (PetaLink) that performs efficient random-access, on-the-fly decompression in memory, so that the files appear under their original filenames in their original format. Because less data has to be read from storage, the I/O savings also improve performance.
  • The Cloud Edition of PetaSuite even allows you to transparently migrate your pipelines to the cloud and/or access remote data as if it were local, without downloading it first.
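
To make the idea of lossless compression with full validation and MD5 matching concrete, here is a minimal sketch of a round-trip check. It uses Python's standard zlib module purely as a stand-in compressor; it is not PetaSuite and says nothing about how PetaSuite works internally. The function names and the toy FASTQ-like record are hypothetical, chosen only for illustration.

```python
import hashlib
import zlib


def md5_hex(data: bytes) -> str:
    """Return the MD5 digest of data as a hex string."""
    return hashlib.md5(data).hexdigest()


def compress_with_validation(original: bytes) -> bytes:
    """Compress, then decompress again and confirm the MD5 still matches.

    zlib is only a stand-in for a real genomic compressor (assumption).
    """
    compressed = zlib.compress(original, 9)
    restored = zlib.decompress(compressed)
    if md5_hex(restored) != md5_hex(original):
        raise ValueError("round trip changed the data; refusing to keep it")
    return compressed


def percent_saved(original_size: int, compressed_size: int) -> float:
    """Storage saving as a percentage of the original size."""
    return 100.0 * (1 - compressed_size / original_size)


# Hypothetical toy input: one FASTQ-like record repeated many times.
record = b"@read1\nACGTACGTACGTACGT\n+\nIIIIIIIIIIIIIIII\n"
original = record * 1000

compressed = compress_with_validation(original)
print(f"{percent_saved(len(original), len(compressed)):.1f}% saved")
```

The toy input is highly repetitive, so the saving it prints is not representative of real sequencing data. The point is the pattern: compressed output is only kept once the decompressed bytes have been shown to match the original checksum.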


Managing NGS Data, a Dell and PetaGene healthcare podcast

Recently, our co-founder Vaughan Wittorff and Phil Sweeney from Dell Technologies sat down to discuss how the use of Next-Generation Sequencing is expanding as costs come down, creating an explosion of NGS processing and resulting data. Find out how PetaGene can address the demands of that scale of data, in a two-part Dell …

HISAT2 benchmarked with PetaGene’s compression and transparent readback tools

HISAT2 (Hierarchical Indexing for Spliced Alignment of Transcripts 2) is a graph-based read mapping tool for both DNA and RNA sequences. HISAT2 enables a fast search through its graph index, mapping reads to the entire human genome along with a large number of variants. Since it is a widely used tool, at PetaGene we have …

PetaGene’s customers have now compressed one million genome files

For PetaGene, the one million genome era is underway. We are pleased to announce that we have reached a landmark: PetaGene’s customers have now compressed over one million genome files. The dramatic drop in the cost of sequencing genomes and the numerous applications of this data to tackle critical diseases such as cancer and rare diseases …