PetaGene decreases the size of genomic data, reducing storage costs and data transfer times without compromising data quality.

Save storage

Keep using your BAM, CRAM and FASTQ files, but use just a fraction of the space.

Start saving

Faster transfers

Accelerate transfers to/from the cloud, and share data with collaborators.

Move data faster

Remote access

Give collaborators accelerated remote access to your genomics data.

Collaborate better

Software for smaller, faster genomics data

PetaGene software addresses challenges caused by growing volumes of genomics data. Developed by an award-winning team from the University of Cambridge, PetaGene grew out of a project exploring new storage and compression approaches in collaboration with the European Bioinformatics Institute. It achieves up to a 6x reduction in both storage costs and data transfer times compared to BAM and gzipped FASTQ files - this is a 96% reduction compared to raw FASTQ files. It transparently integrates with existing storage infrastructure and bioinformatics pipelines.

Award winning innovation

"The judges chose a new product that could give you millions of dollars worth of storage savings right now, a product that several of our judges wanted to go buy immediately after lunch."
Allison Proffitt, Editorial Director of Bio-IT World


PetaGene won Best of Show at Bio-IT World 2016 in Boston, for its PetaSuite compression tools in the category for optimising speed and storage, beating out 46 competitors including EMC, Clever Safe (IBM), Avere, and others from 190 total exhibiting companies. Bio-IT World is the premiere conference for IT in the Life Sciences.

How it Works

Superior Compression

PetaGene allows researchers to focus on what is most important to them and to patients: data analysis. PetaGene greatly reduces the footprint of genomics datasets in FASTQ.gz and BAM by up to 6x while preserving genotyping accuracy. With PetaGene, researchers can reduce hardware storage costs to as little as a quarter of the original cost and experience lossless compression.
 
 

Easy Integration

We've worked hard to make PetaGene integration seamless, with no need for a separate mount or volume. Just compress your BAM or FASTQ files where they are, and you can continue using the exact same filenames but with up to a 6x size reduction. Our compression even preserves access control permissions, extended attributes, and timestamps. Your tools and pipelines won’t even know that anything has changed.
 

Faster Transfers

PetaGene improves collaboration between researchers by enabling faster transfers of genomics datasets with its streaming compression. PetaGene also enables WAN acceleration for fast direct access by remote collaborators.
 

Faster Transfers

PetaGene improves collaboration between researchers by enabling faster transfers of genomics datasets with its streaming compression. PetaGene also enables WAN acceleration for fast direct access by remote collaborators.
 
 
 
 

Open Access

No lock-in. Free decompression and accessibility updates. Use PetaGene software to compress and easily distribute your data to others, who can freely access PetaGene-compressed files as ordinary BAM or FASTQ files.

Customers & Collaborators

"Handling the enormous amount of data we receive from genome sequencing is a huge challenge in our group as we analyse data from more than 10,000 human genomes...PetaGene’s solutions allow us to easily store, use, and visualise the sequencing data at a fraction of the cost."

Dr. Chris Penkett, Head of Pipelines for the 10K NIHR Rare Disease Genomes Project