Mango

Data Exploration on Large Genomic Datasets

Alyssa Morrow
Eric Tu
Frank Austin Nothaft
Anthony D. Joseph
David A. Patterson

Current genomics visualization tools are designed for single-node environments and lack the computational resources to provide interactive speeds. The 1000 Genomes Project provides 1.6 terabytes of variant data and over 14 terabytes of alignment data. However, a typical genomic visualization materializes less than 10 kbp, approximately 3.3e-4% of the roughly 3 Gbp human genome. Mango is a visualization browser that selectively materializes and organizes genomic data to provide fast in-memory queries. Mango materializes data from persistent storage as the user requests different regions of the genome. This data is efficiently partitioned and organized in memory using interval trees, which enable quick range queries over genomic data.
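
As a rough illustration of the kind of range query an interval tree supports, the Python sketch below builds a balanced, augmented search tree over half-open intervals and returns every interval overlapping a requested region. It is not Mango's implementation: the names `Node`, `build`, and `query`, the half-open coordinate convention, and the sample reads are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical record: (start, end, label), half-open [start, end)
Interval = Tuple[int, int, str]


@dataclass
class Node:
    interval: Interval
    max_end: int  # largest end coordinate found anywhere in this subtree
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build(intervals: List[Interval]) -> Optional[Node]:
    """Build a balanced tree keyed on start position."""
    items = sorted(intervals)

    def rec(lo: int, hi: int) -> Optional[Node]:
        if lo >= hi:
            return None
        mid = (lo + hi) // 2
        node = Node(items[mid], items[mid][1])
        node.left = rec(lo, mid)
        node.right = rec(mid + 1, hi)
        for child in (node.left, node.right):
            if child is not None:
                node.max_end = max(node.max_end, child.max_end)
        return node

    return rec(0, len(items))


def query(root: Optional[Node], start: int, end: int) -> List[Interval]:
    """Return all intervals overlapping [start, end), ordered by start."""
    hits: List[Interval] = []

    def visit(node: Optional[Node]) -> None:
        # Prune: nothing in this subtree ends after the query start.
        if node is None or node.max_end <= start:
            return
        visit(node.left)
        s, e, _ = node.interval
        if s < end and e > start:
            hits.append(node.interval)
        # Every start in the right subtree is >= s, so skip it when s >= end.
        if s < end:
            visit(node.right)

    visit(root)
    return hits


# Made-up alignment records on a single chromosome
reads = [
    (100, 150, "r1"),
    (120, 170, "r2"),
    (400, 450, "r3"),
    (10_000, 10_050, "r4"),
]
tree = build(reads)
region_hits = query(tree, 140, 410)
```

Storing the maximum end coordinate in each node is what lets the query skip whole subtrees that cannot overlap the requested region, so a lookup touches only the parts of the tree near the viewed window rather than every record held in memory.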

Release Date: January 2017
Status:
Availability:
Data type:
Techniques: 2D
Software: Installed
Technology:
Platform: Linux, Mac OSX, Windows
Requirements:

Project development

Institution: UC Berkeley, CA