High performance data storage for importing, querying and transforming variants.
Master | Develop |
---|---|
GenomicsDB is built on top of a fork of htslib and a tile-based array storage system for importing, querying and transforming variant data. Variant data is sparse by nature (sparse relative to the whole genome) and using sparse array data stores is a perfect fit for storing such data. GenomicsDB is a highly performant scalable data storage written in C++ for importing, querying and transforming genomic variant data. See genomicsdb.readthedocs.io for documentation and usage.
Included are
GenomicsDB is packaged into gatk4 and benefits qualitatively from a large user base.
GenomicsDB is open source and all participation is welcome. GenomicsDB is released under the MIT License and all external contributors are expected to grant an MIT License for their contributions.
Please ensure that the code is well documented in Javadoc style for Java/Scala. For Java/C/C++ code formatting, roughly adhere to the Google Style Guides. See GenomicsDB Style Guide