🧬 gget enables efficient querying of genomic reference databases
Fix Windows bug in gget elm setup
gget search
and gget ref
now also support fungi 🍄, protists 🌝, and invertebrate metazoa 🐝 🐜 🐌 🐙 (in addition to vertebrates and plants)
gget cosmic
gget enrichr
: Fix duplicate scatter dots in plot when pathway names are duplicatedgget elm
:
gget
modules.gget setup
: Use the out
argument to specify a directory the ELM database will be downloaded into. Completes this feature request.gget diamond
: The DIAMOND command is now run with --ignore-warnings
flag, allowing niche sequences such as amino acid sequences that only contain nucleotide characters and repeated sequences. This is also true for DIAMOND alignments performed within gget elm
.gget ref
and gget search
back-end change: the current Ensembl release is fetched from the new release file on the Ensembl FTP site to avoid errors during uploads of new releases.gget search
:
--ftp
) are saved in txt file format instead of json.gget ref
:
list_iv_species
to list all available invertebrate species (can be combined with the release
argument to fetch all species available from a specific Ensembl release)gget info
: Return a logging error message when the NCBI server fails for a reason other than a fetch fail (this is an error on the server side rather than an error with gget
)BeautifulSoup
gget elm
: Remove false positive and true negative instances from returned resultsgget elm
: Add expand
argumentgget muscle
to add a tutorial on how to visualize sequences with sequence name lengths + slight change to returned visualization so it's a bit more robust to varying sequence namesgget muscle
now also allows a list of sequences as input (as an alternative to providing the path to a FASTA file)gget cellxgene
(fixes bug)gget seq
: Allow missing gene names (fixes https://github.com/pachterlab/gget/issues/107)gget enrichr
: Use arguments kegg_out
and kegg_rank
to create an image of the KEGG pathway with the genes from the enrichment analysis highlighted (thanks to this PR by Noriaki Sato)gget elm
and gget diamond
co-authored-by: @anhchi172
Resolves #90 , resolves #9
Co-authored-by: @anhchi172
release
argument to gget searchAlso see: https://pachterlab.github.io/gget/updates.html
Co-contributor: @anhchi172
Moved dependencies for modules gget gpt and gget cellxgene from automatically installed requirements to gget setup. Updated gget alphafold dependencies for compatibility with Python >= 3.10. Added census_version argument to gget cellxgene.
Updated gget search to function correctly with new Pandas version 2.0.0 (released on April 3rd, 2023) as well as older versions of Pandas
Updated gget info with new flags uniprot and ncbi which allow turning off results from these databases independently to save runtime (note: flag ensembl_only was deprecated)
All gget modules now feature a -q / --quiet (Python: verbose=False) flag to turn off progress information
Co-author of this release: @anhchi172