A comprehensive library for computational molecular biology
writer_iter()
to some File
classes
sequence.io.FastaFile
, sequence.io.FastqFile
and all
classes inheriting from structure.io.TrajectoryFile
symbol_spacing
parameter to sequence.graphics.plot_alignment
sequence.graphics.plot_alignment_similarity_based
and
sequence.graphics.plot_alignment_types_based
structure.io.NetCDFFile
, previously it was always None
sequence.SequenceProfile
probability_matrix()
to compute the symbol probabilities
from total frequencieslog_odds_matrix()
to calculate a position weight matrixsequence_probability()
and sequence_score()
to assess the
adherence of a given sequence to a profilestructure.BondList
structure.BondType.AROMATIC_TRIPLE
to support triple bonds
in aromatic systemsstructure.BondList.without_aromaticity()
to convert bonds with
structure.BondType.AROMATIC_<order>
to structure.BondType.<order>
structure.info.bond_order()
structure.BondType
of the bond between two atoms in a
residuestructure.info.bond_order()
structure.info.bond_order()
structure.io.pdb.PDBFile.get_structure()
raising an exception,
if the PDB file contains an invalid CRYST1
record, now a warning is printedCONECT
records written by structure.io.pdb.PDBFile.set_structure()
CONECT
record was created, if a atom has 4
bond partnersapplication.muscle.MuscleApp
with nucleotide sequencessequence.graphics.plot_sequence_logo()
requires now a
sequence.SequenceProfile
, usage of sequence.align.Alignment
still works,
but is deprecateddatabase.rcsb.FieldQuery
has now an optional molecular_definition
parameter to allow searches in molecule related fields.database.rcsb.Query
objects can be combined with logical
operatorsapplication.blast.BlastWebApp
now also reports hit sequences containing
selenocysteine without errors (#344)Atom
, AtomArray
and AtomArrayStack
can be unpickled (#349)sequence.graphics.plot_sequence_logo()
supports now all Matplotlib backends
(#345)application.tantan.TantanApp
for sequence repeat masking with
Tantan
sequence.align.KmerAlphabet
and sequence.align.KmerTable
support spaced
k-mers
sequence.align.local_gapped()
and sequence.align.local_ungapped()
for aligning sequences locally with X-drop heuristicEValueEstimator
for calculation of expect values (E-values)
from alignment scoressequence.LetterAlphabet.extends()
sequence.SequenceProfile
for representing sequence profiles by means
of a symbol frequency table
sequence.SequenceProfile.from_alignment()
creates a profile from an
alignmentsequence.SequenceProfile.to_consenus()
creates a consensus sequencesequence.align.get_codes()
and
sequence.align.get_symbols()
CONECT
records in
structure.io.pdb.PDBFile
(#329)'X'
or '*'
(#322)sequence.graphics,plot_nucleotide_secondary_structure()
is now able to set
the color of symbols (#333)repr()
function (#290)sequence.CodonTable
structure.info.all_residues()
, that gives all residue names from the
Chemical Component Dictionary
structure.info
structure.io.mol.MOLFile
to support MOL and SDF files for
small molecule structure datadatabase.uniprot
for UniProt database support
database.uniprot.search()
searches for Uniprot IDs that match a given
database.uniprot.Query
database.uniprot.fetch()
downloads the file corresponding to the given
Uniprot ID.structure.pseudoknots()
by using NetworkX
to identify conflicting regions (#289)structure.BondType
enum values for more precise description of
aromatic bonds
BondType.AROMATIC
is replaced by
BondType.AROMATIC_SINGLE
and BondType.AROMATIC_DOUBLE
structure.BondList.remove_aromaticity()
converts
BondType.AROMATIC_SINGLE
to BondType.SINGLE
and
BondType.AROMATIC_DOUBLE
to BondType.DOUBLE
in-placestructure.io.PDBFile
,
if the MODEL
line is missing in the filestructure.io.PDBFile
'+2'
instead of the correct 2+
atom_i
parameter when reading a trajectory via
structure.io.load_structure()
(#308)structure.CellList
radius
parameter of structure.CellList.get_atoms()
was equal to the
cell_size
parameter of the constructor (#311)structure.AtomArray
and
structure.AtomArrayStack
preserves the NumPy dtype
application.autodock.VinaApp
uses vina
executable to perform
docking of ligand to a receptor moleculestructure.io.pdbqt.PDBQTFile
class for writing input for
and reading output from vina
sequence.align.KmerAlphabet
encodes a sequence.Sequence
into
k-mers
sequence.align.KmerTable
is able to find k-mer matches between
sequence in an efficient mannersequence.align.SimilarityRule
allows matching similar instead of
exact k-mer matches via a sequence.align.KmerTable
sequence.align.align_banded()
performs a heuristic local or
semi-global sequence alignment within a defined diagonal bandsequence.align.remove_terminal_gaps()
functionapplication.sra.FastqDumpApp.get_file_paths()
methodsequence.Sequence.get_symbol_frequency()
sequence.NucleotideSequence.complement()
sequence.Sequence.reverse()
can optionally create an array view
instead of a copyapplication.sra.FastqDumpApp.get_file_paths()
only parses
downloaded PDBQT files, if requiredapplication.viennarna.RNAfoldApp
uses RNAfold
to predict the minimum
free energy secondary structure of an RNA sequenceapplication.viennarna.RNAplotApp
uses RNAplot
to calculate the
2D coordinates for base symbols in a secondary structure plotstructure.graphics.plot_nucleotide_secondary_structure()
for
visualization of an RNA secondary structure via Matplotlib
RNAplot
structure.find_connected()
structure.partial_charges()
structure.BondList.remove_bonds_to
structure.get_molecule_indices()
, structure.get_molecule_masks()
and
structure.molecule_iter
select atoms belong to a single molecule, i.e.
atoms that are connected via bondsas_graph()
method to sequence.phylo.Tree
and structure.BondList
for conversion into a NetworkX Graph
find_rotatable_bonds()
function uses NetworkX to identify
rotatable bonds, i.e. single bonds that are not part of a cycle,
in structures with a structure.BondList()
networkx
package as dependencystructure.io.pdbx.set_structure()
does not convert the residue ID -1
to "."
anymorestructure.io.pdb.PDBFile
structure.io.pdbx.set_structure()
supports now atom IDs larger than
one millionapplication.dssp.DsspApp
unable to work with multicharacter chain
identifiers (#264)application.muscle.MuscleApp
sometimes not finishing for long
alignments (#273)structure.BondList
potentially ending in a broken state after
indexing it with an unordered index arraystructure.partial_charges()
uses bond order instead of number of bond
partners to calculate correct charges for atoms with positive or negative
formal chargestructure.info.nucleotide_names()
structure.base_pairs()
structure.base_pairs()
and
structure.filter_nucleotides()
structure.base_pairs_edge()
and
structure.base_pairs_glycosidic_bond()
for further
characterization of base pairsstructure.base_stacking()
for identification of pi-stacking
of nucleobasesstructure.pseudoknots()
for identification of pseudoknots
in a given list of base pairingsstructure.dot_bracket()
,
structure.dot_bracket_from_structure()
and
structure.base_pairs_from_dot_bracket()
for conversion of
base pairs to dot-bracket-letter notation and vice versastructure.BondList
:
get_all_bonds()
for obtaining the bonds atoms for each atom
in the structureadjacency_matrix()
and bond_type_matrix()
structure.partial_charges()
for partial charge calculation
using the PEOE methodstructure.info.standardize_order()
, that reorders atoms in
residues into the PDB standard atom order for the respective residuestructure.graphics.plot_ball_and_stick_model()
structure.get_residue_positions()
sequence.io.genbank.get_raw_sequence()
which returns the
sequence as stringstructure.hbond()
raises a warning if an input structure without
hydrogen atoms is given (#241)get_sequence()
and get_sequences()
of biotite.sequence.io.fasta
and biotite.sequence.io.fastq
convert selenocysteine to cysteine (#232)biotite.sequence.io.fasta.get_sequence()
and
biotite.sequence.io.fasta.get_sequences()
try to create (#232):
sequence.NucleotideSequence
sequence.ProteinSequence
application
subpackage are removed via
os.remove()
due to issues on Windows (#243)structure.base_pairs()
for structures that contian residues,
that are not in the PDB standard order (#237)structure.graphics.plot_atoms()
structure.BondList
constructor
(related to #252):
structure.BondList
indexing with an unsorted index array (#238)charge
annotation of molecules obtained via
structure.info.residue()
(#254)sequence.ProteinSequence.get_molecular_weight()
methodapplication.sra
subpackage as interface to NCBI SRA tools
FastqDumpApp
is used for fetching FASTQ files from the NCBI SRA
iter_read()
static method to sequence.io.fasta.FastaFile
and sequence.io.fasta.FastqFile
set_sequence
and set_sequences
in sequence.io.fasta
and sequence.io.fasta
support writing RNA sequences with the new
as_rna
parameter_loop
category labels in
PDBx/mmCIF files (#224)1
(#227)structure.density()
sequence.io.fastq.get_sequence()
and sequence.io.fastq.get_sequences()
properly handle RNA and ambiguous sequences nowstart
parameter in structure.renumber_atom_ids
and
structure.renumber_res_ids
database.rcsb.fetch()
database.rcsb.Query
classes, that reflect the entirety of the new
search API, including sequence, sequence motif and structure searches
database.rcsb.Query
objects can be combined/negated using the
operators |
, &
and ~
return_type
, sort_by
and range
parameter to
database.rcsb.search()
database.rcsb.count()
function to count the number of results a
database.rcsb.Query
would yield in a less costly way than
database.rcsb.search()
biotite.structure.BondList
sequence.Sequence.alphabet
property, that is equivalent to
sequence.Sequence.get_alphabet()
fastq.get_sequence()
, fastq.get_sequences()
,
fastq.set_sequence()
and fastq.set_sequences()
sequence.io.fasta.FastaFile
sequence.AlphabetMapper
sequence.Alphabet.is_letter_alphabet()
methodsequence.io.load_sequence()
, sequence.io.load_sequences()
,
sequence.io.save_sequence()
and sequence.io.save_sequences()
that derive
the appropriate File
class from the suffix of the file name.omit_chain
parameter has been removed from database.rcsb.search()
database.rcsb.Query
classes have been removedpython setup.py test
and python setup.py build_sphinx
commands,
please use pytest
and sphinx-build
directly insteadsequence.NucleotideSequence.alphabet
to
sequence.NucleotideSequence.alphabet_unamb
sequence.io.fastq.FastqFile
returns its entries only as str
instead of
sequence.NucleotideSequence
for consistency with
sequence.io.fastq.FastaFile
sequence.io.fastq.FastqFile.get_sequence()
is deprecatedsequence.io.fastq.FastqFile.get_seq_string()
returns the
sequence as a str
instead of a sequence.NucleotideSequence
expect_looped
parameter in
structure.io.pdbx.PDBxFile.get_category()
structure.io.pdbx.PDBxFile
, that was raised, if a PDBx
field and its single-line value are in separate linesbiotite.structure.BondList
chain_id
dtype from 'U3'
to 'U4'
(#215)