Biopandas Versions Save

Working with molecular structures in pandas DataFrames

v0.2.4

5 years ago
New Features
Changes
  • Minor adjustments to support to address deprecation warnings in pandas >= 23.0
Bug Fixes

v0.2.3

6 years ago
New Features
  • N/A
Changes
  • PandasMol2.distance_df was added as a static method that allows distance computations based for external data frames with its behavior otherwise similar to PandasMol2.distance.
  • PandasPdb.distance_df was added as a static method that allows distance computations based for external data frames with its behavior otherwise similar to PandasPdb.distance.
  • PandasPdb.distance now supports multiple record sections to be considered (e.g., records=('ATOM', 'HETATM') to include both protein and ligand in a query. Now also defaults to records=('ATOM', 'HETATM') for concistency with the impute method.
  • PandasPdb.get(...) now supports external data frames and lets the user specify the record section to be considered (e.g., records=('ATOM', 'HETATM') to include both protein and ligand in a query. Now also defaults to records=('ATOM', 'HETATM') for concistency with the impute method.
  • The section parameter of PandasPdb.impute_element(...) was renamed to records for API consistency.
Bug Fixes
  • N/A

v0.2.2

6 years ago
New Features
Changes
  • Raises a meaningful error message if attempting to overwrite the df attributes of PandasMol2 and PandasPdb directly.
  • Added PandasPdb.pdb_path and PandasMol2.mol2_path attributes that store the location of the data file last read.
Bug Fixes
  • The rmsd methods of PandasMol2 and PandasPdb don't return a NaN anymore if the array indices of to structures are different.

v0.2.1

7 years ago

0.2.1 (2017-05-11)

Downloads
New Features
Changes
  • The amino3to1 method of biopandas.pdb.PandasPDB objects now returns a pandas DataFrame instead of a pandas Series object. The returned data frame has two columns, 'chain_id' and 'residue_name', where the former contains the chain ID of the amino acid and the latter contains the 1-letter amino acid code, respectively.
  • Significant speed improvements of the distance method of both PandasPdb and PandasMol2 (now about 300 percent faster than previously).
Bug Fixes
  • The amino3to1 method of biopandas.pdb.PandasPDB objects now handles multi-chain proteins correctly.
  • The amino3to1 method of biopandas.pdb.PandasPDB objects now also works as expected if the 'ATOM' entry DataFrame contains disordered DataFrame indices or duplicate DataFrame index values.

v0.2.0

7 years ago
Downloads
New Features
  • Added an amino3to1 method to PandasPdb data frames to convert 3-amino acid letter codes to 1-letter codes.
  • Added a distance method to PandasPdb data frames to compute the Euclidean distance between atoms and a reference point.
  • Added the PandasMol2 class for working with Tripos MOL2 files in pandas DataFrames.
Changes
  • PandasPDB was renamed to PandasPdb.
  • Raises a warning if PandasPdb is written to PDB and ATOM and HETAM section contains unexpected columns; these columns will now be skipped.
Bug Fixes

v0.1.5

7 years ago

0.1.5

  • Included test data in the PyPI package; add install_requires for pandas
  • Added an impute_element method to PandasPDB objects to infer the Element Symbol from the Atom Name column
  • Added two new selection types for PandasPDB ATOM and HETATM coordinate sections: 'heavy' and 'carbon'
  • The 'hydrogen' atom selection in PandasPDB methods is now based on the element type instead of the atom name
  • By default, the RMSD is now computed on all atoms unless a specific selection is defined

v0.1.4

7 years ago

0.1.4 (2015-11-24)

  • Needed to bump the version number due to a bug in the PyPI setup.py script
  • Support for the old pandas sorting syntax (DataFrame.sort vs DataFrame.sort_values) incl. DeprecationWarning

0.1.3 (2015-11-23)

  • Exception handling in tests if PDB goes down (which just happened)
  • Added a separate ANISOU engine to handle those records correctly

0.1.2 (2015-11-23)

  • First Release