Gazpacho Versions Save

🥫 The simple, fast, and modern web scraping library

v1.1

3 years ago

1.1 (2020-10-09)

  • Feature: now PEP 561 compliant
  • Feature: Soup now automatically formats and indents (pretty print) HTML where possible

v1.0

3 years ago

1.0 (2020-09-24)

  • Feature: gazpacho is now fully baked with type hints (thanks for the suggestion @ju-sh!)
  • Feature: Soup.get("url") alternative initializer
  • Fixed: .find is now able to capture malformed void tags (<img />, vs. <img>) (thanks for the Issue @mallegrini!)
  • Renamed: .find(..., strict=) is now find(..., partial=)
  • Renamed: .remove_tags is now .strip

v0.9.4

3 years ago

0.9.4 (2020-07-07)

  • Feature: automagical json-to-dictionary return behaviour for get
  • Improvement: automatic missing URL protocol inference for get
  • Improvement: condensed HTTPError Exceptions

v0.9.3

4 years ago

0.9.3 (2020-04-29)

  • Updated the README (thanks for flagging the lxml error, @koaning!)

v0.9.2

4 years ago

0.9.2 (2020-04-21)

  • Fixed find(..., mode='first') to return None and not an IndexError (thanks, psyonara!)

v0.9.1

4 years ago
  • Fixed UnicodeEncodeError lurking beneath get (thanks for the "Issue" mlehotay!)
  • Fixed find method to properly handle non-closing HTML tags

v0.9

4 years ago
  • Added the remove_tags method for isolating formatted text in a block of HTML

v0.8.1

4 years ago

Changelog

  • Fixed empty element tag counting within the find method

v0.8

4 years ago

Changelog

  • Added mode argument to the find method to adjust return behaviour (defaults to mode='auto')
  • Enabled strict attribute matching for the find method (defaults to strict=False)