AWS Data Wrangler Versions

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

2.19.0

1 year ago

Noteworthy

  • Glue Data Quality is now supported; check out the tutorial 🔥
  • Delta lake support by @fvaleye
  • New DynamoDB read_items method by @a-slice-of-py

Features & enhancements

Bug fixes

Documentation

Tests

New Contributors

Thanks

We thank the following contributors/users for their work on this release: @jaidisido, @kukushking, @LeonLuttenberger, @cnfait, @malachi-constant, @mdavis-xyz, @dydc, @enricomarchesin

Full Changelog: https://github.com/aws/aws-sdk-pandas/compare/2.18.0...2.19.0

2.18.0

1 year ago

Noteworthy

Features & enhancements

Bug fixes

Documentation

Tests

New Contributors

Thanks

We thank the following contributors/users for their work on this release: @lucasasmith, @vikramsg, @mycaule, @pal0064, @LeonLuttenberger, @cnfait, @malachi-constant, @kukushking, @jaidisido

Full Changelog: https://github.com/aws/aws-sdk-pandas/compare/2.17.0...2.18.0

3.0.0rc2

1 year ago

What's Changed

Full Changelog: https://github.com/aws/aws-sdk-pandas/compare/3.0.0rc1...3.0.0rc2

3.0.0rc1

1 year ago

What's Changed

Full Changelog: https://github.com/aws/aws-sdk-pandas/compare/3.0.0b3...3.0.0rc1

3.0.0b3

1 year ago

What's Changed

Full Changelog: https://github.com/aws/aws-sdk-pandas/compare/3.0.0b2...3.0.0b3

3.0.0b2

1 year ago

What's Changed

Full Changelog: https://github.com/aws/aws-sdk-pandas/compare/3.0.0b1...3.0.0b2

3.0.0b1

1 year ago

What's Changed

Full Changelog: https://github.com/aws/aws-sdk-pandas/compare/3.0.0a2...3.0.0b1

2.17.0

1 year ago

New Functionalities

Enhancements

  • Returning empty DataFrame for empty TimeStream query #1430
  • Added support for INSERT IGNORE for mysql.to_sql #1429
  • Added use_column_names to redshift.copy akin to redshift.to_sql #1437
  • Enable passing kwargs to redshift.connect #1467
  • Add timestream_endpoint_url property to the config #1483
  • Add support for upserting to an empty Glue table #1579
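As an illustration of the `use_column_names` enhancement (#1437), here is a minimal sketch of how the option might be passed to `wr.redshift.copy`. The helper names `build_copy_kwargs` and `copy_dataframe`, the staging path, and the IAM role are hypothetical placeholders; only `wr.redshift.copy` and `use_column_names` come from the note above.

```python
from typing import Any, Dict


def build_copy_kwargs(table: str, schema: str, path: str, iam_role: str) -> Dict[str, Any]:
    """Collect keyword arguments for wr.redshift.copy (hypothetical helper)."""
    return {
        "table": table,
        "schema": schema,
        "path": path,              # S3 prefix used to stage the files
        "iam_role": iam_role,      # role Redshift assumes to read the staged files
        "mode": "append",
        "use_column_names": True,  # option added to redshift.copy in #1437
    }


def copy_dataframe(df: Any, con: Any, **kwargs: Any) -> None:
    """Run the COPY; awswrangler is imported lazily so the helper above has no dependencies."""
    import awswrangler as wr

    wr.redshift.copy(df=df, con=con, **kwargs)
```

A call would look like `copy_dataframe(df, con, **build_copy_kwargs("my_table", "public", "s3://my-bucket/stage/", "arn:aws:iam::123456789012:role/my-role"))`, where every value is a placeholder.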

Documentation

  • Fix typos in documentation #1434

Bug Fix

  • validate_schema=True for wr.s3.read_parquet breaks with partition columns and dataset=True #1426
  • wr.neptune.to_property_graph failing for Neptune version 1.1.1.0 #1407
  • ValueError when using opensearch.index_df with documents with an array field #1444
  • Missing catalog_id in wr.catalog.create_database #1480
  • Check for pair of brackets in query preparation for Athena cache #1529
  • Fix wrong type hint for TagColumnOperation in quicksight.create_athena_dataset #1570
  • s3.to_json compression parameters is passed twice when dataset=True #1585
  • Cast Athena array, map & struct types to pandas object #1581
  • In the OpenSearch module, use SSL only for HTTPS (port 443) #1603
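The OpenSearch fix (#1603) ties SSL to the port. The rule can be sketched as below; this is an illustration of the behaviour described in the note, not the library's actual code, and the function name `use_ssl_for_port` is hypothetical.

```python
def use_ssl_for_port(port: int) -> bool:
    """Return True only for the HTTPS port, mirroring the rule stated in #1603."""
    return port == 443
```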

Noteworthy

AWS Lambda Managed Layers

Since the last release, the library has been accepted as an official SDK for AWS and rebranded as AWS SDK for pandas 🚀. The module names in Python will remain the same. One noteworthy change, however, is that the AWS Lambda Managed Layer has been renamed from AWSDataWrangler to AWSSDKPandas.

You can view the ARN value for the layers here.
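If layer ARNs are stored in configuration, the rename can be applied with a small helper. This is a sketch only: the function name `rename_layer_arn`, the account ID, and the example ARN are hypothetical, and the layer version number under the new name may differ, so the final value should be checked against the published ARN list.

```python
OLD_LAYER_NAME = "AWSDataWrangler"
NEW_LAYER_NAME = "AWSSDKPandas"


def rename_layer_arn(arn: str) -> str:
    """Swap the old managed layer name for the new one in a Lambda layer version ARN."""
    parts = arn.split(":")
    # Expected shape: arn:aws:lambda:<region>:<account>:layer:<name>:<version>
    if len(parts) == 8 and parts[5] == "layer" and parts[6].startswith(OLD_LAYER_NAME):
        parts[6] = NEW_LAYER_NAME + parts[6][len(OLD_LAYER_NAME):]
    # Anything that does not match the expected shape is returned unchanged.
    return ":".join(parts)
```

For example, `rename_layer_arn("arn:aws:lambda:us-east-1:123456789012:layer:AWSDataWrangler-Python39:1")` keeps the region, account, and suffix and changes only the layer name prefix.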

PyArrow 7 Support

⚠️ For platforms without PyArrow 7 support (e.g. MWAA, EMR, Glue PySpark Job):

pip install pyarrow==2 awswrangler
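To see which PyArrow major version a given platform actually provides before pinning, a check along these lines may help. It uses only the standard library; the helper name `installed_major` is hypothetical.

```python
from importlib import metadata
from typing import Optional


def installed_major(package: str) -> Optional[int]:
    """Return the major version of an installed package, or None if it cannot be determined."""
    try:
        return int(metadata.version(package).split(".")[0])
    except (metadata.PackageNotFoundError, ValueError):
        # Package is missing, or its version string does not start with an integer.
        return None


if __name__ == "__main__":
    print("pyarrow major version:", installed_major("pyarrow"))
```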

Thanks

We thank the following contributors/users for their work on this release:

@bechbd, @maxispeicher, @timgates42, @aeeladawy, @KhueNgocDang, @szemek, @malachi-constant, @cnfait, @jaidisido, @LeonLuttenberger, @kukushking

3.0.0a2

1 year ago

This is a pre-release for the Wrangler@Scale project

What's Changed

Full Changelog: https://github.com/awslabs/aws-data-wrangler/compare/3.0.0a1...3.0.0a2

3.0.0a1

1 year ago

This is a pre-release for the Wrangler@Scale project

What's Changed

Full Changelog: https://github.com/awslabs/aws-data-wrangler/compare/2.16.1...3.0.0a1