Astro Projects Astro Versions Save

Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.

1.8.0

3 months ago

Misc

Replace openlineage-airflow with Apache Airflow OSS provider apache-airflow-providers-openlineage #2103
Bump up minimum version of apache-airflow to 2.7 #2103
Bump up minimum version of Python to 3.8 #2103

Bug Fixes

Limit pandas version to <2.2.0 due to an open issue #57053 for the release pandas==2.2.0 #2105

1.7.0

8 months ago

Feature

Allow users to disable schema check and creation on load_file #1922
Allow users to disable schema check and creation on transform #1925
Add support for Excel files #1978
Support loading metadata columns from stage into table for Snowflake #2023

Bug Fixes

Add openlineage_dataset_uri in databricks #1919
Fix QueryModifier issue on Snowflake #1962
Fix AstroCustomXcomBackend circular import issue #1943

Misc

Add an example DAG for using dynamic task with dataframe #1912
Improve example_load_file DAG tasks names #1958
Limit databricks-sql-connector<2.9.0 #2013

Docs

Add docs about using dtype #1903
Make cleanup operator summary docs smaller #2017

1.6.2

9 months ago

Bug Fixes

Fix Snowflake QueryModifier issue #1962
Add support for Pandas 2, Airflow 2.6.3 and Python 3.11 #1989
Update the WASB connection #1994

1.6.1

11 months ago

Bug Fixes

Fix AstroCustomXcomBackend circular import issue. https://github.com/astronomer/astro-sdk/pull/1943

1.5.4

11 months ago

Bug Fixes

Fix AstroCustomXcomBackend circular import issue. #1943

1.7.0a2

1 year ago

Feature

Allow users to disable schema check and creation on transform #1925
Allow users to disable schema check and creation on load_file #1922

Documentation: https://astro-sdk-python.readthedocs.io/en/latest/configurations.html#configuring-if-schemas-existence-should-be-checked-and-if-the-sdk-should-create-them

1.7.0a1

1 year ago

Feature

Allow users to disable schema check and creation on load_file #1922

1.6.0

1 year ago

Feature

Add MySQL support #1801
Add support to load from Azure blob storage into Databricks #1561
Add argument skip_on_failure to CleanupOperator #1837 by @scottleechua

Add query_modifier to raw_sql, transform and transform_file, which allow users to define SQL statements to be run before the main query statement #1898. Example of how to use this feature can be used to add Snowflake query tags to a SQL statement:

from astro.query_modifier import QueryModifier
@aql.run_raw_sql(
    results_format="pandas_dataframe",
    conn_id="sqlite_default",
    query_modifier=QueryModifier(pre_queries=["ALTER team_1", "ALTER team_2"]),
)
def dummy_method():
    return "SELECT 1+1"

Upgrade astro-runtime to 7.4.2 #1878

Bug fix

Raise exception in case larger dataframes than expected are passed to aql.dataframe #1839
Revert breaking change introduced in 1.5.0, re-allowing aql.transform to receive `sql filepath #1879

Docs

Update open lineage documentation #1881

Misc

Support Apache Airflow 2.6 #1899, with internal serialization changes
Add basic tiltifle for local dev #1819

1.6.0a1

1 year ago

Feature

Add query_modifier to raw_sql, which users can use to define SQL statements to be run before and after the main query statement #1898

Example:

    from astro.query_modifier import QueryModifier
    @aql.run_raw_sql(
        results_format="pandas_dataframe",
        conn_id="sqlite_default",
        query_modifier=QueryModifier(pre_queries=["ALTER team_1", "ALTER team_2"]),
    )
    def dummy_method():
        return "SELECT 1+1"

1.5.3

1 year ago

Bug fix:

Support using SQL operators (run_raw_sql, transform, dataframe) to convert a Pandas dataframe into a table when using a DuckDB in-memory database. #1831

Thanks to @pgzmnk for reporting the issue!