Skip to content

Releases: apache/airflow

Apache Airflow Helm Chart 1.13.1

25 Mar 19:40
helm-chart/1.13.1
ae6fec9
Compare
Choose a tag to compare

Significant Changes

Default Airflow image is updated to 2.8.3 (#38036)

The default Airflow image that is used with the Chart is now 2.8.3, previously it was 2.8.2.

Bug Fixes

  • Don't overwrite .Values.airflowPodAnnotations (#37917)
  • Fix cluster-wide RBAC naming clash when using multiple multiNamespace releases with the same name (#37197)

Misc

  • Chart: Default airflow version to 2.8.3 (#38036)

Apache Airflow 2.8.3

11 Mar 12:53
2.8.3
948ec96
Compare
Choose a tag to compare

Significant Changes

The smtp provider is now pre-installed when you install Airflow. (#37713)

Bug Fixes

  • Add "MENU" permission in auth manager (#37881)
  • Fix external_executor_id being overwritten (#37784)
  • Make more MappedOperator members modifiable (#37828)
  • Set parsing context dag_id in dag test command (#37606)

Miscellaneous

  • Remove useless methods from security manager (#37889)
  • Improve code coverage for TriggerRuleDep (#37680)
  • The SMTP provider is now preinstalled when installing Airflow (#37713)
  • Bump min versions of openapi validators (#37691)
  • Properly include airflow_pre_installed_providers.txt artifact (#37679)

Doc Only Changes

  • Clarify lack of sync between workers and scheduler (#37913)
  • Simplify some docs around airflow_local_settings (#37835)
  • Add section about local settings configuration (#37829)
  • Fix docs of BranchDayOfWeekOperator (#37813)
  • Write to secrets store is not supported by design (#37814)
  • ERD generating doc improvement (#37808)
  • Update incorrect config value (#37706)
  • Update security model to clarify Connection Editing user's capabilities (#37688)
  • Fix ImportError on examples dags (#37571)

Apache Airflow Helm Chart 1.13.0

05 Mar 16:19
helm-chart/1.13.0
f7f005f
Compare
Choose a tag to compare

Significant Changes

Default Airflow image is updated to 2.8.2 (#37704)

The default Airflow image that is used with the Chart is now 2.8.2, previously it was 2.8.1.

New Features

  • Support labels specific to the database migration objects and pods (#37490)

Improvements

  • Flower K8s Probe config (#37528)

Bug Fixes

  • Remove duplicate ports key in webserver service (#37356)
  • Add AIRFLOW_HOME env var to log groomer sidecar (#37588)
  • Skip . path when preparing reproducible packages (#37402)

Misc

  • Default airflow version to 2.8.2 (#37704)

Apache Airflow 2.8.2

26 Feb 09:14
2.8.2
923e910
Compare
Choose a tag to compare

Significant Changes

The allowed_deserialization_classes flag now follows a glob pattern (#36147).

For example if one wants to add the class airflow.tests.custom_class to the
allowed_deserialization_classes list, it can be done by writing the full class
name (airflow.tests.custom_class) or a pattern such as the ones used in glob
search (e.g., airflow.*, airflow.tests.*).

If you currently use a custom regexp path make sure to rewrite it as a glob pattern.

Alternatively, if you still wish to match it as a regexp pattern, add it under the new
list allowed_deserialization_classes_regexp instead.

The audit_logs permissions have been updated for heightened security (#37501).

This was done under the policy that we do not want users like Viewer, Ops,
and other users apart from Admin to have access to audit_logs. The intention behind
this change is to restrict users with less permissions from viewing user details
like First Name, Email etc. from the audit_logs when they are not permitted to.

The impact of this change is that the existing users with non admin rights won't be able
to view or access the audit_logs, both from the Browse tab or from the DAG run.

AirflowTimeoutError is no longer except by default through Exception (#35653).

The AirflowTimeoutError is now inheriting BaseException instead of
AirflowException->Exception.
See https://docs.python.org/3/library/exceptions.html#exception-hierarchy

This prevents code catching Exception from accidentally
catching AirflowTimeoutError and continuing to run.
AirflowTimeoutError is an explicit intent to cancel the task, and should not
be caught in attempts to handle the error and return some default value.

Catching AirflowTimeoutError is still possible by explicitly excepting
AirflowTimeoutError or BaseException.
This is discouraged, as it may allow the code to continue running even after
such cancellation requests.
Code that previously depended on performing strict cleanup in every situation
after catching Exception is advised to use finally blocks or
context managers. To perform only the cleanup and then automatically
re-raise the exception.
See similar considerations about catching KeyboardInterrupt in
https://docs.python.org/3/library/exceptions.html#KeyboardInterrupt

Bug Fixes

  • Sort dag processing stats by last_runtime (#37302)
  • Allow pre-population of trigger form values via URL parameters (#37497)
  • Base date for fetching dag grid view must include selected run_id (#34887)
  • Check permissions for ImportError (#37468)
  • Move IMPORT_ERROR from DAG related permissions to view related permissions (#37292)
  • Change AirflowTaskTimeout to inherit BaseException (#35653)
  • Revert "Fix future DagRun rarely triggered by race conditions when max_active_runs reached its upper limit. (#31414)" (#37596)
  • Change margin to padding so first task can be selected (#37527)
  • Fix Airflow serialization for namedtuple (#37168)
  • Fix bug with clicking url-unsafe tags (#37395)
  • Set deterministic and new getter for Treeview function (#37162)
  • Fix permissions of parent folders for log file handler (#37310)
  • Fix permission check on DAGs when access_entity is specified (#37290)
  • Fix the value of dateTimeAttrFormat constant (#37285)
  • Resolve handler close race condition at triggerer shutdown (#37206)
  • Fixing status icon alignment for various views (#36804)
  • Remove superfluous @Sentry.enrich_errors (#37002)
  • Use execution_date= param as a backup to base date for grid view (#37018)
  • Handle SystemExit raised in the task. (#36986)
  • Revoking audit_log permission from all users except admin (#37501)
  • Fix broken regex for allowed_deserialization_classes (#36147)
  • Fix the bug that affected the DAG end date. (#36144)
  • Adjust node width based on task name length (#37254)
  • fix: PythonVirtualenvOperator crashes if any python_callable function is defined in the same source as DAG (#37165)
  • Fix collapsed grid width, line up selected bar with gantt (#37205)
  • Adjust graph node layout (#37207)
  • Revert the sequence of initializing configuration defaults (#37155)
  • Displaying "actual" try number in TaskInstance view (#34635)
  • Bugfix Triggering DAG with parameters is mandatory when show_trigger_form_if_no_params is enabled (#37063)
  • Secret masker ignores passwords with special chars (#36692)
  • Fix DagRuns with UPSTREAM_FAILED tasks get stuck in the backfill. (#36954)
  • Disable dryrun auto-fetch (#36941)
  • Fix copy button on a DAG run's config (#36855)
  • Fix bug introduced by replacing spaces by + in run_id (#36877)
  • Fix webserver always redirecting to home page if user was not logged in (#36833)
  • REST API set description on POST to /variables endpoint (#36820)
  • Sanitize the conn_id to disallow potential script execution (#32867)
  • Fix task id copy button copying wrong id (#34904)
  • Fix security manager inheritance in fab provider (#36538)
  • Avoid pendulum.from_timestamp usage (#37160)

Miscellaneous

  • Install latest docker CLI instead of specific one (#37651)
  • Bump undici from 5.26.3 to 5.28.3 in /airflow/www (#37493)
  • Add Python 3.12 exclusions in providers/pyproject.toml (#37404)
  • Remove markdown from core dependencies (#37396)
  • Remove unused pageSize method. (#37319)
  • Add more-itertools as dependency of common-sql (#37359)
  • Replace other Python 3.11 and 3.12 deprecations (#37478)
  • Include airflow_pre_installed_providers.txt into sdist distribution (#37388)
  • Turn Pydantic into an optional dependency (#37320)
  • Limit universal-pathlib to < 0.2.0 (#37311)
  • Allow running airflow against sqlite in-memory DB for tests (#37144)
  • Add description to queue_when (#36997)
  • Updated config.yml for environment variable sql_alchemy_connect_args (#36526)
  • Bump min version of Alembic to 1.13.1 (#36928)
  • Limit flask-session to <0.6 (#36895)

Doc Only Changes

  • Fix upgrade docs to reflect true CLI flags available (#37231)
  • Fix a bug in fundamentals doc (#37440)
  • Add redirect for deprecated page (#37384)
  • Fix the otel config descriptions (#37229)
  • Update Objectstore tutorial with prereqs section (#36983)
  • Add more precise description on avoiding generic package/module names (#36927)
  • Add airflow version substitution into Docker Compose Howto (#37177)
  • Add clarification about DAG author capabilities to security model (#37141)
  • Move docs for cron basics to Authoring and Scheduling section (#37049)
  • Link to release notes in the upgrade docs (#36923)
  • Prevent templated field logic checks in __init__ of operators automatically (#33786)

Apache Airflow Helm Chart 1.12.0

12 Feb 04:30
helm-chart/1.12.0
8c83e91
Compare
Choose a tag to compare

Significant Changes

The helm chart is now using a newer version of bitnami/postgresql dependency (#34817)

The version of bitnami/postgresql subchart upgraded from 12.10.0 to 13.2.24.
The version of PostgreSQL binaries upgraded from 11 to 16.1.0.

The change requires existing bitnami/postgresql subchart users to perform manual major version upgrade using pg_dumpall or pg_upgrade.

As a reminder, it is recommended to set up an external database <https://airflow.apache.org/docs/helm-chart/stable/production-guide.html#database>_ in production.

Default Airflow image is updated to 2.8.1 (#36907)

The default Airflow image that is used with the Chart is now 2.8.1, previously it was 2.7.1.

Default PgBouncer and PgBouncer Exporter images have been updated (#36898)

The PgBouncer and PgBouncer Exporter images are based on newer software/os.

  • pgbouncer: 1.21.0 based on alpine 3.14 (airflow-pgbouncer-2024.01.19-1.21.0)
  • pgbouncer-exporter: 0.16.0 based on alpine 3.19 (apache/airflow:airflow-pgbouncer-exporter-2024.01.19-0.16.0)

Default StatsD image is updated to v0.26.0 (#37187)

The default StatsD image that is used with the Chart is now v0.26.0, previously it was v0.22.8.

Default Redis image is updated to 7-bookworm (#37187)

The default Redis image that is used with the Chart is now 7-bookworm, previously it was 7-bullseye.

New Features

  • Enable native HPA for Airflow Workers (#36174)
  • Add init container + sidecar support for Airflow Kerberos (#35548)
  • Support MySQL backend as KEDA trigger (#36167)

Improvements

  • Improve PriorityClass to improve debuggability (#36365)
  • Add securityContexts in dag processors log groomer sidecar (#34499)
  • Add support for securityContexts in dag processors wait-for-migrations container (#35593)
  • Add templating for PVC storageClassName (#35581)
  • Add volumeClaimTemplate for worker (#34986)
  • Add support for priorityClassName on Redis pods (#34879)
  • Configurable mount path for DAGs volume (#35083)
  • Add support for custom emptyDir config (#34837)
  • Added ability to enable/disable scheduler and webserver (#36991)

Bug Fixes

  • Fix StatsD host in Airflow config (#35679)
  • Set AIRFLOW_HOME env var with airflowHome value (#34839)
  • Safer worker pod annotations (#35309)
  • Set worker safeToEvict properly (#35130)
  • Fix Redis broker URL with useStandardNaming (#34825)
  • Fix metadata DB & port in KEDA connection when usePgbouncer is false (#34741)
  • Fix PgBouncer connection with useStandardNaming (#34787)

Doc only changes

  • Add docs about extending the Airflow Helm chart (#36331)
  • Add comment for Elasticsearch connection scheme (#35588)
  • Add notes about Virtualenvs preventing the need for custom images (#35306)

Misc

  • Default Airflow version to 2.8.1 (#36907)
  • Support git-sync v4 (#34731)
  • Upgrade bitnami/postgresql subchart to 13.2.24 (#36156)
  • Change git sync container indent to 4 (#35824)
  • Remove K8S 1.24 support (#35214)
  • Rebuild pgbouncer and pgbouncer-exporter images with newer versions (#36898)
  • Update statsd and redis chart images (#37187)

Apache Airflow 2.8.1

19 Jan 13:09
2.8.1
c0ffa9c
Compare
Choose a tag to compare

Significant Changes

Target version for core dependency pendulum package set to 3 (#36281).

Support for pendulum 2.1.2 will be saved for a while, presumably until the next feature version of Airflow.
It is advised to upgrade user code to use pendulum 3 as soon as possible.

Airflow packaging specification follows modern Python packaging standards (#36537).

We standardized Airflow dependency configuration to follow latest development in Python packaging by
using pyproject.toml. Airflow is now compliant with those accepted PEPs:

  • PEP-440 Version Identification and Dependency Specification <https://www.python.org/dev/peps/pep-0440/>__
  • PEP-517 A build-system independent format for source trees <https://www.python.org/dev/peps/pep-0517/>__
  • PEP-518 Specifying Minimum Build System Requirements for Python Projects <https://www.python.org/dev/peps/pep-0518/>__
  • PEP-561 Distributing and Packaging Type Information <https://www.python.org/dev/peps/pep-0561/>__
  • PEP-621 Storing project metadata in pyproject.toml <https://www.python.org/dev/peps/pep-0621/>__
  • PEP-660 Editable installs for pyproject.toml based builds (wheel based) <https://www.python.org/dev/peps/pep-0660/>__
  • PEP-685 Comparison of extra names for optional distribution dependencies <https://www.python.org/dev/peps/pep-0685/>__

Also we implement multiple license files support coming from Draft, not yet accepted (but supported by hatchling) PEP:

  • PEP 639 Improving License Clarity with Better Package Metadata <https://peps.python.org/pep-0639/>__

This has almost no noticeable impact on users if they are using modern Python packaging and development tools, generally
speaking Airflow should behave as it did before when installing it from PyPI and it should be much easier to install
it for development purposes using pip install -e ".[devel]".

The differences from the user side are:

  • Airflow extras now get extras normalized to - (following PEP-685) instead of _ and .
    (as it was before in some extras). When you install airflow with such extras (for example dbt.core or
    all_dbs) you should use - instead of _ and ..

In most modern tools this will work in backwards-compatible way, but in some old version of those tools you might need to
replace _ and . with -. You can also get warnings that the extra you are installing does not exist - but usually
this warning is harmless and the extra is installed anyway. It is, however, recommended to change to use - in extras in your dependency
specifications for all Airflow extras.

  • Released airflow package does not contain devel, devel-*, doc and doc-gen extras.
    Those extras are only available when you install Airflow from sources in --editable mode. This is
    because those extras are only used for development and documentation building purposes and are not needed
    when you install Airflow for production use. Those dependencies had unspecified and varying behaviour for
    released packages anyway and you were not supposed to use them in released packages.

  • The all and all-* extras were not always working correctly when installing Airflow using constraints
    because they were also considered as development-only dependencies. With this change, those dependencies are
    now properly handling constraints and they will install properly with constraints, pulling the right set
    of providers and dependencies when constraints are used.

Graphviz dependency is now an optional one, not required one (#36647).

The graphviz dependency has been problematic as Airflow required dependency - especially for
ARM-based installations. Graphviz packages require binary graphviz libraries - which is already a
limitation, but they also require to install graphviz Python bindings to be build and installed.
This does not work for older Linux installation but - more importantly - when you try to install
Graphviz libraries for Python 3.8, 3.9 for ARM M1 MacBooks, the packages fail to install because
Python bindings compilation for M1 can only work for Python 3.10+.

This is not a breaking change technically - the CLIs to render the DAGs is still there and IF you
already have graphviz installed, it will continue working as it did before. The only problem when it
does not work is where you do not have graphviz installed it will raise an error and inform that you need it.

Graphviz will remain to be installed for most users:

  • the Airflow Image will still contain graphviz library, because
    it is added there as extra
  • when previous version of Airflow has been installed already, then
    graphviz library is already installed there and Airflow will
    continue working as it did

The only change will be a new installation of new version of Airflow from the scratch, where graphviz will
need to be specified as extra or installed separately in order to enable DAG rendering option.

Bug Fixes

  • Fix airflow-scheduler exiting with code 0 on exceptions (#36800)
  • Fix Callback exception when a removed task is the last one in the taskinstance list (#36693)
  • Allow anonymous user edit/show resource when set AUTH_ROLE_PUBLIC=admin (#36750)
  • Better error message when sqlite URL uses relative path (#36774)
  • Explicit string cast required to force integer-type run_ids to be passed as strings instead of integers (#36756)
  • Add log lookup exception for empty op subtypes (#35536)
  • Remove unused index on task instance (#36737)
  • Fix check on subclass for typing.Union in _infer_multiple_outputs for Python 3.10+ (#36728)
  • Make sure multiple_outputs is inferred correctly even when using TypedDict (#36652)
  • Add back FAB constant in legacy security manager (#36719)
  • Fix AttributeError when using Dagrun.update_state (#36712)
  • Do not let EventsTimetable schedule past events if catchup=False (#36134)
  • Support encryption for triggers parameters (#36492)
  • Fix the type hint for tis_query in _process_executor_events (#36655)
  • Redirect to index when user does not have permission to access a page (#36623)
  • Avoid using dict as default value in call_regular_interval (#36608)
  • Remove option to set a task instance to running state in UI (#36518)
  • Fix details tab not showing when using dynamic task mapping (#36522)
  • Raise error when DagRun fails while running dag test (#36517)
  • Refactor _manage_executor_state by refreshing TIs in batch (#36502)
  • Add flask config: MAX_CONTENT_LENGTH (#36401)
  • Fix get_leaves calculation for teardown in nested group (#36456)
  • Stop serializing timezone-naive datetime to timezone-aware datetime with UTC tz (#36379)
  • Make kubernetes decorator type annotation consistent with operator (#36405)
  • Fix Webserver returning 500 for POST requests to api/dag/*/dagrun from anonymous user (#36275)
  • Fix the required access for get_variable endpoint (#36396)
  • Fix datetime reference in DAG.is_fixed_time_schedule (#36370)
  • Fix AirflowSkipException message raised by BashOperator (#36354)
  • Allow PythonVirtualenvOperator.skip_on_exit_code to be zero (#36361)
  • Increase width of execution_date input in trigger.html (#36278)
  • Fix logging for pausing DAG (#36182)
  • Stop deserializing pickle when enable_xcom_pickling is False (#36255)
  • Check DAG read permission before accessing DAG code (#36257)
  • Enable mark task as failed/success always (#36254)
  • Create latest log dir symlink as relative link (#36019)
  • Fix Python-based decorators templating (#36103)

Miscellaneous

  • Rename concurrency label to max active tasks (#36691)
  • Restore function scoped httpx import in file_task_handler for performance (#36753)
  • Add support of Pendulum 3 (#36281)
  • Standardize airflow build process and switch to Hatchling build backend (#36537)
  • Get rid of pyarrow-hotfix for CVE-2023-47248 (#36697)
  • Make graphviz dependency optional (#36647)
  • Announce MSSQL support end in Airflow 2.9.0, add migration script hints (#36509)
  • Set min pandas dependency to 1.2.5 for all providers and airflow (#36698)
  • Bump follow-redirects from 1.15.3 to 1.15.4 in /airflow/www (#36700)
  • Provide the logger_name param to base hook in order to override the logger name (#36674)
  • Fix run type icon alignment with run type text (#36616)
  • Follow BaseHook connection fields method signature in FSHook (#36444)
  • Remove redundant docker decorator type annotations (#36406)
  • Straighten typing in workday timetable (#36296)
  • Use batch_is_authorized_dag to check if user has permission to read DAGs (#36279)
  • Replace deprecated get_accessible_dag_ids and use get_readable_dags in get_dag_warnings (#36256)

Doc Only Changes

  • Metrics tagging documentation (#36627)
  • In docs use logical_date instead of deprecated execution_date (#36654)
  • Add section about live-upgrading Airflow (#36637)
  • Replace numpy example with practical exercise demonstrating top-level code (#35097)
  • Improve and add more complete description in the architecture diagrams (#36513)
  • Improve the error message displayed when there is a webserver error (#36570)
  • Update dags.rst with information on DAG pausing (#36540)
  • Update installation prerequisites after upgrading to Debian Bookworm (#36521)
  • Add description on the ways how users should approach DB monitoring (#36483)
  • Add branching based on mapped task group example to dynamic-task-mapping.rst (#36480)
  • Add further details to replacement documentation (#36485)
  • Use cards when describing priority weighting methods (#36411)
  • Update metrics.rst for param dagrun.schedule_delay (#36404)
  • Update admonitions in Python operator doc to reflect sentiment (#36340)
  • Improve audit_logs.rst (#36213)
  • Remove Redshift mention from the list of managed Postgres backends (#36217)

Apache Airflow 2.8.0

18 Dec 19:16
2.8.0
db2b75c
Compare
Choose a tag to compare

Significant Changes

  • Raw HTML code in DAG docs and DAG params descriptions is disabled by default

    To ensure that no malicious javascript can be injected with DAG descriptions or trigger UI forms by DAG authors
    a new parameter webserver.allow_raw_html_descriptions was added with default value of False.
    If you trust your DAG authors code and want to allow using raw HTML in DAG descriptions and params, you can restore the previous
    behavior by setting the configuration value to True.

    To ensure Airflow is secure by default, the raw HTML support in trigger UI has been super-seeded by markdown support via
    the description_md attribute. If you have been using description_html please migrate to description_md.
    The custom_html_form is now deprecated. (#35460)

New Features

  • AIP-58: Add Airflow ObjectStore (AFS) (AIP-58)
  • Add XCom tab to Grid (#35719)
  • Add "literal" wrapper to disable field templating (#35017)
  • Add task context logging feature to allow forwarding messages to task logs (#32646, #32693, #35857)
  • Add Listener hooks for Datasets (#34418, #36247)
  • Allow override of navbar text color (#35505)
  • Add lightweight serialization for deltalake tables (#35462)
  • Add support for serialization of iceberg tables (#35456)
  • prev_end_date_success method access (#34528)
  • Add task parameter to set custom logger name (#34964)
  • Add pyspark decorator (#35247)
  • Add trigger as a valid option for the db clean command (#34908)
  • Add decorators for external and venv python branching operators (#35043)
  • Allow PythonVenvOperator using other index url (#33017)
  • Add Python Virtualenv Operator Caching (#33355)
  • Introduce a generic export for containerized executor logging (#34903)
  • Add ability to clear downstream tis in List Task Instances view (#34529)
  • Attribute clear_number to track DAG run being cleared (#34126)
  • Add BranchPythonVirtualenvOperator (#33356)
  • Allow PythonVenvOperator using other index url (#33017)
  • Add CLI notification commands to providers (#33116)
  • Use dropdown instead of buttons when there are more than 10 retries in log tab (#36025)

Improvements

  • Add multiselect to run state in grid view (#35403)
  • Fix warning message in Connection.get_hook in case of ImportError (#36005)
  • Add processor_subdir to import_error table to handle multiple dag processors (#35956)
  • Consolidate the call of change_state to fail or success in the core executors (#35901)
  • Relax mandatory requirement for start_date when schedule=None (#35356)
  • Use ExitStack to manage mutation of secrets_backend_list in dag.test (#34620)
  • improved visibility of tasks in ActionModal for taskinstance (#35810)
  • Create directories based on AIRFLOW_CONFIG path (#35818)
  • Implements JSON-string connection representation generator (#35723)
  • Move BaseOperatorLink into the separate module (#35032)
  • Set mark_end_on_close after set_context (#35761)
  • Move external logs links to top of react logs page (#35668)
  • Change terminal mode to cbreak in execute_interactive and handle SIGINT (#35602)
  • Make raw HTML descriptions configurable (#35460)
  • Allow email field to be templated (#35546)
  • Hide logical date and run id in trigger UI form (#35284)
  • Improved instructions for adding dependencies in TaskFlow (#35406)
  • Add optional exit code to list import errors (#35378)
  • Limit query result on DB rather than client in synchronize_log_template function (#35366)
  • Allow description to be passed in when using variables CLI (#34791)
  • Allow optional defaults in required fields with manual triggered dags (#31301)
  • Permitting airflow kerberos to run in different modes (#35146)
  • Refactor commands to unify daemon context handling (#34945)
  • Add extra fields to plugins endpoint (#34913)
  • Add description to pools view (#34862)
  • Move cli's Connection export and Variable export command print logic to a separate function (#34647)
  • Extract and reuse get_kerberos_principle func from get_kerberos_principle (#34936)
  • Change type annotation for BaseOperatorLink.operators (#35003)
  • Optimise and migrate to SA2-compatible syntax for TaskReschedule (#33720)
  • Consolidate the permissions name in SlaMissModelView (#34949)
  • Add debug log saying what's being run to EventScheduler (#34808)
  • Increase log reader stream loop sleep duration to 1 second (#34789)
  • Resolve pydantic deprecation warnings re update_forward_refs (#34657)
  • Unify mapped task group lookup logic (#34637)
  • Allow filtering event logs by attributes (#34417)
  • Make connection login and password TEXT (#32815)
  • Ban import Dataset from airflow package in codebase (#34610)
  • Use airflow.datasets.Dataset in examples and tests (#34605)
  • Enhance task status visibility (#34486)
  • Simplify DAG trigger UI (#34567)
  • Ban import AirflowException from airflow (#34512)
  • Add descriptions for airflow resource config parameters (#34438)
  • Simplify trigger name expression (#34356)
  • Move definition of Pod*Exceptions to pod_generator (#34346)
  • Add deferred tasks to the cluster_activity view Pools Slots (#34275)
  • heartbeat failure log message fix (#34160)
  • Rename variables for dag runs (#34049)
  • Clarify new_state in OpenAPI spec (#34056)
  • Remove version top-level element from docker compose files (#33831)
  • Remove generic trigger cancelled error log (#33874)
  • Use NOT EXISTS subquery instead of tuple_not_in_condition (#33527)
  • Allow context key args to not provide a default (#33430)
  • Order triggers by - TI priority_weight when assign unassigned triggers (#32318)
  • Add metric triggerer_heartbeat (#33320)
  • Allow airflow variables export to print to stdout (#33279)
  • Workaround failing deadlock when running backfill (#32991)
  • add dag_run_ids and task_ids filter for the batch task instance API endpoint (#32705)
  • Configurable health check threshold for triggerer (#33089)
  • Rework provider manager to treat Airflow core hooks like other provider hooks (#33051)
  • Ensure DAG-level references are filled on unmap (#33083)
  • Affix webserver access_denied warning to be configurable (#33022)
  • Add support for arrays of different data types in the Trigger Form UI (#32734)
  • Add a mechanism to warn if executors override existing CLI commands (#33423)

Bug Fixes

  • Account for change in UTC offset when calculating next schedule (#35887)
  • Add read access to pools for viewer role (#35352)
  • Fix gantt chart queued duration when queued_dttm is greater than start_date for deferred tasks (#35984)
  • Avoid crushing container when directory is not found on rm (#36050)
  • Update reset_user_sessions to work from either CLI or web (#36056)
  • Fix UI Grid error when DAG has been removed. (#36028)
  • Change Trigger UI to use HTTP POST in web ui (#36026)
  • Fix airflow db shell needing an extra key press to exit (#35982)
  • Change dag grid overscroll behaviour to auto (#35717)
  • Run triggers inline with dag test (#34642)
  • Add borderWidthRight to grid for Firefox scrollbar (#35346)
  • Fix for infinite recursion due to secrets_masker (#35048)
  • Fix write processor_subdir in serialized_dag table (#35661)
  • Reload configuration for standalone dag file processor (#35725)
  • Long custom operator name overflows in graph view (#35382)
  • Add try_number to extra links query (#35317)
  • Prevent assignment of non JSON serializable values to DagRun.conf dict (#35096)
  • Numeric values in DAG details are incorrectly rendered as timestamps (#35538)
  • Fix Scheduler and triggerer crashes in daemon mode when statsd metrics are enabled (#35181)
  • Infinite UI redirection loop after deactivating an active user (#35486)
  • Bug fix fetch_callback of Partial Subset DAG (#35256)
  • Fix DagRun data interval for DeltaDataIntervalTimetable (#35391)
  • Fix query in get_dag_by_pickle util function (#35339)
  • Fix TriggerDagRunOperator failing to trigger subsequent runs when reset_dag_run=True (#35429)
  • Fix weight_rule property type in mappedoperator (#35257)
  • Bugfix/prevent concurrency with cached venv (#35258)
  • Fix dag serialization (#34042)
  • Fix py/url-redirection by replacing request.referrer by get_redirect() (#34237)
  • Fix updating variables during variable imports (#33932)
  • Use Literal from airflow.typing_compat in Airflow core (#33821)
  • Always use Literal from typing_extensions (#33794)

Miscellaneous

  • Change default MySQL client to MariaDB (#36243)
  • Mark daskexecutor provider as removed (#35965)
  • Bump FAB to 4.3.10 (#35991)
  • Mark daskexecutor provider as removed (#35965)
  • Rename Connection.to_json_dict to Connection.to_dict (#35894)
  • Upgrade to Pydantic v2 (#35551)
  • Bump moto version to >= 4.2.9 (#35687)
  • Use pyarrow-hotfix to mitigate CVE-2023-47248 (#35650)
  • Bump axios from 0.26.0 to 1.6.0 in /airflow/www/ (#35624)
  • Make docker decorator's type annotation consistent with operator (#35568)
  • Add default to navbar_text_color and rm condition in style (#35553)
  • Avoid initiating session twice in dag_next_execution (#35539)
  • Work around typing issue in examples and providers (#35494)
  • Enable TCH004 and TCH005 rules (#35475)
  • Humanize log output about retrieved DAG(s) (#35338)
  • Switch from Black to Ruff formatter (#35287)
  • Upgrade to Flask Application Builder 4.3.9 (#35085)
  • D401 Support (#34932, #34933)
  • Use requires_access to check read permission on dag instead of checking it explicitly (#34940)
  • Deprecate lazy import AirflowException from airflow (#34541)
  • View util refactoring on mapped stuff use cases (#34638)
  • Bump postcss from 8.4.25 to 8.4.31 in /airflow/www (#34770)
  • Refactor Sqlalchemy queries to 2.0 s...
Read more

Apache Airflow 2.7.3

06 Nov 07:14
2.7.3
f124353
Compare
Choose a tag to compare

Significant Changes

No significant changes.

Bug Fixes

  • Fix pre-mature evaluation of tasks in mapped task group (#34337)
  • Add TriggerRule missing value in rest API (#35194)
  • Fix Scheduler crash looping when dagrun creation fails (#35135)
  • Fix test connection with codemirror and extra (#35122)
  • Fix usage of cron-descriptor since BC in v1.3.0 (#34836)
  • Fix get_plugin_info for class based listeners. (#35022)
  • Some improvements/fixes for dag_run and task_instance endpoints (#34942)
  • Fix the dags count filter in webserver home page (#34944)
  • Return only the TIs of the readable dags when ~ is provided as a dag_id (#34939)
  • Fix triggerer thread crash in daemon mode (#34931)
  • Fix wrong plugin schema (#34858)
  • Use DAG timezone in TimeSensorAsync (#33406)
  • Mark tasks with all_skipped trigger rule as skipped if any task is in upstream_failed state (#34392)
  • Add read only validation to read only fields (#33413)

Misc/Internal

  • Improve testing harness to separate DB and non-DB tests (#35160, #35333)
  • Add pytest db_test markers to our tests (#35264)
  • Add pip caching for faster build (#35026)
  • Upper bound pendulum requirement to <3.0 (#35336)
  • Limit sentry_sdk to 1.33.0 (#35298)
  • Fix subtle bug in mocking processor_agent in our tests (#35221)
  • Bump @babel/traverse from 7.16.0 to 7.23.2 in /airflow/www (#34988)
  • Bump undici from 5.19.1 to 5.26.3 in /airflow/www (#34971)
  • Remove unused set from SchedulerJobRunner (#34810)
  • Remove warning about max_tis per query > parallelism (#34742)
  • Improve modules import in Airflow core by moving some of them into a type-checking block (#33755)
  • Fix tests to respond to Python 3.12 handling of utcnow in sentry-sdk (#34946)
  • Add connexion<3.0 upper bound (#35218)
  • Limit Airflow to < 3.12 (#35123)
  • update moto version (#34938)
  • Limit WTForms to below 3.1.0 (#34943)

Doc Only Changes

  • Fix variables substitution in Airflow Documentation (#34462)
  • Added example for defaults in conn.extras (#35165)
  • Update datasets.rst issue with running example code (#35035)
  • Remove mysql-connector-python from recommended MySQL driver (#34287)
  • Fix syntax error in task dependency set_downstream example (#35075)
  • Update documentation to enable test connection (#34905)
  • Update docs errors.rst - Mention sentry "transport" configuration option (#34912)
  • Update dags.rst to put SubDag deprecation note right after the SubDag section heading (#34925)
  • Add info on getting variables and config in custom secrets backend (#34834)
  • Document BaseExecutor interface in more detail to help users in writing custom executors (#34324)
  • Fix broken link to airflow_local_settings.py template (#34826)
  • Fixes python_callable function assignment context kwargs example in params.rst (#34759)
  • Add missing multiple_outputs=True param in the TaskFlow example (#34812)
  • Remove extraneous '>' in provider section name (#34813)
  • Fix imports in extra link documentation (#34547)

Apache Airflow 2.7.2

12 Oct 10:58
2.7.2
c8b25cb
Compare
Choose a tag to compare

Significant Changes

No significant changes

Bug Fixes

  • Check if the lower of provided values are sensitives in config endpoint (#34712)
  • Add support for ZoneInfo and generic UTC to fix datetime serialization (#34683, #34804)
  • Fix AttributeError: 'Select' object has no attribute 'count' during the airflow db migrate command (#34348)
  • Make dry run optional for patch task instance (#34568)
  • Fix non deterministic datetime deserialization (#34492)
  • Use iterative loop to look for mapped parent (#34622)
  • Fix is_parent_mapped value by checking if any of the parent taskgroup is mapped (#34587)
  • Avoid top-level airflow import to avoid circular dependency (#34586)
  • Add more exemptions to lengthy metric list (#34531)
  • Fix dag warning endpoint permissions (#34355)
  • Fix task instance access issue in the batch endpoint (#34315)
  • Correcting wrong time showing in grid view (#34179)
  • Fix www cluster_activity view not loading due to standaloneDagProcessor templating (#34274)
  • Set loglevel=DEBUG in 'Not syncing DAG-level permissions' (#34268)
  • Make param validation consistent for DAG validation and triggering (#34248)
  • Ensure details panel is shown when any tab is selected (#34136)
  • Fix issues related to access_control={} (#34114)
  • Fix not found ab_user table in the CLI session (#34120)
  • Fix FAB-related logging format interpolation (#34139)
  • Fix query bug in next_run_datasets_summary endpoint (#34143)
  • Fix for TaskGroup toggles for duplicated labels (#34072)
  • Fix the required permissions to clear a TI from the UI (#34123)
  • Reuse _run_task_session in mapped render_template_fields (#33309)
  • Fix scheduler logic to plan new dag runs by ignoring manual runs (#34027)
  • Add missing audit logs for Flask actions add, edit and delete (#34090)
  • Hide Irrelevant Dag Processor from Cluster Activity Page (#33611)
  • Remove infinite animation for pinwheel, spin for 1.5s (#34020)
  • Restore rendering of provider configuration with version_added (#34011)

Doc Only Changes

  • Clarify audit log permissions (#34815)
  • Add explanation for Audit log users (#34814)
  • Import AUTH_REMOTE_USER from FAB in WSGI middleware example (#34721)
  • Add information about drop support MsSQL as DB Backend in the future (#34375)
  • Document how to use the system's timezone database (#34667)
  • Clarify what landing time means in doc (#34608)
  • Fix screenshot in dynamic task mapping docs (#34566)
  • Fix class reference in Public Interface documentation (#34454)
  • Clarify var.value.get and var.json.get usage (#34411)
  • Schedule default value description (#34291)
  • Docs for triggered_dataset_event (#34410)
  • Add DagRun events (#34328)
  • Provide tabular overview about trigger form param types (#34285)
  • Add link to Amazon Provider Configuration in Core documentation (#34305)
  • Add "security infrastructure" paragraph to security model (#34301)
  • Change links to SQLAlchemy 1.4 (#34288)
  • Add SBOM entry in security documentation (#34261)
  • Added more example code for XCom push and pull (#34016)
  • Add state utils to Public Airflow Interface (#34059)
  • Replace markdown style link with rst style link (#33990)
  • Fix broken link to the "UPDATING.md" file (#33583)

Misc/Internal

  • Update min-sqlalchemy version to account for latest features used (#34293)
  • Fix SesssionExemptMixin spelling (#34696)
  • Restrict astroid version < 3 (#34658)
  • Fail dag test if defer without triggerer (#34619)
  • Fix connections exported output (#34640)
  • Don't run isort when creating new alembic migrations (#34636)
  • Deprecate numeric type python version in PythonVirtualEnvOperator (#34359)
  • Refactor os.path.splitext to Path.* (#34352, #33669)
  • Replace = by is for type comparison (#33983)
  • Refactor integer division (#34180)
  • Refactor: Simplify comparisons (#34181)
  • Refactor: Simplify string generation (#34118)
  • Replace unnecessary dict comprehension with dict() in core (#33858)
  • Change "not all" to "any" for ease of readability (#34259)
  • Replace assert by if...raise in code (#34250, #34249)
  • Move default timezone to except block (#34245)
  • Combine similar if logic in core (#33988)
  • Refactor: Consolidate import and usage of random (#34108)
  • Consolidate importing of os.path.* (#34060)
  • Replace sequence concatenation by unpacking in Airflow core (#33934)
  • Refactor unneeded 'continue' jumps around the repo (#33849, #33845, #33846, #33848, #33839, #33844, #33836, #33842)
  • Remove [project] section from pyproject.toml (#34014)
  • Move the try outside the loop when this is possible in Airflow core (#33975)
  • Replace loop by any when looking for a positive value in core (#33985)
  • Do not create lists we don't need (#33519)
  • Remove useless string join from core (#33969)
  • Add TCH001 and TCH002 rules to pre-commit to detect and move type checking modules (#33865)
  • Add cancel_trigger_ids to to_cancel dequeue in batch (#33944)
  • Avoid creating unnecessary list when parsing stats datadog tags (#33943)
  • Replace dict.items by dict.values when key is not used in core (#33940)
  • Replace lambdas with comprehensions (#33745)
  • Improve modules import in Airflow core by some of them into a type-checking block (#33755)
  • Refactor: remove unused state - SHUTDOWN (#33746, #34063, #33893)
  • Refactor: Use in-place .sort() (#33743)
  • Use literal dict instead of calling dict() in Airflow core (#33762)
  • remove unnecessary map and rewrite it using list in Airflow core (#33764)
  • Replace lambda by a def method in Airflow core (#33758)
  • Replace type func by isinstance in fab_security manager (#33760)
  • Replace single quotes by double quotes in all Airflow modules (#33766)
  • Merge multiple isinstance calls for the same object in a single call (#33767)
  • Use a single statement with multiple contexts instead of nested statements in core (#33769)
  • Refactor: Use f-strings (#33734, #33455)
  • Refactor: Use random.choices (#33631)
  • Use str.splitlines() to split lines (#33592)
  • Refactor: Remove useless str() calls (#33629)
  • Refactor: Improve detection of duplicates and list sorting (#33675)
  • Simplify conditions on len() (#33454)

Apache Airflow Helm Chart 1.11.0

02 Oct 23:29
helm-chart/1.11.0
Compare
Choose a tag to compare

Significant Changes

Support naming customization on helm chart resources, some resources may be renamed during upgrade (#31066)

This is a new opt-in switch useStandardNaming, for backwards compatibility, to leverage the standard naming convention, which allows full use of fullnameOverride and nameOverride in all resources.

The following resources will be renamed using default of useStandardNaming=false when upgrading to 1.11.0 or a higher version.

  • ConfigMap {release}-airflow-config to {release}-config
  • Secret {release}-airflow-metadata to {release}-metadata
  • Secret {release}-airflow-result-backend to {release}-result-backend
  • Ingress {release}-airflow-ingress to {release}-ingress

For existing installations, all your resources will be recreated with a new name and Helm will delete the previous resources.

This won't delete existing PVCs for logs used by StatefulSet/Deployments, but it will recreate them with brand new PVCs.
If you do want to preserve logs history you'll need to manually copy the data of these volumes into the new volumes after
deployment. Depending on what storage backend/class you're using this procedure may vary. If you don't mind starting
with fresh logs/redis volumes, you can just delete the old PVCs that will be names, for example:

kubectl delete pvc -n airflow logs-gta-triggerer-0
kubectl delete pvc -n airflow logs-gta-worker-0
kubectl delete pvc -n airflow redis-db-gta-redis-0

If you do not change useStandardNaming or fullnameOverride after upgrade, you can proceed as usual and no unexpected behaviours will be presented.

bitnami/postgresql subchart updated to 12.10.0 (#33747)

The PostgreSQL subchart that is used with the Chart is now 12.10.0, previously it was 12.1.9.

Default git-sync image is updated to 3.6.9 (#33748)

The default git-sync image that is used with the Chart is now 3.6.9, previously it was 3.6.3.

Default Airflow image is updated to 2.7.1 (#34186)

The default Airflow image that is used with the Chart is now 2.7.1, previously it was 2.6.2.

New Features

  • Add support for scheduler name to PODs templates (#33843)
  • Support KEDA scaling for triggerer (#32302)
  • Add support for container lifecycle hooks (#32349, #34677)
  • Support naming customization on helm chart resources (#31066)
  • Adding startupProbe to scheduler and webserver (#33107)
  • Allow disabling token mounts using automountServiceAccountToken (#32808)
  • Add support for defining custom priority classes (#31615)
  • Add support for runtimeClassName (#31868)
  • Add support for custom query in workers KEDA trigger (#32308)

Improvements

  • Add containerSecurityContext for cleanup job (#34351)
  • Add existing secret support for PGBouncer metrics exporter (#32724)
  • Allow templating in webserver ingress hostnames (#33142)
  • Allow templating in flower ingress hostnames (#33363)
  • Add configmap annotations to StatsD and webserver (#33340)
  • Add pod security context to PgBouncer (#32662)
  • Add an option to use a direct DB connection in KEDA when PgBouncer is enabled (#32608)
  • Allow templating in cleanup.schedule (#32570)
  • Template dag processor waitformigration containers extraVolumeMounts (#32100)
  • Ability to inject extra containers into PgBouncer (#33686)
  • Allowing ability to add custom env into PgBouncer container (#33438)
  • Add support for env variables in the StatsD container (#33175)

Bug Fixes

  • Add airflow db migrate command to database migration job (#34178)
  • Pass workers.terminationGracePeriodSeconds into KubeExecutor pod template (#33514)
  • CeleryExecutor namespace depends on Airflow version (#32753)
  • Fix dag processor not including webserver config volume (#32644)
  • Dag processor liveness probe include --local and --job-type args (#32426)
  • Revising flower_url_prefix considering default value (#33134)

Doc only changes

  • Add more explicit "embedded postgres" exclusion for production (#33034)
  • Update git-sync description (#32181)

Misc

  • Default Airflow version to 2.7.1 (#34186)
  • Update PostgreSQL subchart to 12.10.0 (#33747)
  • Update git-sync to 3.6.9 (#33748)
  • Remove unnecessary loops to load env from helm values (#33506)
  • Replace common.tplvalues.render with tpl in ingress template files (#33384)
  • Remove K8S 1.23 support (#32899)
  • Fix chart named template comments (#32681)
  • Remove outdated comment from chart values in the workers KEDA conf section (#32300)
  • Remove unnecessary or function in template files (#34415)