Releases: flyteorg/flyte

Flyte v1.4.0 milestone release

07 Mar 01:38
eba986b

Flyte 1.4 release

The main features of the 1.4 release are:

  • Support for PodTemplate at the task level
  • Revamped auth system in flytekit

As Python 3.7 reached end of life in December 2022, we dropped support for that version in this release.

Platform

Support for PodTemplate at the task level.

Users can now define a PodTemplate as part of the definition of a task. For example, note how we have access to a full V1PodSpec as part of the task definition:

from flytekit import task
from flytekit.core.pod_template import PodTemplate
from kubernetes.client import (
    V1Container,
    V1EnvVar,
    V1PodSpec,
    V1ResourceRequirements,
    V1Toleration,
    V1Volume,
)


@task(
    pod_template=PodTemplate(
        primary_container_name="primary",
        labels={"lKeyA": "lValA", "lKeyB": "lValB"},
        annotations={"aKeyA": "aValA", "aKeyB": "aValB"},
        pod_spec=V1PodSpec(
            containers=[
                V1Container(
                    name="primary",
                    image="repo/placeholderImage:0.0.0",
                    command=["echo"],
                    args=["wow"],
                    resources=V1ResourceRequirements(limits={"cpu": "999", "gpu": "999"}),
                    env=[V1EnvVar(name="eKeyC", value="eValC"), V1EnvVar(name="eKeyD", value="eValD")],
                ),
            ],
            volumes=[V1Volume(name="volume")],
            tolerations=[
                V1Toleration(
                    key="num-gpus",
                    operator="Equal",
                    value="1",
                    effect="NoSchedule",
                ),
            ],
        ),
    )
)
def t1(i: str):
    ...

We are working on more examples in our documentation. Stay tuned!

Flytekit

As promised in https://github.com/flyteorg/flytekit/releases/tag/v1.3.0, we're backporting important changes to the 1.2.x release branch. In the past month we made two releases: https://github.com/flyteorg/flytekit/releases/tag/v1.2.8 and https://github.com/flyteorg/flytekit/releases/tag/v1.2.9.

Here are some of the highlights of this release. For a full changelog, please visit https://github.com/flyteorg/flytekit/releases/tag/v1.4.0.

Revamped auth system

In flyteorg/flytekit#1458 we introduced a new OAuth2 handling system based on client-side grpc interceptors.

New sandbox features

In this new release flytectl demo brings the following new features:

  • Support for specifying extra configuration for Flyte
  • Support for specifying extra cluster resource templates for bootstrapping new namespaces
  • Sandbox state (DB, buckets) is now persistent across restarts and upgrades

Flyteconsole

Flyte v1.4.0-b0 milestone release

14 Feb 19:38
d60c9af
Pre-release

Flyte v1.4.0-b0 Changelog

Pod Templates and changes to the sandbox experience (mainly around configuration reloading).

A full changelog will come with the official release.

Flyte v1.3.0 milestone release

11 Jan 23:57
f69fb09

Flyte v1.3.0

The main features of this 1.3 release are:

  • Databricks support as part of the Spark plugin
  • New Helm chart that offers a simpler deployment using just one Flyte service
  • Signaling/gate node support (human in the loop tasks)
  • User documentation support (backend and flytekit only, limited types)

The latter two are pending some work in Flyte Console and will be fully piped through by the end of Q1. Setting and approving gate nodes is already supported in FlyteRemote, though only a limited set of types can be passed in.

Notes

There are a couple of things to point out with this release.

Caching on Structured Dataset

Please take a look at the flytekit PR notes for more information. If you haven't bumped Propeller to version v1.1.36 (aka Flyte v1.2) or later, cached tasks that take a dataframe or structured dataset as input will trigger a cache miss. If you have upgraded Propeller, they will not.
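The mechanics of such a cache miss can be shown with a toy sketch. This is not Flyte's actual implementation (the real key also covers the task interface and, since Propeller v1.1.36, a schema-aware hash for structured datasets), but it illustrates the idea: a cached result is looked up by a key derived from the task, its cache_version, and a digest of its inputs, so any change in how a dataframe input is represented changes the key and misses the cache.

```python
import hashlib
import json


def cache_key(task_name: str, cache_version: str, inputs: dict) -> str:
    """Simplified, illustrative cache key: a digest over the task identity
    and a canonical serialization of its inputs (not Flyte's real code)."""
    payload = json.dumps(
        {"task": task_name, "version": cache_version, "inputs": inputs},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode()).hexdigest()


# The same logical dataframe represented two different ways (e.g. before and
# after the structured-dataset change) yields different keys -- the source of
# the cache misses described above.
old = cache_key("t1", "1.0", {"df": {"format": "parquet/schema-v1"}})
new = cache_key("t1", "1.0", {"df": {"format": "parquet/schema-v2"}})
assert old != new
```

Once Propeller computes keys the new way consistently, identical inputs hash identically again and caching behaves as before.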

Flytekit Remote Types

In the FlyteRemote experience, fetched tasks and workflows are now based on their respective "spec" classes in the IDL (task/wf) rather than the template. The spec messages are a superset of the template messages, so no information is lost. However, if you have code that accesses elements of the templates directly, it will need to be updated.
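To illustrate why the change is lossless but can still break callers, here is a sketch with stand-in dataclasses (these are not flyteidl's actual types): the spec wraps the template, so old attribute paths move one level down.

```python
from dataclasses import dataclass


@dataclass
class TaskTemplate:
    """Stand-in for the IDL template message."""
    id: str
    container_image: str


@dataclass
class TaskSpec:
    """Stand-in for the IDL spec message: a superset that wraps the template."""
    template: TaskTemplate


spec = TaskSpec(template=TaskTemplate(id="t1", container_image="repo/img:1"))

# Old-style direct access to template fields no longer works at the top level...
assert not hasattr(spec, "container_image")
# ...but everything is still reachable through the wrapped template.
assert spec.template.container_image == "repo/img:1"
```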

Usage Overview

Databricks

Please refer to the documentation for setting up Databricks.
Databricks is a subclass of the Spark task configuration, so you can use the new class anywhere you would use the more general Spark configuration.

from flytekit import task
from flytekitplugins.spark import Databricks


@task(
    task_config=Databricks(
        spark_conf={
            "spark.driver.memory": "1000M",
            "spark.executor.memory": "1000M",
            "spark.executor.cores": "1",
            "spark.executor.instances": "2",
            "spark.driver.cores": "1",
        },
        databricks_conf={
            "run_name": "flytekit databricks plugin example",
            "new_cluster": {
                "spark_version": "11.0.x-scala2.12",
                "node_type_id": "r3.xlarge",
                "aws_attributes": {
                    "availability": "ON_DEMAND",
                    "instance_profile_arn": "arn:aws:iam::1237657460:instance-profile/databricks-s3-role",
                },
                "num_workers": 4,
            },
            "timeout_seconds": 3600,
            "max_retries": 1,
        },
    )
)
def hello_databricks(i: int):
    # Placeholder task body; it runs on the ephemeral Databricks cluster.
    ...

New Deployment Type

A couple of releases ago, we introduced a new Flyte executable that combines all of Flyte's backend functionality into one command, so only one image needs to run. This is now our recommended way for newcomers to install and administer Flyte, and it ships with a new Helm chart; the documentation has been updated accordingly. For new installations of Flyte (clusters that do not already have the flyte-core or flyte charts installed), users can run:

helm install flyte-server flyteorg/flyte-binary --namespace flyte --values your_values.yaml

New local demo environment

Users may have noticed that the environment provided by flytectl demo start has also been updated to use this new style of deployment and now installs the new Helm chart internally. The demo cluster also exposes an internal Docker registry on port 30000: with the demo cluster up, you can tag and push to localhost:30000/yourimage:tag123 and the image will be accessible to the internal Docker daemon. The web interface is still at localhost:30080; Postgres has moved to port 30001 and the Minio API (not the web console) to 30002.

Human-in-the-loop Workflows

Users can now insert sleeps, approvals, and input requests in the form of gate nodes. Check out one of our earlier issues for background information.

from datetime import timedelta

from flytekit import approve, sleep, wait_for_input, workflow

# t1 and t2 are ordinary tasks defined elsewhere.

@workflow
def mainwf(a: int):
    x = t1(a=a)
    s1 = wait_for_input("signal-name", timeout=timedelta(hours=1), expected_type=bool)
    s2 = wait_for_input("signal name 2", timeout=timedelta(hours=2), expected_type=int)
    z = t1(a=5)
    zzz = sleep(timedelta(seconds=10))
    y = t2(a=s2)
    q = t2(a=approve(y, "approvalfory", timeout=timedelta(hours=2)))
    x >> s1
    s1 >> z
    z >> zzz
    ...

These also work inside @dynamic tasks. Interacting with signals from flytekit's remote experience looks like this:

from flytekit.configuration import Config
from flytekit.remote.remote import FlyteRemote

r = FlyteRemote(
    Config.auto(config_file="/Users/ytong/.flyte/dev.yaml"),
    default_project="flytesnacks",
    default_domain="development",
)
r.list_signals("atc526g94gmlg4w65dth")  # argument is an execution name
r.set_signal("signal-name", "execidabc123", True)

Overwritten Cached Values on Execution

Users can now configure a workflow execution to overwrite the cache. Every task in the execution, regardless of previous cache status, will execute and write its cached values, overwriting previous values if necessary. This allows previously corrupted cache values to be corrected without the tedious process of incrementing cache_version and re-registering Flyte workflows and tasks.
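The semantics can be sketched in a few lines (a toy model, not Flyte's code): on an overwrite-cache execution the task always runs, and its fresh result replaces whatever was cached under the same key.

```python
# Toy cache keyed the way a catalog might be; "key1" holds a corrupted value.
cache = {"key1": "corrupted"}


def run_task(key, compute, overwrite=False):
    """Illustrative overwrite-cache semantics (not Flyte's implementation)."""
    if not overwrite and key in cache:
        return cache[key]      # normal execution: cache hit short-circuits
    value = compute()          # overwrite: always execute the task...
    cache[key] = value         # ...and rewrite the cached value
    return value


# A normal run returns the stale cached value...
assert run_task("key1", lambda: "good") == "corrupted"
# ...while an overwrite run recomputes and repairs the cache in place,
# with no cache_version bump or re-registration needed.
assert run_task("key1", lambda: "good", overwrite=True) == "good"
assert cache["key1"] == "good"
```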

Support for Dask

Users can now spawn ephemeral Dask clusters as part of their workflows, similar to the existing support for Ray and Spark.

Looking Ahead

In the coming release, we are focusing on...

  1. Out-of-core plugins: make backend plugins scalable and easy to author, with no need for code generation or for tools that ML engineers and data scientists are not accustomed to using.
  2. Performance observability: we have made great progress on exposing both finer-grained runtime metrics and Flyte's orchestration metrics. This is important for understanding workflow evaluation performance and mitigating inefficiencies.

Flyte v1.3.0-b9 milestone release

10 Jan 00:26
7b401bc
Pre-release

v1.3.0-b9

CI changes.

Flyte v1.3.0-b8 milestone release

30 Dec 00:59
50036f4
Pre-release

v1.3.0-b8

Empty release for testing CI.

Flyte v1.3.0-b7 milestone release

29 Dec 22:50
3df9fcf
Pre-release

v1.3.0-b7

Empty release for testing CI.

Flyte v1.3.0-b6 milestone release

29 Dec 18:33
9154136
Pre-release

Flyte v1.3.0-b6 Changelog

Pull in Doc hub changes.

Flyte v1.3.0-b5 milestone release

20 Dec 18:22
e240038
Pre-release

Flyte v1.3.0-b5 Changelog

Databricks and dbx plugin changes.

Flyte v1.3.0-b4 milestone release

15 Dec 18:43
28583e2
Pre-release

Flyte v1.3.0-b4 Changelog

Pull in first batch of signaling changes.

Flyte v1.3.0-b3 milestone release

07 Dec 23:37
8ac8aae
Pre-release

Flyte v1.3.0-b3 Changelog

Use checksums to apply cluster resource changes from flyteadmin.