title	description	author	ms.author	ms.subservice	ms.topic	ms.custom	ms.date
Mapping data flows	An overview of mapping data flows in Azure Data Factory	kromerm	makromer	data-flows	conceptual	references_regions	09/25/2024

Mapping data flows in Azure Data Factory

[!INCLUDEappliesto-adf-asa-md]

What are mapping data flows?

Mapping data flows are visually designed data transformations in Azure Data Factory. Data flows allow data engineers to develop data transformation logic without writing code. The resulting data flows are executed as activities within Azure Data Factory pipelines that use scaled-out Apache Spark clusters. Data flow activities can be operationalized using existing Azure Data Factory scheduling, control, flow, and monitoring capabilities.

Mapping data flows provide an entirely visual experience with no coding required. Your data flows run on ADF-managed execution clusters for scaled-out data processing. Azure Data Factory handles all the code translation, path optimization, and execution of your data flow jobs.

Getting started

Data flows are created from the factory resources pane like pipelines and datasets. To create a data flow, select the plus sign next to Factory Resources, and then select Data Flow.

This action takes you to the data flow canvas, where you can create your transformation logic. Select Add source to start configuring your source transformation. For more information, see Source transformation.

Authoring data flows

Mapping data flow has a unique authoring canvas designed to make building transformation logic easy. The data flow canvas is separated into three parts: the top bar, the graph, and the configuration panel.

:::image type="content" source="media/data-flow/canvas-1.png" alt-text="Screenshot shows the data flow canvas with top bar, graph, and configuration panel labeled.":::

Graph

The graph displays the transformation stream. It shows the lineage of source data as it flows into one or more sinks. Sinks can be any data source destinations where you want to move the results of your transformed data. To add a new source, select Add source. To add a new transformation, select the plus sign on the lower right of an existing transformation. Learn more on how to manage the data flow graph.

:::image type="content" source="media/data-flow/canvas-2.png" alt-text="Screenshot shows the graph part of the canvas with a Search text box.":::

Configuration panel

The configuration panel shows the settings specific to the currently selected transformation. If no transformation is selected, it shows the data flow. In the overall data flow configuration, you can add parameters via the Parameters tab. For more information, see Mapping data flow parameters.

Each transformation contains at least four configuration tabs.

Transformation settings

The first tab in each transformation's configuration pane contains the settings specific to that transformation. For more information, see that transformation's documentation page.

Optimize

The Optimize tab contains settings to configure partitioning schemes. To learn more about how to optimize your data flows, see the mapping data flow performance guide.

Inspect

The Inspect tab provides a view into the metadata of the data stream that you're transforming. You can see column counts, the columns changed, the columns added, data types, the column order, and column references. Inspect is a read-only view of your metadata. You don't need to have debug mode enabled to see metadata in the Inspect pane.

:::image type="content" source="media/data-flow/inspect1.png" alt-text="Inspect":::

As you change the shape of your data through transformations, you can see the metadata changes flow in the Inspect pane. If there isn't a defined schema in your source transformation, then metadata isn't visible in the Inspect pane. Lack of metadata is common in schema drift scenarios.

Data preview

If debug mode is on, the Data Preview tab gives you an interactive snapshot of the data at each transform. For more information, see Data preview in debug mode.

Top bar

The top bar contains actions that affect the whole data flow, like saving and validation. You can view the underlying JSON code and data flow script of your transformation logic as well. For more information, learn about the data flow script.

Available transformations

View the mapping data flow transformation overview to get a list of available transformations.

Data flow data types

array
binary
boolean
complex
decimal (includes precision)
date
float
integer
long
map
short
string
timestamp

Data flow activity

Mapping data flows are operationalized within ADF pipelines using the data flow activity. All a user has to do is specify which integration runtime to use and pass in parameter values. For more information, learn about the Azure integration runtime.

Debug mode

Debug mode allows you to interactively see the results of each transformation step while you build and debug your data flows. The debug session can be used both when building your data flow logic and when running pipeline debug runs with data flow activities. To learn more, see the debug mode documentation.

Monitoring data flows

Mapping data flow integrates with existing Azure Data Factory monitoring capabilities. To learn how to understand data flow monitoring output, see monitoring mapping data flows.

The Azure Data Factory team has created a performance tuning guide to help you optimize the execution time of your data flows after building your business logic.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

concepts-data-flow-overview.md

concepts-data-flow-overview.md

Mapping data flows in Azure Data Factory

What are mapping data flows?

Getting started

Authoring data flows

Graph

Configuration panel

Transformation settings

Optimize

Inspect

Data preview

Top bar

Available transformations

Data flow data types

Data flow activity

Debug mode

Monitoring data flows

Related content

Files

concepts-data-flow-overview.md

Latest commit

History

concepts-data-flow-overview.md

File metadata and controls

Mapping data flows in Azure Data Factory

What are mapping data flows?

Getting started

Authoring data flows

Graph

Configuration panel

Transformation settings

Optimize

Inspect

Data preview

Top bar

Available transformations

Data flow data types

Data flow activity

Debug mode

Monitoring data flows

Related content