Power BI DataFlows Julio Granados - [email protected]

Post on 25-Oct-2020

Page 1: Power BI DataFlows BI Dataflows… · Dataflow Capability Power BI Pro Power BI Premium Scheduled refresh 8 per day 48 per day Total Storage 10 GB/user 100 TB/node Dataflow Authoring

Power BI DataFlows
Julio Granados - [email protected]

Page 2:

Session Goals

• What are Power BI DataFlows?
• Where is the data stored?

Page 3:

How did we use to do it?

Decision making

Data Preparation
Data Modeling
Data Discovery
Data Insights

Page 4:

Decision Making
Data Discovery

Data Preparation
Data Modeling

Data Preparation
Data Insights and Predictions

How did we use to do it?

Page 5:

Why were dataflows born?

Data Preparation
Data Modeling

Decision Making
Data Discovery
Some Data Preparation

Data Preparation
Data Insights and Predictions

Page 6:

Dataflows?

[Diagram: Workspaces; DataFlow, DataSets, Reports and Dashboards; each entity has Metadata, a Query Definition, and Data; manual or scheduled refresh]
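One way to picture the structure on this slide is as a small data model. The sketch below is illustrative only: the class and field names (`Entity`, `Dataflow`, `query_definition`, `refresh`, and so on) are assumptions made for this example and are not part of any Power BI API.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    """One table in a dataflow: metadata, a query definition, and data."""
    name: str
    metadata: dict            # e.g. column names and types (hypothetical shape)
    query_definition: str     # the Power Query (M) text that produces the data
    data: list = field(default_factory=list)  # rows loaded by the last refresh


@dataclass
class Dataflow:
    """A dataflow lives in a workspace and groups entities."""
    name: str
    workspace: str
    entities: list = field(default_factory=list)
    refresh: str = "manual"   # "manual" or "scheduled", as on the slide

    def entity_names(self):
        return [e.name for e in self.entities]


# Hypothetical example values, invented for illustration.
sales = Entity(
    name="Sales",
    metadata={"columns": {"OrderId": "int", "Amount": "decimal"}},
    query_definition="let Source = ... in Source",  # placeholder, not real M
)
flow = Dataflow(name="SalesPrep", workspace="Finance",
                entities=[sales], refresh="scheduled")
print(flow.entity_names())
```

Datasets and reports would then sit downstream of `flow`, reading the data each entity holds after a refresh.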

Page 7:

Dataflows Sources

• No Live Query
• No Direct Query
• No Streaming Data

Page 8:

DEMO: Create a Simple DataFlow

Page 9:

Questions

Q: Do dataflows replace Power BI datasets?

Page 10:

DEMO: Dataflow as Source

Page 11:

Questions

Q: Do dataflows replace Power BI datasets?

Sources

Datasets

Reports

Page 12:

Questions

Q: Can I reuse dataflows defined in other workspaces?

Page 13:

DEMO: Using Linked and Computed Entities

Page 14:

Question

Q: Can I reuse dataflows defined in other workspaces?

[Diagram: entities related through Link and Compute]

Compute:
• Reference
• Join
• Append
• Incremental
• Calculate tables

Page 15:

Question

Q: Can I reuse dataflows defined in other workspaces?

[Diagram: entities related through Link and Compute, stored in a physical folder]

Compute:
• Reference
• Join
• Calculate tables
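The difference between Link and Compute can be sketched with plain Python lists standing in for entities. This is a conceptual illustration under stated assumptions, not how Power BI implements linked or computed entities: the entity names and rows are invented, and in a real dataflow the transformations would be written in Power Query (M).

```python
# Hypothetical in-memory stand-ins for entities stored by two dataflows.
customers = [
    {"CustomerId": 1, "Name": "Contoso"},
    {"CustomerId": 2, "Name": "Fabrikam"},
]
orders_2019 = [{"OrderId": 10, "CustomerId": 1, "Amount": 250.0}]
orders_2020 = [{"OrderId": 11, "CustomerId": 2, "Amount": 75.5}]

# Link: reuse an entity defined elsewhere by reference, without copying it.
linked_customers = customers

# Compute (append): stack the rows of entities that share the same columns.
all_orders = orders_2019 + orders_2020

# Compute (join): enrich each order with the matching customer name.
names_by_id = {c["CustomerId"]: c["Name"] for c in linked_customers}
orders_with_names = [
    {**order, "CustomerName": names_by_id[order["CustomerId"]]}
    for order in all_orders
]

for row in orders_with_names:
    print(row)
```

The linked entity stays owned by its original dataflow, while the appended and joined results are new entities derived from data that is already stored.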

Page 16:

Where is my Dataflow?

Global or Premium Storage (default)

Data Lake Storage (preview)

Page 17:

Data Lake Storage Gen2 for Power BI

Page 18:

Data Lake Storage Gen2 and Power BI Dataflow Integration

[Diagram: setup steps]
• Data Lake Storage Gen2 (storage account)
• Create blob container (inside the storage account), named powerbi
• Admin Portal
• Data Lake Storage Gen2 (storage account): Read role access
• Associate
• All Write and Execute

Page 19:

How do Data Lake Storage Gen2 and Power BI Dataflows work together?

[Diagram: DataFlow; Data Lake Storage Gen2 (storage account); Blob Container; CDM Folder; Refresh Data; Load Process; Create Process]

Page 20:

What is the Common Data Model Definition?

Shared data language and folder structure for business and analytical applications to use

Page 21:

Common Data Model Schema

Page 22:

Common Data Model Structure

[Diagram: folder hierarchy inside the blob container]

Blob Container (N DataFlows)
DataFlow Folder (N Entities)
Entity Folder
Snapshots Folder
Model
Data

Definition:
• Query
• Refresh policy
• Data location
• Entity
• Entity type
• Columns
• Relationships
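The definition items above are the kind of metadata a CDM folder keeps in its model file. As a rough sketch, assuming a trimmed-down `model.json` with `entities`, `attributes`, and `partitions` keys, the snippet below lists each entity with its columns and data locations. The dataflow name, entity name, and URL are invented for illustration, and real model files carry more fields than shown here.

```python
import json

# Hypothetical, trimmed-down model.json for a CDM folder. The names and the
# location URL are invented; real files contain additional properties.
MODEL_JSON = """
{
  "name": "SalesPrep",
  "version": "1.0",
  "entities": [
    {
      "$type": "LocalEntity",
      "name": "Sales",
      "attributes": [
        {"name": "OrderId", "dataType": "int64"},
        {"name": "Amount", "dataType": "decimal"}
      ],
      "partitions": [
        {"name": "Part001",
         "location": "https://example.invalid/powerbi/SalesPrep/Sales/part-0001.csv"}
      ]
    }
  ]
}
"""


def describe_entities(model_text):
    """Return (entity name, column names, data file locations) per entity."""
    model = json.loads(model_text)
    summary = []
    for entity in model.get("entities", []):
        columns = [a["name"] for a in entity.get("attributes", [])]
        locations = [p["location"] for p in entity.get("partitions", [])]
        summary.append((entity["name"], columns, locations))
    return summary


for name, columns, locations in describe_entities(MODEL_JSON):
    print(name, columns, len(locations))
```

Because the metadata is plain JSON next to the data files, other tools that can read the storage account can discover entities and columns without going through Power BI.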

Page 23:

Common Data Model and Azure Services

Shared data language and folder structure for business and analytical applications to use

Page 24:

DEMO: Using Databricks and Data Factory, Reading from CDM

Page 25:

Do dataflows replace Data Factory?

• Q: Do dataflows replace Azure Data Factory?

Page 26:

Do dataflows replace Data Factory?

• Q: Do dataflows replace Azure Data Factory?

Data Factory
• Data engineers
• All sources
• All destinations
• Complex transformations
• Drag and drop
• Manually orchestrated
• Scaled ETL (SSIS + DTU)

DataFlow
• Business users
• Batch sources
• Only a PBI dataset as destination
• Complex transformations by code
• Auto-orchestrated
• Scaling depends on the PBI node

Page 27:

DEMO: Auto-ML & AI in Dataflows

Page 28:

Dataflows Pro vs Premium

Dataflow Capability | Power BI Pro | Power BI Premium
Scheduled refresh | 8 per day | 48 per day
Total storage | 10 GB/user | 100 TB/node
Dataflow authoring with Power Query Online | + | +
Dataflow management within Power BI | + | +
Dataflows data connector in Power BI Desktop | + | +
Integration with Azure | + | +
Computed entities (in-storage transformations via M) | | +
New connectors | + | +
Dataflow incremental refresh | | +
Running on Power BI Premium capacity / parallel execution of transforms | | +
Dataflow linked entities | | +
Standardized schema / built-in support for the Common Data Model | + | +
AutoML and AI | | +
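To make the refresh limits concrete, the short calculation below shows how far apart refreshes would be if they were spread evenly over a day. The even spacing is an assumption made for illustration; the slide only states the daily limits, and the helper name `even_spacing` is hypothetical.

```python
from datetime import timedelta

# Refresh limits quoted on the slide (scheduled refreshes per day).
REFRESHES_PER_DAY = {"Power BI Pro": 8, "Power BI Premium": 48}


def even_spacing(refreshes_per_day):
    """Interval between refreshes if they are spread evenly over 24 hours."""
    return timedelta(days=1) / refreshes_per_day


for licence, limit in REFRESHES_PER_DAY.items():
    print(licence, even_spacing(limit))
```

Under that assumption, the Pro limit works out to one refresh every 3 hours and the Premium limit to one every 30 minutes.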

Page 29:

When and Why Dataflows?

• User friendly (Power Query)
• Reuse logic
• Auto orchestration
• Integration with Data Lake + CDM
• Auto-ML & AI

• Data discovery
• Simple transformations
• Quick action scenario

Page 30:

Why were dataflows born?

BI Team ML & AI Team

Business team

Development Cycle

Page 31:

Summary

• Power BI DataFlows
• Dataflows Components
• Differences between DataFlows and DataSets
• Computed and Linked Entities
• Azure Data Lake
• Data Lake + Power BI DataFlows Integration
• Common Data Model
• Power BI DataFlow and Common Data Model
• Dataflow with Databricks and Data Factory
• DataFlow vs Data Factory
• DataFlow Pro vs Premium
• AutoML and AI

Page 32:

Q&A

Thank you! Julio Granados [email protected]

Page 33:

Resources

https://go.microsoft.com/fwlink/?linkid=2011419&clcid=0x409
https://docs.microsoft.com/en-us/power-bi/service-dataflows-overview
https://docs.microsoft.com/en-us/business-applications-release-notes/October18/intelligence-platform/power-bi-service/self-service-data-prep
https://myignite.techcommunity.microsoft.com/sessions/65437
https://www.microsoft.com/en-us/businessapplicationssummit/video/BAS2018-2117
https://ssbipolar.com/category/power-bi/dataflows/
https://blog.crossjoin.co.uk/category/dataflows/
https://github.com/Azure-Samples/cdm-azure-data-services-integration/blob/master/Tutorial/CDM-Azure-Data-Services-Integration-Tutorial.md