Day02_03_WeeHyong Tok_Data Factory_Integration

advertisement
Titanium Sponsors
Platinum Sponsors
Extract
Original Data
Transform
ETL Tool
(SSIS, etc)
Load
Transformed
Data
EDW
(SQL Svr, Teradata, etc)
BI Tools
Data Marts
Data Lake(s)
Dashboards
Apps
Extract
Original Data
Transform
ETL Tool
(SSIS, etc)
Load
Transformed
Data
EDW
(SQL Svr, Teradata, etc)
BI Tools
Data Marts
Data Lake(s)
Dashboards
Ingest (EL)
Original Data
Apps
Extract
Original Data
Transform
ETL Tool
(SSIS, etc)
Load
Transformed
Data
EDW
(SQL Svr, Teradata, etc)
BI Tools
Data Marts
Data Lake(s)
Dashboards
Ingest (EL)
Original Data
Scale-out
Storage &
Compute
(HDFS, Blob Storage,
etc)
Streaming data
Transform & Load
Apps
Extract
Original Data
Transform
ETL Tool
(SSIS, etc)
Load
Transformed
Data
EDW
(SQL Svr, Teradata, etc)
BI Tools
Data Marts
Data Lake(s)
Dashboards
Ingest (EL)
Original Data
Scale-out
Storage &
Compute
(HDFS, Blob Storage,
etc)
Streaming data
Transform & Load
Apps
Azure Data Factory
Capabilities at Public Preview
• Compose storage, movement, and processing services into data pipelines
• Initial data sources
• SQL Server, SQL Server in IaaS Virtual Machines, Azure SQL Database,
Azure Blobs, and Azure Tables
• Initial processing services
• Hive, Pig, C# code running on HDInsight
• hybrid data movement
• PowerShell developer experience for pipeline composition and deployment
• Cluster management for on-demand or bring your own
• Rich visual monitoring experience for a single view of all pipelines and
datasets that provides lineage dependencies, health monitoring, and error
identification
• Consume datasets by BI tools and applications
Example: customer profiling, game analytics
Example: customer profiling, game analytics
Copy new users
to blob storage
Xbox New
Users
Daily
Game New
Users
Game Activity
Per Week
Join and aggregate
activity per week
and user table
Weekly
New User Activity Per
Week
Privacy: Contains PII
Refresh: Weekly, Mon
by 8AM
All data, and all systems
Download