- BI Monkey

advertisement
Data Quality Services
+
What’s new in SSIS
in
SQL Server 2012
James Beresford
www.bimonkey.com
@BI_Monkey
But first….
Data Quality Services
SQL 2012 RC0 Edition
The Data Quality problem
!=
Hands up if you’ve been 2 years old
Your first data quality problem
Data Quality = Shape Sorting
There is Good Data
There is Bad Data
There is Repairable Data
Data Quality Services Terms
A Shape = Domain
Data Quality Services Terms
Collection of Shapes = Knowledge Base
Data Quality Services Terms
Shape Sorting = Domain Rule
The Data Quality Client
Allows you to:
• Create or Maintain
• Knowledge Bases
• Data Quality Projects
• Manage and Configure DQS
The Data Quality Client
Create a Knowledge Base
The Data Quality Client
Create a Knowledge Base
What did we see?
• Create a Knowledge Base from Data using
Knowledge Discovery
• Set values as Correct, Error or Invalid
• Teach the KB some simple auto correction using
Domain Rules
The Data Quality Client
Pop Quiz: Who owns a Knowledge Base?
Maintaining KB’s is done by:
a)
b)
c)
d)
Qualified Data Professionals
A BI Developer
Users
Tony Abbott
The Data Quality Client
Pop Quiz: Who owns a Knowledge Base?
Maintaining KB’s is done by:
a)
b)
c)
d)
Qualified Data Professionals
A BI Developer
Users
Tony Abbott
The Data Quality Client
Create a Composite Domain
+
n
The Data Quality Client
Create a Composite Domain
What did we see?
• Use values from one domain to affect another
The Data Quality Client
Other Domain Functions
Things that I won’t demonstrate
• Term Based Relations – autocorrect
substrings (e.g. Inc. > Incorporated)
• Reference Data – Validation against external
sources on Azure DataMarket, e.g. Melissa
Data
The Data Quality Client
Create a Data Quality Project
The Data Quality Client
Create a Data Quality Project
What did we see?
• Interactive Cleansing of data
• Export of cleansed data set
The Data Quality Client
Matching Policies
Fuzzy & Domain matching
• Matching Polices
• Exact – Must match to count in score
• Similar – Approximate match counts
• Prerequisite – Must match or match fails
• De-Duplication
Automation with SSIS
Automation with SSIS
The DQS Cleansing Task
What’s new in SSIS
Denali CTP3 Edition
What’s new in SSIS
Projects & Parameters
What’s new in SSIS
The SSIS Catalog
What’s new in SSIS
Developer Experience
• Visual Studio 2010 & .Net 4 support
• Undo / Redo
• Auto-save project recovery
What’s new in SSIS
Developer Experience
New Toolbox
What’s new in SSIS
Developer Experience
Expressions
• New Expressions
• LEFT
• TOKEN
• TOKENCOUNT
• Removal of 4000 character limit
What’s new in SSIS
Developer Experience
New Data Flow Column Mapper
What’s new in SSIS
Developer Experience
Expression Markers on icons
What’s new in SSIS
Developer Experience
Variable Window improvements
1. Package Scope by Default
2. Change Scope button
What’s new in SSIS
Developer Experience
Data Quality Services Integration – DQS Cleansing Task
What’s new in SSIS
Administration Experience
• Simplified deployment & configuration
• Automatic logging
• Management dashboard
• Built-in reports
• Improved SSMS management experience
• Advanced troubleshooting features
• Package performance analysis
What’s new in SSIS
SQL 2012 RC0 Edition
What’s new in SSIS
Developer Experience
Change Data Capture
What’s new in SSIS
Developer Experience
ODBC Support
• Source and Destination Adapters
What’s new in SSIS
Developer Experience
Connection Manager Changes
What’s new in SSIS
Developer Experience
Flat File Source Improvements
What’s new in SSIS
Developer Experience
•
•
•
•
Package Format Changes
Visual Studio Configurations
Scripting Improvements
Troubleshooting & Logging
What’s new in SSIS
Administration Experience
• Data Taps
• Server Management with PowerShell
What’s new in SSIS
Developer Experience
Other Changes
• Updated look for the Control Flow and Data Flow
• Pivot UI
• Row Count UI
• New Expression:
◦REPLACENULL
• BIDS is now SQL Server Data Tools
• Many small fixes and improvements based on CTP
feedback
Download