DATA ARCHITECTURE, WAREHOUSING

advertisement
DATA ARCHITECTURE, WAREHOUSING , MODELLING
Note: A test on each and every topics mentioned
1)
2)
3)
4)
5)
6)
7)
8)
9)
10)
11)
12)
13)
14)
15)
16)
17)
18)
19)
20)
21)
22)
23)
SQL overview ( 2 hr)
Concept of Keys ( 2 hr)
Basic Queries ( 2 hr)
Linking the tables ( Joins) (2 hr)
Normalization in a database ( 1 hr)
PL/SQL ( Functions, Triggers and Stored Procedures) ( 8 hr)
Data Warehousing – Overview ( 2 hr)
Readiness Factors ( 2 hr)
Business Value in data warehousing ( 2hr)
Components/Architecture of typical data warehouse ( 4 hr)
Data acquisition and cleansing ( 1 hr)
Data Warehouse vs Data Marts ( 1 hr)
Technology Selection ( 2 hr)
Metadata ( 1 hr)
Differences between different approaches of DW ( 2 hr)
Best Practices ( 1 hr)
Principles of Extract, Load and Transformation ( 8 hr)
Informatica Power Center and Power Mart ( 24 hr)
Class Room Exercises ( 40 hr)
Reports ( Business Objects) 4 hr
Detailed exams ( 4 hr open book)
Tips and Tricks for Interview ( 2 hr)
Resume Preparation and Mock Interviews ( 4 hr)
Detailed DM curriculum:
1. Covered the entire concept of data modeling ( 4 hr)
 What is data model
 Data Model Development Cycle
 Why data model is necessary
 Advantages/ uses of data model
 Building Conceptual/ Logical/ Physical models
 Normalization and normal forms
 Entities, Attributes, Relationships and relationship keys
 Relationship cardinality
 Recursive relationship
 Different types of entities and attributes







Tables, columns, Contraints, etc
Sypertype/ Subtype concepts
Detailed Review Process for logical and Physical models
Forward and Reverse engineering
Compare, update model and alter table functionality
Subject Areas
Generating Reports
2. Briefed about DataMarts and Datawarehouse ( 2 hr)
 Data warehouse Architecture
 FACT and Dim tables
 OLTP and OLAP
 STAR and SNOWFLAKE schema
3. Data Model examples ( 8 hr)
Online transaction scenario
Student – course scenario
Library scenario
Mortgage loans scenario
Insurance
4. Generic things ( 2 hr)
How to gather information for building the data model
Metadata concept
5. Interview ( 2 hr)
How to face the interview
How to respond for the questions when they don't know the answer
Interview tips
Share the data models which I'm currently working on.
BUSINESS ANALYST
1. Introduction to Business Analysis
2. Role of a Business Analyst in total Project Life Cycle
3. Requirement gathering process.
4. Writing the Business requirements with practical
templates
5. Converting user requirements into Business
requirements with practical
examples.
6. How to draw use cases
7. How to draw UML diagrams
8. Analyzing the Business Scenarios
INFORMATICA
Data Warehousing Basics





What is Data Warehousing
Why Use a Data Warehouse ?
Available Tools in the Market
Key Terms
Architectural Concepts
o Primary Components
o What is ETL ?
o Data Cleansing
o Data Merging
o Data Aggregation
o Data Scrubbing
Data Warehousing Concepts




Historic Data
o Current Data & Historical
o Complete History
o Limited/Unlimited Number of Histories
Load Types
o First Time Load ( Initial Extraction )
o Delta Load
o Full Load
o Incremental Loads
Schema Types
o Star Schema
o Snowflake Schema
o Integrated Star Schema
o Advantages of Each Schema
Dimensional Modeling Techniques
o Conceptual Modeling
o Logical Modeling
o Physical Modeling
o Named Input
o Formatted Input
Data Warehousing Concepts .. Continued

Data marts
o Concept
o Dependent vs Independent Data Mart
o



Enterprise Warehouse Schema
Implementation Methodology
o Kimball Methodology
o Inmon Methodology
Facts
o Additive Facts
o Semi Additive Facts
o Non Additive Facts
o Measures
o HPQs
Dimension Table Types
o Slowly changing Dimensions
o Type 1/2/3 Dimensions
o Zunc Dimensions
o Confirmed Dimensions
o Need of Time Dimension Comparison
Data Warehousing Concepts.. Continued











Why is BI (Business Intelligence ) Required ?
o OLTP
o DSS
o OLAP
o Types of OLAP
 ROLAP
 MOLAP
 HOLAP
 DOLAP
o DDE Triplet
Reporting Architecture
Database Layer
Interface Layer
Server Layer
Presentation layer
BI Server Systems
BI Client Systems and Functionalities
Hieracheis
Need of Hierarchies in Reporting
Need for ODS in Data Warehousing
Informatica

Informatica Concepts
o Basic Concepts
o Key Terms
o Product Architecture
o
o
o


Various Tools and thier roles
Power Center vs Power Mart
Informatica Server Configuration
Informatica Designer
o Basic Concepts
o Using the Designer
o Working With Sources
o Working with Flat Files, Targets
o Mappings, Transformations, Mapplets
o Working with Debugger
o Implementing Type I , Type 2 , Type 3 mappings
o Loading the Data into different Fact Tables
Transformations
o Aggregator Transformations
o XML Transformations
o Expression
o Filter
o Joiner
o Loook up Transformations
o Rank, Router
o Sequence Generator
o Sorter Transformation
o Source Qualifier Transformation
o Stored Procedure Transformation
o Transaction Control Transformation
o Union Transformation
o Update Stretegy Transformation
o Connected vs Unconnected Transformations
Informatica

Workflow
o Understanding the Server Architecture
o Working with Workflows
o Working with task assignment, command, decision, control, eventraise,
event wait, timer, email, session task
o working with work ltes
o working with sessions
o understanding commit points
o recovering data
o monitoring workflows
o using multiple servers
o log files
o row error logging
o session parameters
o using incremental aggregation

Other Implementation Considerations
o Understanding the Server Architecture
o Working with Workflows
o Working with task assignment, command, decision, control, eventraise,
event wait, timer, email, session task
o working with work ltes
o working with sessions
o understanding commit points
o recovering data
o monitoring workflows
o using multiple servers
o log files
IBM Datastage
DBMS principles and basic SQL
Data Warehouse concepts
Datastage 8 architecture
Datastage archtecture differences (7.5+/8)
DataStage client components(Desginer,Director, Administrator)
Datastage version differences
Datastage features
Datastage Users
Datastage Active stages/Passive stages.
stages: aggregator,transformer,funnel,copy,remove
duplicates,sort,join,lookup,merge,funnel,column import,column export,change
capture,change apply,sequence
generator,modify,filter,pivot,head,tail,sample,peak,column generator,row
generator,sequential file,dataset,fileset,annotation,xml input stage/xml output stage
Datastage Job types(Server/Parallel)
Job properties and Parameters sets.
Containers
Job Sequence
Troubleshooting DataStage Jobs
Development Best Practice & Standard
Performance Tuning Techniques
Real time exercise using Live environment
500 Interview Questions
Web Logic Administration
WebLogic Server Administration
Overview and Setting up a WebLogic Server Environment
·
Install the BEA WebLogic Server 9.1
·
Learn how to create a new custom domain template
·
Create a new domain using the custom domain template
Lab Exercises
·
Install BEA Software
Configure a Domain Template
Configuring a WebLogic Server Environment
·
Become familiar with the Administration Console
·
Practice modifying properties of WebLogic Server through the Administration
Console
·
Configure and view log file entries for a WebLogic Server using the
Administration Console
·
Configure the WebLogic Server using WLST
Lab Exercises
·
Using the Administration Console
·
Using the Command Line
.NET Developer
Overview of .NET Framework Programming
Introducing C# Programs
Data Types in C#
Operators and Expressions
Control Structures
Object Oriented Programming
Classes
Methods, Properties and Operators
Characters and Strings, Arrays and Indexers
Virtual Methods and Polymorphism
Formatting and Conversion
Exceptions and Interfaces
.NET Interfaces and Collections
Delegates and Events
Introduction to Windows Forms and New Features
SHAREPOINT DEVELOPER/ADMIN
Understand IIS Web Server and hosting websites in IIS.
Install and configure all the required software’s products to work with SharePoint 2010.
Create SharePoint Web Applications, Site Collections, Sites and Sub sites.
Create Custom Lists and Libraries.
Understand and use out of the box lists and libraries to build the SharePoint site.
Manage permissions for Active directory or users in database based on Forms Based
Authentication.
Manage large amount of data and documents using Content management features like
document sets, document IDs & content organizer.
Configure and manage user profile service.
Implement Search Features in the site.
Create forms for lists and libraries using InfoPath.
Use Excel for designing and publishing reports using excel service.
Customize most widely used sections of the SharePoint API (site collections, sites, lists,
lists items, list queries, fields, user profiles, web applications).
Use out of the box workflow management and also be able to develop custom workflow
solutions for SharePoint.
Create custom event handlers.
Use features and solutions for deploying projects in SharePoint Server 2010.
Build and deploy web parts.
Develop Sandboxed solutions with limited access.
Access external data using BCS.
Use VS.NET 2010 to Create custom lists that use custom content types.
IBM Websphere Administration
1)WebSphere Overview and new features
Architechure overview
Toplogy and terminalogy
Topology and terminalogy - Express
Main themes, product packaging and development tools
Overview of new features
2.WebSphere Installation and migration
installation and migration
Installation overview
Profiles - details
V6 Install - Network deployment and express
Web server plug-in installation
Websphere Application Server V6 migration
3. WebSphere system management
System management
System management architecture
Resource scoping
Configuration repository
File synchronization
Administrative clients overview
Browser-based administrative console
Start/stop/monitor processes
Other commands
Build cell - add / remove nodes
Cell,deployment manager, node and node agent
Export and import configuration archives
Manage generic servers
Manage node groups
Application management overview
Application management install and uninstall
Application management Lab - install application
Managed application resources - enhanced EARs
Lab - enhanced EAR files
Fine -grained application update
Lab - application update
JDBC management
Manage Web server nodes
WRD Overview
WRD deployment automation
Administrative security
4.WebSphere Security
Security Overview and architecture
J2EE security for applications
Java 2 security for applications
CSIv2 security
JACC
Security for system administrator
Authentication mechanism and user reqistries
Adding security fir J2C/JDBC resources
Adding security for JMS resources
5.WebSphere Service integration technologies
Service integration technologies
Overview
Architecture
Configuration service integration bus resources
Configuring JMS resources for service integration bus
Workload management(WLM) and high availability (HA)
Mediation
6.WebSphere Workload management and high availability
workload management and high availability
Administrative details
Data replication services(DRS)
High Availability Overview
High Availability Details
7.WebSphere Runtime functions
classloader and jndi
lassloader Overview
Details
Examples
Problem determination and best practice
Dynamic application reload
JNDI Overview
JNDI Basics
JNDI Advanced
JNDI Examples
JNDI Problem determination
Transport channel service
8.WebSphere Performance
performance and demonstrations
Application server performance overview
Websphere Application Server dynamic cache
Performance advisor and Tivoli Performance Viewer(TPV)
Request metrics
9.WebSphere Problem determination
Problem determination
Overview - where to start
Log files overview
Loading and tracing
Log and trace analyzer
Hung thread detection
Generating an IBM heap dump
Connection leak diagnostics
SAP FI /BW
Data warehouse fundamentals
• Introduction to SAP R/3 ( ECC ) and BW
• BW architecture
• Multi dimensional Modeling
• Star Schema, BW extended Star Schema
• Administrator Work Bench
• InfoObjects ( attributes and texts and hierarchies)
• InfoCubes, Transactional ( Real-time ) InfoCubes,
• ODS ( DSO) – Standard, Transactional and Write Optimized
• PSA, Source Systems
• InfoSource
• InfoPackage
• Transfer Rules
• Update Rules
• Transformation
• Data Transfer Process
• Loading Master Data using flat files
• Loading transaction data using flat files
• ODS - Change Log, Active Table and New Table
• Managing Delta using ODS
• MultiProviders
• Process Chains
• Open Hub service
• Data Marts
• InfoSpokes
• Open Hub Destination
• Aggregates
• Activating the Business content
• Loading Master data from ECC
• Delta Management
• Various types of R3 Extractors
• Extraction from LO Cockpit
• User Exits
• SAP landscape and Transports
• Real Time Data Acquisition
• BEx reporting – BEx Query Designer, Calculated and Restricted Key
Figures, Variables ( Characteristic, Formula and Text Variables ),
Query properties, Exceptions and Conditions, BEx Analyzer,
Workbooks, Report to Report Interface ( RRI )
• Information Broadcasting
Download