Uploaded by Sandeep Hipparagi

Data warehouse Project

advertisement
Project Requirements
ACME Traders, a ctitious speciality foods export/import company, keeps track of orders
placed by customers in a number of spreadsheets as follows:
• Customers.xlsx: Details of customers (companies).
• Locations.xlsx: Details of the location of customers (city...).
• Products.xlsx: Details of products traded.
• ProductCategories.xlsx: Details of the category of each product.
• Time.xlsx: Detailed date attributes corresponding to the range of dates of customer
orders.
• Orders.xlsx: Details of orders (order date and cost of freight) placed by customers.
• OrderDetails.xlsx: Details related to each order (quantity of each product contained in
an order).
As a rst step, ACME decided to design a data warehouse (star schema) that will host all
the source data. The SQL script of this data warehouse can be found in Moodle.
In your VM, go to Microsoft SQL Server, copy/paste the script into a new query window
and execute it. This will create three dimension tables (customer, Product and Time) as
well as a fact table (Sales).
The aim is to allow ACME to analyse sales amount (quantity x unit price) by customer,
product, and time.
Task 1 -ETL
Using Microsoft Visual Studio, create an SQL Server Integration Services (SSIS) ETL
project to load data from the source (Excel spreadsheets) into the data warehouse. Your
ETL should accommodate the following requirements:
(1) Newproductscanbeadded and existing products can be updated. In particular, the
product’s category name should be treated as a “changing” attribute, and (ii) the
product’s unit price as a “historical” attribute.
(2) New customers can be added, but it is anticipated that no updates will be necessary.
(3) New dates can be added, but it is anticipated that no updates will be necessary.
(4) New orders can be placed, and the sales amount of an existing order can be updated.
Note 1: When using a “Data Conversion” task in your ETLs, you may nd the following
resource1 useful to help you match between SSIS data types and SQL Server data types.
fi
fi
fi
Note 2: If you encounter an error when using an “Excel Source” task, try the following:
right-click your SSIS project, select “Properties”, then under “Con guration Properties”
select “Debugging” and switch “Run64BitRunTime” to False.
Task 2 – Multi-Dimensional Modelling
Using Microsoft Visual Studio, create a SQL Server Analysis Services (SSAS) project to
build a sales cube and three dimensions, customer, product and time, each with a
relevant hierarchy.
Tasks – format (Deliverables)
Submit a (Word or PDF) report containing the following:
Deliverables of Task 1:
A screenshot of the overall ETL work ow. In addition, for each ETL task,
include the following:
•
A screenshot of the successful (or otherwise) running of that task (showing
green ticks and the number of rows loaded).
•
A screenshot, from SQL Server Management Studio, that shows data
loaded in the table related to that task (with a count of rows).
•
A brief explanation of each work ow.
Deliverables of Task 2:
A screenshot of the cube, in browser mode, showing the total sales
amount for each month in 1997.
fi
•
fl
For each dimension: Three screenshots: (i) dimension hierarchy; (ii)
attribute relationships; and (iii) browser view.
fl
•
Related documents
Download