Assessment of University of Washington Data Warehouse Brian Palmer and Laura Reeves CONNECT, The Knowledge Network July 15, 2010 1 Agenda • • • • • • About the Assessment – Scope and Process Current State of the EDW Future Vision for the EDW Moving the EDW Forward Key Recommendations and Next Steps Summary 2 About the Assessment 3 Assessment Scope The scope of this effort is to provide the University with an Enterprise Data Warehouse (EDW) Assessment Report to address the following: • • • • • • • Validate the strategy and provide additional recommendations on all aspects of data architecture and information delivery. Assess current technologies in use and provide recommendations for improvement. Evaluate the overall program approach including organization and support models, staffing levels, operations and project delivery processes. Assess program risks to the existing strategy, architecture and technologies. Assess progress made since the last review of the BI program. Evaluate proprietary data access control system. Evaluate the iStrategy feasibility study. The scope of this assessment did not include: • • Scoping and costing of specific projects (OPB, RADC, etc.) Identification of new opportunities or quantification of associated business value. 4 Assessment Process To achieve these goals, CONNECT has gathered insight for this report by: • Interviewing key stakeholders, users and team members of the current data warehouse to determine the business value they are receiving from today’s EDW, and to ascertain an understanding of their future Business Intelligence needs. • Evaluating current state data solutions including; technical environments, data architecture, and processes in use for the delivery of Business Intelligence • Reviewing documentation to understand the history, tactical plans, projects and long term strategy for Information Management placed in the context of the broader business and technical strategies of the university. • Interviewing Business Intelligence leaders from Peer Institutions to provide additional context to the assessment and associated recommendations. 5 The Current State of the EDW 6 Progress and Notable Accomplishments • Staffing – Have assembled a group of quality individuals able to provide a strong foundation for the team needed to move the EDW forward. • EDW Environment – Providing a stable and reliable environment where source system data can be accessed for operational reporting. – Deploying databases which are segregated, maintained, and being upgraded to SQL Server 2008 using new features such as partitioning. – Consolidated reporting technology onto the Microsoft product line. – Providing role-based security in accordance with federal and state data access control and privacy stipulations. – EDW data extracts feeding other data stores supporting additional reporting and analysis. – Progressing on plans for evolving the data architecture of the EDW. 7 Progress and Notable Accomplishments • Reporting – Highly disciplined process to create standard reports – Currently over 1,700 users accessing more than 150 standard reports – Created two financial analytical cubes with critical subsets of data to support flexible access and analysis. • Business Metadata – Metadata concepts are clearly understood as evidenced by the initial work and accountabilities of the Data Management Committee. • Support – DSS web site to communicate critical details about the EDW with links to DMC web pages with data definitions and security roles. • The site is one of the best that we have seen implemented. – Actively engaged with user issues through the Reporting Services SIG. 8 EDW Business Value to Date Consumers of EDW sourced data provided many positive insights on the value proposition of the existing EDW in terms of the data’s ability to support improved quality and efficiency of reporting for their business areas. Here are several examples that highlight this point: • Paper report replacement resulting in improved operational efficiency, making data available for use sooner, and reducing mainframe reporting overhead. Significant impact identified in the financial management decision support area. • Data mart / cube created to support financial analysis to review what has been identified as taxable (use/sales tax). This resulted in the identification of the potential refund of millions of dollars, per Purchasing and Accounting. • Overcoming operational systems shortcomings through structured reporting. Creating reports using a disciplined processes to ensure report quality, and providing them to users in a well documented reports library. Academic Advisement for Mathematics and Computer Science are examples of areas that have benefited from the EDW structured reporting. 9 Business Perception • While significant progress has been made, the current data warehouse capabilities are not able to meet user expectations. • The biggest challenges we heard from the business users of the EDW stated as simply as possible were: – We are unable to view information across the organization (i.e. lack of integrated data). – It is too hard to use. – It takes too long to get things done. – The source application systems are out-dated. – De-centralized organization and culture make it difficult to know who has what data, and how to get it. – Use of the data warehouse has been for financial and operational reporting rather than analysis. • While some changes are needed from an IT perspective, it is also important for the institution as a whole to: – Begin to shift towards using the DW to drive institutional effectiveness – Take a more analytical approach to decision making 10 Current Business Intelligence Challenge • Highly de-centralized organization and culture creates challenges. • Fragmented data environment with many “stove pipe” data marts. • Lack of integrated data and limited Business Intelligence tool capabilities make it time consuming and technically challenging to get information out. • Gaps exist in foundational components of organization and process. • Limited data definitions and inconsistent hierarchies impact ability to view across the organization. • EDW requires the addition of other data sources to provide a more complete view of the organization. • Fragmented support to users coming from 5 separate decision support teams outside of the EDW. • Campus systems limitation have resulted in a focus on data warehousing as an operational reporting environment, as opposed to focusing on analyzing institutional effectiveness. 11 Current EDW Data Architecture Source Systems Staging Layer Extract Raw Data HR-Payroll Systems Subset & Summarize HR Data User Access Transform & Warehouse Minimal Transformations HR Data ODS (Current Data) Structured Reports Finance FR Data Mart Financial Systems Financial Data Financial Data Structured Reports GL Data Mart Student Admin Sys Academic Data Academic Data Census Day Mart Academics Master Data HR Data Mart Structured Reports UWSDB (Student ) Current Structures Departmental and External Data Stores 12 The Future Vision for the EDW 13 Future Vision of Business Intelligence Criteria for the future EDW data architecture to provide a credible, consistent and reliable data environment: • • • • • • • • • • • • Integrates data from different source systems as well as departmental data. Supports different types of historical data including transaction detail, static snapshots (such as census day), etc. Continues to support the ability for operational inquiry and reporting. Provides mechanism to integrate departmental data w/core institutional data. Continues to provide appropriate security in the environment. Data organized to facilitate ease of use and good query performance. Is cost effective to use and to enhance (extensible). Supports timely development of new reports. Provides users with other flexible methods to access information with an emphasis on “self-service” and ad-hoc analytics. Employs data quality management practices. Provides transparency through business Metadata content and presentation. Has high performing incremental load of source data. 14 EDW Data Architecture Vision Source Systems HR-Payroll Systems Staging Layer Extract Raw Data HR-Payroll Data Financial Systems Financial Data Student Admin Sys Student Data Facilities Serv. Sys Research Admin Sys Integration Layer Integrate & Cleanse Integrated Presentation Layer User Access Business Intelligence Applications Transform for User Access HR Data Financial Data Academic Data Other Data HR, Financial, Academic, Facilities, Advancement, Other… Analytics Integrated Departmental Data Departmental Data Dashboards Facilities Data Departmental Data Entry Research Adm Data Scorecards Analytics Advancement Sys Advancement Data Other Systems Other Sys Data Current Structures New Structures Archive Dept Input Structured Reports ODS (Current Data) Structured Reports Departmental and External Data Stores 15 Business Value: Investing in the Future Vision Many opportunities exist to provide added value to the university: Gaining Greater Control Over Information Assets • The Institution has an obligation to the community and the individuals it serves to protect the information accumulated in the performance of its mission. The EDW can play a significant role in demonstrating the Institution’s commitment to using due diligence in its effort to maintain control of information assets. Improving Business Process through Data Integration & Analytics • Good decision making depends on the availability and use of common, consistent information. Investing in a well designed integrated data environment in the EDW accompanied by strong business metadata could significantly enhance the role of the EDW in moving the organization towards proactive decision making. Managing Costs • • Through the Return on Investment associated with EDW related business process improvement initiatives. Through the elimination of costs associated with the current highly decentralized and redundant information environment. 16 Moving The EDW Forward 17 Short-Term Approach to Business Intelligence • Incorporate recommendations into the current priority project work as first step towards the future vision architecture. • Leave much of the existing environment in place to continue to support any existing users and reports. • Focus on building out the data structures needed to provide flexible integrated institutional data: – Leverage the data as it is being sourced into the warehouse today, however, develop a new integration layer. – Build and Deploy a new integrated presentation layer based on dimensional data structures using conformed dimensions with appropriate history. – Purchase and deploy a “classical” Business Intelligence user interface to enable end-user self-service reporting and ad-hoc analytics (examples include MS ProClarity and Business Objects). 18 Short Term EDW Data Architecture User Access Source Systems Staging Layer Integration Layer Extract Raw Data HR-Payroll Systems Business Intelligence Applications Integrated Presentation Layer Transform for User Access Integrate HR Data Transform & Warehouse HR Data Financial Data Academic Data Fin, Academic & Select HR Data Analytics Dashboards HR Data Financial Systems Financial Data Subset & Summarize Student Admin Sys Student Data Select HR Data Financial Data Scorecards Academic Data Structured Reports Academic Data Production Data Extracts Archive UWSDB (Student ) ODS Current EDW Structures (Current Data) HR Data Mart Finance New EDW Structures Structured Reports Departmental and External Data Stores 19 People and Process Gaining true value from a data warehouse is more than moving data more efficiently. It is highly dependent upon the people and processes to support the entire environment. • Staffing Recommendations – Add specific roles to manage warehouse specific work including metadata management, testing, data modeling, and end-user support. – Fill these roles with people that have data warehouse experience. • Work Intake and Prioritization Recommendations – Recognize different types of work: Project, Report Development, Management of Access and Security, and Maintenance – Track the volume of requests and effort associated with involved with each type of work. – Develop a business prioritization process to set direction, manage priorities, measure the outcomes. – Allocate resources based upon priorities. 20 People and Process • End-User Support / Client Facing Services – End-user support must be recognized as key factor for wide-spread adoption of the data warehouse. – Support resources can report through business or IT. – Develop tiered support approach to meet the full range of departmental needs. – Address the need to integrate user support services across the five decision support teams outside of the EDW. – Training services should be expanded to include new BI tool functionality, data content, and metadata usage. • Several recommendations have also been made to begin addressing data quality and make refinements to the system development lifecycle. 21 Key Recommendations & Next Steps Our recommendations are targeted towards helping UW address these challenges in a realistic way given the current economic environment. The following high level recommendations summarize the assessment report details: • Business Partnership – Business Leadership is necessary for the EDW program to succeed. Clarify business accountabilities and make the commitments. – Establish practices that ensure priorities, expectations and commitments are clearly communicated throughout the organization. – Business constituents must be advocates for the EDW. • Initial Project and Staffing – Allocate resources at the level required to effectively complete data warehouse development projects: • Immediately acquire incremental resources to ensure successful completion of the OPB/ABB project. (Estimated as an additional 4.5 FTE’s) • Fill the critical roles that are currently absent from the EDW team. • Incorporate business expertise (Business Analysts) into project teams. • Concurrent projects would require additional incremental staff & increase risks. – External resources should mentor UW staff to reduce long term dependence. 22 Key Recommendations & Next Steps (2) • Data Architecture – Make it a priority to document and publish institutional data definitions, often called business metadata. – Leverage the data as it is being sourced into the warehouse today. – Develop a new integration layer to prepare the data for the presentation layer. – Extend the EDW data architecture to support self-service reporting and analysis . Specifically build a presentation layer with dimensional data structures that use conformed dimensions and appropriately handles history. – Re-engineer the beginning layers of the EDW data architecture in conjunction with the replacement of the source application systems. 23 Key Recommendations & Next Steps (3) • Data Delivery – Purchase and deploy a “classical” Business Intelligence user interface for the enablement of end-user self-service reporting and ad-hoc analytics. – Build and deploy a suite of starter business intelligence applications. These should provide flexible, parameter driven reports that can be easily modified on the fly to support analysis and decision making. • Deployment and Support – Sufficient resources must be allocated to focus on support and deployment of business intelligence (Client facing services ) provided in combination from business and IT. – Determine tiered support model and method for delivering coordinated support services. – Employ a work intake and prioritization mechanism. 24 Peer Institution Benchmarking • Conducted phone interviews and a survey with the following Peer Institutions: – – – – – University of Michigan Rensselear Polytechnic Institute (RPI) University of Illinois Seattle University Princeton University • University of Michigan Highlights – – – – Most comparable peer in terms of size and research DW started in 1994 Replaced core systems starting in 1998 (PeopleSoft) Provide reporting and analysis to a large audience • Business Objects has 5,500 users • M-Reports has 7,000 users – Internally developed report delivery interface for “guided analysis” – Large EDW Staff with 28.8 FTEs – Metadata Presentation Capabilities (delivered with each project) – Next generation DW expansion is on the books and in planning phase. 25 iStrategy Evaluation • iStrategy reviewed the UW environment to assess the feasibility, benefits and challenges of implementing their products (June, 2010) • CONNECT reviewed iStrategy assessment, but did not directly participate. – In general we agree with UW observations based on iStrategy’s extremely candid evaluation. – iStrategy provides strong user interface to facilitate reporting & analysis – iStrategy provides a library of higher education metrics which are likely to require extensive customization. – The iStrategy product line does not support an integrated data model – Core benefit of the iStrategy product line is their integration with third party applications. Since UW is not using these packages, this would not be a benefit. – Need to explore technical direction of their product line – currently using SQL Server 2005 and MS ProClarity 26 University of Michigan - iStrategy Experience University of Michigan implemented several iStrategy products. The critical lessons learned from their experience include: – Biggest benefit was to launch the DW team and expand their skill set into dimensional data structures. – Implemented using the University of Michigan developed interface. End users did not know they were using iStrategy. – Would suggest implementation without customizing to get your feet on the ground. Then customize as appropriate. – Are planning to move away from the iStrategy product with the implementation of their new integrated data warehouse architecture. – The vendor provided excellent guidance and consulting during the initial financial implementation project. 27 Summary / Conclusions 28 EDW Assessment Summary The University of Washington has been investing in the data warehouse environment for a number of years. While there has been progress, there are still many needs that are not being met with the current Enterprise Data Warehouse (EDW). • The current EDW does provide value to the institution. • There is a common and realistic understanding across the university of what the current EDW can do, as well as its limitations. • There is also an appreciation for the commitment, quality and work of the current EDW staff. • The DSS Team understands and acknowledges the challenges with the current EDW and has developed plans to address these issues. • The assessment validates that the plans are fundamentally sound and with the additional refinements suggested should move forward. 29 EDW Assessment Summary The University of Washington is poised to make significant strides in how the data warehouse is used across the institution. Critical factors to achieve this success include: • Business Commitment – Make the financial investment to implement the plans. – Gain executive support to ensure organizational alignment around a clearly defined set of priorities and goals. – Determine organizational roles with responsibility, accountability and resources to carry out the work. • IT Commitment – Execute the plans effectively – Facilitate the process to define roles and work associated – Provide the technical and data environment to empower the University. • Partnership – is crucial to achieving good results. 30