UCPath Common Data Mart

advertisement
The UCPath Common Data Mart
Submitter:
Al Course, Manager of Data Services, Albert.Course@ucop.edu, 510.987.0761
Project Leaders:
Donna Capraro
Al Course
UCLA Director of Information and Data Strategy
UCOP Senior Application Manager, Data Services
Core Team:
Deborah Hill
Hilda Duggan
Greg Hamilton
Micheal Schwartz
UCOP
UCLA
UCB
UCOP
Project Manager
Data Architect
Data Architect
Data Architect
Project Description:
The Challenge: Integrating UCPath into Campus Decision Support Systems
UC campuses have long-established data warehouse and business intelligence environments that
integrate data and deliver decision support derived from UC’s 35 year old Payroll and Personnel System
(PPS).
The current replacement of PPS with Oracle’s PeopleSoft Human Capital Management application
(UCPath) will not however “plug and play” into the existing PPS-based data warehouse and business
intelligence environments. Highly revised if not entirely new data warehouse and business intelligence
environments are required to integrate UCPath into campus-based decision support systems.
The total cost for campuses to separately provide solutions to this challenge is substantial. PeopleSoft is
new to most of UC and the application structure and operations must be independently learned by the
campuses. The learning process is complicated because PeopleSoft does not maintain referential
integrity or business rules in the database. Data and business rules are enforced by the application and
campuses are dependent on UCPath functional experts to understand these processes.
The estimated summed cost to UC of the separate campus efforts to integrate UCPath into their decision
support systems is over $10 million dollars (see Appendix A: Table A1: Total Costs without Collaboration).
The UCPath Common Data Mart – Collaboration
The Business Intelligence Collaboration Group (BICG) is a working subcommittee of the UC Information
Technology Leadership Council (ITLC). BICG membership is comprised of representatives across UC
campuses. The mission of BICG is to plan and coordinate UC-wide business intelligence programs
related to common administrative systems and to establish an integrated data management environment.
Members of BICG and other campus stakeholders approached BICG with the proposition that while
campuses are responsible for meeting their data warehouse and business intelligence requirements,
substantial efficiencies might be gained through a campus-wide collaborative design of a common
UCPath data warehouse.
BICG agreed to develop and promote the proposal to design an UCPath common data warehouse and
after several reviews and revisions the proposal for the “UCPath Common Data Mart” (CDM) was
submitted to ITLC and successfully approved for funding in March of 2014.
Cost and Funding
$375,000 of seed funding covering the project’s one year timeline was provided by the Office of the
President. $865,000 of contributed staff costs were provided primarily by UCLA and also by UCB and
UCOP.
Total Project Cost: $1,240,000. See Appendix A: Table A2: UCPath CDM Funding by Source and Stage.
Resources
1
Resources for the project included 6.0 FTE for the project leads, project manager, data architects, data
modeler, ETL mapping specialist and a reporting developer. Others included UCPath and campus
functional experts that were often called up to provide an understanding of PeopleSoft and UC
customizations. These contributions were not minor.
The Process – Innovation, Collaboration and Communication
The UCPath CDM was structured to ensure the efficiencies of collaboration among all participating
campuses. This required that the step-by-step procedures for designing and producing the deliverables
were well understood by the participating stakeholders, including, with some emphasis, the procedures
for communication.
All communication was channeled through a single point person, the Project Manager, who used the
UCPath CDM Response Tracker to log items. The Core Team reviewed the Response Tracker on a
weekly basis, providing responses to new issues and tracking the progress on outstanding issues. If
issues required call outs to UCPath and campus functional experts, then the Project Manager sent these
requests and was the contact person to receive and post the returned answers to the Response Tracker.
The Response Tracker was also formally reviewed throughout the project and extensively with all of the
stakeholders at each UCPath CDM Design Review Session.
This communication procedure resulted in reduced confusion, kept all stakeholders “in the loop,” and
facilitated cross-campus discussions and broad based stakeholder participation.
The general strategy that led us to our project deliverables is presented below.











target one subject area at a time – person, job, absence management, benefits, payroll,
commitment accounting, labor ledger, budget export, job profile management
capture and review business use cases from campuses by subject area
o seek assistance from UCPath functional experts as required to determine how UCPath
CDM can meet the requirements
begin design of logical dimensional model with ULCA’s initial dimensional model design
o review and revise design as required
o rename tables and attributes according to the agreed upon design standards
apply data naming abbreviation standards
capture entity and attribute definitions in data dictionary
develop and document ETL mapping specifications for all data
prepare sample reports
publish deliverables for review by all stakeholders
document review questions and comments in Response Tracker
o seek assistance from UCPath functional experts as required
schedule and hold UCPath CDM Design Review Session to review all deliverables by subject
area and review Response Tracker
make all final revisions to deliverables and publish to SharePoint
2
Deliverables
The UCPath CDM proved the feasibility and efficiencies of campus collaboration on a complex
system-wide project. The project reduced the collective as well as the local costs for the design of
UCPath data warehouse and business intelligence environments.
Over the course of its one year timeline UCPath CDM met and surpassed the project
deliverables. All the deliverables discussed below are published on the UCPath CDM
SharePoint.
https://sp2010.ucop.edu/sites/itlc/committees/bicollaboration/BI%20Collaborative%20Data%20Warehous
e%20Project/Forms/AllItems.aspx
Business Use Cases
256 distinct business use cases were received from UC campuses and were the requirements
foundation of the project. Business use cases were discussed and assessed by the Core Team
with the campuses through several meetings and during the UCPath CDM Design Review
Sessions. The design review sessions were held six times, three of which were face-to-face
meetings of all campus stakeholders and were held at UCLA (twice) and UCI.
Dimensional Models
Driven by the business use cases, 30 dimensional models were designed across the subject
areas of person, job, absence management, benefits, payroll, commitment accounting, labor
ledger, budget export, and job profile management; 100 total tables with over 3,000 columns.
The data models were delivered in multiple formats including ERwin so that campuses were able
to easily visualize and share the dimensional models with different audiences.
Common Data Mart Data Dictionary
Two data dictionaries were created to facilitate an understanding of the business meaning of the
data across all of the models. The UCPath CDM Data Dictionary provides definitions for 97
tables and 2,752 columns. The UCPath CDM Entity Definitions is an “at a glance” dictionary to go
with the “at a glance” entity only diagrams.
ETL Mapping Specifications
Over 160 ETL mapping specifications were developed to assist all campuses in the load of their
local UCPath CDM, providing maps of the data from the local ODS into the local UCPath CDM.
Sample Reports
Sample reports and associated reporting frameworks were developed so that campuses could
both test the dimensional models and begin developing other reports. It was agreed by campuses
that sharing reports and the code to query the UCPath CDM was a significant and would be a
continuing cost saving benefit.
Design Standards
Early in the project it was agreed that the known and generally accepted data warehouse
practices, terminology and architecture of Ralph Kimball would be adhered to throughout the
design of the UCPath CDM. This was beneficial to this project as it standardized communication
between participating stakeholders and set expectations regarding the format of the deliverables.
UCPath CDM Design Standards were also published for data naming, abbreviations and
definitions. Over 1000 standardized abbreviations were applied to derive the abbreviated column
names in the physical model.
These design standards established a common understanding across campus stakeholders of
the business meaning of the data in the UCPath CDM.
3
Response Tracker
The UCPath CDM Response Tracker was also developed to capture and track questions,
comments and issues regarding the UCPath CDM design. The availability of the Response
Tracker on SharePoint improved communication between campus stakeholders. The responses
that were received were discussed, resolved and shared with all stakeholders through regular
meetings and at the UCPath CDM Design Review Sessions. 249 total responses were received
and resolved.
DDL
The code (DDL) to create the UCPath CDM will be published in conjunction with the final
revisions to UCPath PeopleSoft at the end of SIT testing and just prior to the UCOP UCPath Go
Live. This publication will include the development and burst of masked data to all campuses.
The DDL will be burst for Oracle, SQLServer and IBM.
Implementation and Operation
To facilitate post project implementation both the UCLA Informatica Handbook, aka “RunBook”
and a UCPath CDM Location Implementation PowerPoint presentation were developed. Both
outline the steps required to implement and operate the UCPath CDM, from establishing and
loading the local ODS to building and loading the UCPath CDM as well as executing queries.
The Efficiencies of Collaboration
In the year prior to the UCPath CDM, UCLA designed initial dimensional models and Cognos report
frameworks for their perceived UCPath data warehouse and business intelligence environment. At that
time UCLA was considered the front runner for UCPath Go Live and this drove their initial work.
When the campuses agreed to move forward with the design of a common UCPath data warehouse,
UCLA contributed their initial dimensional models and reporting frameworks to this collaborative effort.
This was a major contribution to the project and one of the first realized gains of collaboration.
Some campuses such as UCSC, UCI and UCB had existing implementations of PeopleSoft applications.
Collaboration on UCPath CDM enabled these campuses to share their expertise and experience.
The UCPath CDM also drove the collaborative creation of data naming, abbreviation and data definition
standards. These may now be utilized across UC and will enable accurate UCPath reporting, the sharing
of UCPath reports, and provide a foundation for future collaboration across campuses.
Concrete Savings
The collaborative design of UCPath CDM constitutes 80% of the design and system integration work that
campuses would have to undertake individually. The project has resulted in significant collective savings
for UC and will enable campuses to efficiently implement local UCPath data warehouse and business
intelligence solutions.
The campuses agree that an estimated 2.0 FTE per year has been saved from UCPath data warehouse
and business intelligence design and development work the will be required to integrate UCPath into
campus decision support systems. This equates to total UC savings of $3,500,000. See Appendix A:
Table A3 - Cost Savings through Collaboration.
4
Conclusion
UCPath CDM used effective cross-functional collaboration to design a system-wide information
technology solution. The deliverables of the project are a collective product of numerous stakeholders
across all 10 UC campuses and UCOP.
UCPath CDM has been a collaborative success that may be emulated by future system-wide information
technology projects.
Appendix A: Calculations of Cost, Funding and Savings
Assumptions
1920 person hours for a yearly FTE
$91 average hourly rate of the 6 required FTEs
A1: Total Costs without Collaboration
Totals
Assumption: Person Hours Per Year
Per 1 FTE
Required FTEs Per Year/
Total Hours Per Year
Average Hourly Rate for FTEs/
Total Cost Per Campus
1920
6
11520
$ 91
$ 1,048,320
Total Cost for 10 Campuses
$10,483,200
A2: UCPath CDM Funding by Source and Stage
UCOP
- incremental costs specific to the UCPath CDM project
$
375,000
UCLA
- initial version 1 UCPath dimensional models designs prior to UCPath
CDM project
$
692,700
UCLA
- incremental costs specific to the UCPath CDM project
$
125,000
UCB
- incremental costs specific to the UCPath CDM project
$
23,650
UCOP
- incremental costs specific to the UCPath CDM project
$
23,650
$
1,240,000
Total Project Funding and Cost
A3: Cost Savings through Collaboration
Total
Person Hours per Year
FTEs
Average Hourly Rate
1920
1920
2
3840
$ 91
$ 349,440
Total for 9 Campuses and UCOP
$3,494,400
*UCLA not included as UCLA expended funds for full development of the solution
5
Download