Information and IT architecture development


APPLIED GSBPM IN GSO

OUTLINE

By Mr Do Ngoc Ha

GSO OF Viet Nam

Introduction

Develop Enterprise architecture (EA) for GSO

Business architecture development: Applied GSBPM for GSO

Information architecture development

Build application

Application architecture development

Technology architecture development

Finding and conclusion


The GSO's vertical structure supports consistent statistical management from central to local levels (HQ, 63 PSOs and 708 DSOs). Responsibilities are clearly divided between the GSO and the other institutions contributing to the Vietnam Statistical System, as specified in the National Statistical Indicator System (NSIS). Data collection systems based on reporting regimes, sample surveys (the GSO conducts 43 surveys every year) and administrative records are well developed. The GSO is adopting international standards in many fields of statistics. It is strong in certain statistical methodologies: methodologies are available for most domains, and metadata definitions follow standards and largely comply with the General Data Dissemination System.

However, the GSO has inherited weaknesses in the statistical production process. There are deficiencies in the implementation of the data collection systems and in technical aspects of statistical work: each department has its own collection methods, and data are not standardized across survey forms. The system of classifications is the weakest of the statistical tools for data production, dissemination and analysis. Developing databases, covering both data (micro and macro databases) and metadata, is difficult. GSO departments work in silos, which makes it hard to share and integrate data across departments, and there is no systematic process for integrating data across departments.

Following the “Vietnam Statistical Development Strategy 2011-2020 and Vision to 2030”, in 2011 the GSO issued a bidding package to build an Enterprise Architecture (EA) with support from the WB. The EA was based on the OIO framework and comprises the business architecture (BA), information architecture (IA), technology architecture (TA) and application architecture (AA). It follows four strategies: (i) Metadata-Driven Statistical Production; (ii) One Centralized Virtual Database; (iii) Synchronized Data Collection; (iv) Intelligent Statistical Analysis and Modeling.

Business architecture development - BA (Applied GSBPM for GSO): In 2011, the GSO set up a Core team of key persons from all departments to work with the EA contractor. The departments represented were:

Population and Labour Statistics Department

Agriculture, Forestry and Fishery Statistics Department

Industrial Statistics Department

Construction and Investment Capital Statistics Department


Trade and Service Statistics Department

Price Statistics Department

Socio-Environmental Statistics Department

National Account Department

Integrated Statistics Department

Statistical Standard Methodology and IT Department

The Core team and the EA contractor reviewed the business processes of all surveys in all departments. At a high level, the statistical processes are quite similar across departments and consist of 5 steps: (1) Survey Preparation / Formulation; (2) Data Collection; (3) Data Processing; (4) Data Analysis; (5) Information Dissemination.

The Eurostat SBPM version was chosen as the international standard reference.

The GSO SBPM was designed for all GSO surveys, sample surveys as well as censuses, with sub-processes defined and ordered to cover all steps and to suit all survey domains. In the first phase, the model was designed for all surveys in the National survey program; in the next phase, it will be reviewed for other statistical sources such as the integrated reporting regimes and administrative records. To support the GSO strategies and the BA effectively, the process model for the statistical production lifecycle is metadata-driven.


[Figure: GSO SBPM. Seven phases (Identify needs; Prepare collection; Data collection; Data processing; Data analysis; Information dissemination; Archive), each with its sub-processes, together with Quality management / Metadata management. Sub-processes marked * in the figure: Develop survey sample frame; Develop survey plan; Restore survey data; Conduct evaluation of process.]

1) Identify needs
i. Collect and consolidate information needs; check available information
ii. Identify the information that needs to be collected
iii. Develop survey sample frame: identify the objective and scope of the information; design the sample survey

2) Prepare collection
i. Develop survey plan: survey methodologies; questionnaire design; choice of data collection method; resource plan (finance, human, equipment); checking and supervisory plans; pilot, if any
ii. Build metadata: refer to the metadata of previous survey plans
iii. Create output template
iv. Review survey plan: refer to the metadata checklist and ensure that all details are included
v. Conduct survey training

3) Data collection
i. Interview for collecting data, and supervise
ii. Correct data
iii. Code and classify data
iv. Consolidate collected data
v. Input data

4) Data processing
i. Perform data cleaning and verification
ii. Calculate and integrate data
iii. Calculate weight

5) Data analysis
i. Analyze entire survey data
ii. Calculate report
iii. Verify survey result
iv. Produce final report

6) Information dissemination
i. Draft publishing report
ii. Review publishing report
iii. Produce and publish statistical product, including the soft copy (GSO website)

7) Archive
i. Restore survey data: in addition to storing the survey data after the data entry, processing and analysis steps, this sub-process categorizes the data and builds a data warehouse to support searching for statistical information. It also digitizes statistical information held in document format, using the Data Documentation Initiative (DDI) standard. The questionnaires of several surveys were digitized where data entry used scanning technology.
ii. Conduct evaluation of process
iii. Conduct evaluation of end users
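The phase and sub-process list above can be written down as a small data structure, which is one way a metadata-driven application might refer to steps by identifier. The following is a minimal sketch only: the class name `Phase`, the tuple `GSO_SBPM` and the helper functions are hypothetical and are not taken from the GSO systems; only the phase and sub-process names come from the list above.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Phase:
    """One phase of the GSO SBPM (hypothetical representation)."""
    number: int
    name: str
    sub_processes: Tuple[str, ...]


# Phase and sub-process names follow the list in the text.
GSO_SBPM = (
    Phase(1, "Identify needs", (
        "Collect and consolidate information needs",
        "Identify information to be collected",
        "Develop survey sample frame",
    )),
    Phase(2, "Prepare collection", (
        "Develop survey plan",
        "Build metadata",
        "Create output template",
        "Review survey plan",
        "Conduct survey training",
    )),
    Phase(3, "Data collection", (
        "Interview for collecting data and supervise",
        "Correct data",
        "Code and classify data",
        "Consolidate collected data",
        "Input data",
    )),
    Phase(4, "Data processing", (
        "Perform data cleaning and verification",
        "Calculate and integrate data",
        "Calculate weight",
    )),
    Phase(5, "Data analysis", (
        "Analyze entire survey data",
        "Calculate report",
        "Verify survey result",
        "Produce final report",
    )),
    Phase(6, "Information dissemination", (
        "Draft publishing report",
        "Review publishing report",
        "Produce and publish statistical product",
    )),
    Phase(7, "Archive", (
        "Restore survey data",
        "Conduct evaluation of process",
        "Conduct evaluation of end users",
    )),
)


def find_phase(sub_process: str) -> Phase:
    """Return the phase that contains the given sub-process name."""
    for phase in GSO_SBPM:
        if sub_process in phase.sub_processes:
            return phase
    raise KeyError(sub_process)


def sub_process_id(sub_process: str) -> str:
    """Return an identifier such as '3.5' (phase number, position in phase)."""
    phase = find_phase(sub_process)
    return f"{phase.number}.{phase.sub_processes.index(sub_process) + 1}"
```

For example, `sub_process_id("Input data")` places that sub-process in the third phase, matching the customization described in the text.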

Several sub-steps were customized to suit the GSO. The metadata building step is emphasized to ensure data linkage and to avoid conflicting data between different surveys. Some steps, such as correcting data and coding and classifying data, are placed in the third step rather than in the data processing step, because the third step is carried out by statisticians without IT skills, whereas the fourth step is carried out by staff with IT skills. The “input data” step is also placed in the third step so that it suits both ways of collecting data: the classic way and the way supported by mobile devices.

Information and IT architecture development - IA: Statistical information is owned by different departments and was supported by more than 35 silo applications, many of which execute similar sets of business processes and none of which are integrated. Statistical information needs to be standardized, centralized and made to follow a single information flow under GSBPM. The GSO chose GSIM as the implementation approach. A metadata-driven approach is the way to standardize statistical indicators and information across different domains, to improve communication between the steps of statistical production, and to organize and share micro and macro data between domains.
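As an illustration of what "metadata-driven" can mean in practice, the sketch below checks records from any survey against one shared set of variable definitions, so that two departments cannot use conflicting codes for the same variable. It is a sketch under stated assumptions: the names `VariableDefinition`, `REGISTRY` and `validate`, and the example variables and codes, are hypothetical and do not describe the actual GSO metadata model or GSIM objects.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Optional


@dataclass(frozen=True)
class VariableDefinition:
    """A shared definition of one statistical variable (hypothetical)."""
    name: str
    label: str
    # None means any value is accepted; otherwise values must be in the code list.
    allowed_codes: Optional[FrozenSet[str]] = None


# One registry shared by all surveys. Entries and codes are illustrative only.
REGISTRY: Dict[str, VariableDefinition] = {
    "province_code": VariableDefinition(
        "province_code", "Province (illustrative code list)", frozenset({"01", "79"})
    ),
    "employed": VariableDefinition("employed", "Number of employed persons"),
}


def validate(record: Dict[str, str]) -> List[str]:
    """Return a list of problems found when checking a record against the registry."""
    problems = []
    for key, value in record.items():
        definition = REGISTRY.get(key)
        if definition is None:
            problems.append(f"{key}: not defined in the shared metadata")
        elif (definition.allowed_codes is not None
              and value not in definition.allowed_codes):
            problems.append(f"{key}: code {value!r} is not in the shared classification")
    return problems
```

A record that uses only registered variables and codes yields an empty list; a record with an unregistered variable or an unknown code yields one message per problem, whichever survey it came from.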

Therefore, GSO departments and PSOs should interface with just ONE (1) GSO IT system. This is the reason for introducing a configurable workspace application (CWA). The CWA is a virtual workspace function in which each GSO department/PSO is given access to its own segment of the GSO Integrated Applications System, which contains all the applications shown in the green box of the accompanying figure. The virtual workspaces share the same meta-application, but each is configurable to fit its users' needs. Data are also allocated to each application segment with controlled access privileges, which ensures data privacy while maintaining the integrated data system at the database level. These virtual workspaces can be created, deleted and controlled from a single management interface.
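The workspace idea described above (one management interface, per-unit segments, controlled access to data) might be sketched as follows. This is an illustration under assumptions, not the CWA design: the class `WorkspaceManager`, its method names and the example dataset names are all hypothetical.

```python
from typing import Dict, Iterable, Set


class WorkspaceManager:
    """Single management interface for virtual workspaces (hypothetical sketch)."""

    def __init__(self) -> None:
        # Maps a unit (department or PSO) to the datasets it may access.
        self._workspaces: Dict[str, Set[str]] = {}

    def create(self, unit: str, datasets: Iterable[str] = ()) -> None:
        """Create a workspace for a unit with an initial set of accessible datasets."""
        if unit in self._workspaces:
            raise ValueError(f"workspace already exists: {unit}")
        self._workspaces[unit] = set(datasets)

    def delete(self, unit: str) -> None:
        """Delete a unit's workspace; the underlying data store is not touched."""
        del self._workspaces[unit]

    def grant(self, unit: str, dataset: str) -> None:
        """Give a unit access to one more dataset."""
        self._workspaces[unit].add(dataset)

    def can_access(self, unit: str, dataset: str) -> bool:
        """Check whether a unit's workspace has access to a dataset."""
        return dataset in self._workspaces.get(unit, set())


# Usage with illustrative names
manager = WorkspaceManager()
manager.create("Price Statistics Department", ["price_survey"])
manager.create("PSO-A")
manager.grant("PSO-A", "price_survey")
```

Keeping the access sets separate from the data itself mirrors the point in the text: privileges are controlled per workspace while the data stay integrated at the database level.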


Application architecture development - AA: The purpose of the GSO AA is to design the best architecture alternatives to serve GSO business needs. The AA sets out the principles that govern the development, maintenance and use of the enterprise architecture, together with related guidance for designing and developing information systems. The three most important principles are: (1) Open, service-oriented Application Architecture; (2) Metadata-driven; (3) One Centralized Virtual Database. The AA chose the DDI and SDMX standards as the approach for developing the statistical applications.

The AA created a GSO integrated IT platform framework. All users will interface via the unified Configurable Workspace Applications and Portal, where all applications will be hosted. All data will be hosted in the GSO Integrated Data Store and made available via Data Services. System administration is centralized through the use of Common Communication Services.

The GSO built the System of statistic information collection (SSIC) and Statistical Hub (SH) applications to support all steps of the GSBPM.


This figure describes the workflow of the application supporting GSBPM. The application has 2 main components: the System of statistic information collection (SSIC) and the Statistical Hub (SH).

SSIC supports the first three steps:

Identify needs (except metadata management, which is done in SH and integrated with SSIC)

Prepare collection

Data collection

Data entry forms are designed in a simple way, by dropping questions into the questionnaire, and this is done by statisticians without IT skills. Data will be entered online by the PSOs (province level).

SH supports the last four steps:

Data processing

Data analysis

Information dissemination


Archive

All survey data are centralized, and the system helps end users share data easily between domains and between departments. Reports are published directly through the intranet portal after validation.
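The division of work between the two components can be summarized as a simple lookup. The sketch below is illustrative only: the mapping and the function `component_for` are hypothetical, and the rule that any metadata sub-process is handled in SH is one reading of the remark that metadata management is done in SH and integrated with SSIC.

```python
from typing import Optional

# Which component supports each GSBPM step, as listed in the text.
PHASE_TO_COMPONENT = {
    "Identify needs": "SSIC",
    "Prepare collection": "SSIC",
    "Data collection": "SSIC",
    "Data processing": "SH",
    "Data analysis": "SH",
    "Information dissemination": "SH",
    "Archive": "SH",
}


def component_for(phase: str, sub_process: Optional[str] = None) -> str:
    """Return the component ('SSIC' or 'SH') that supports a step.

    Assumption: sub-processes whose name mentions metadata are handled in SH,
    whichever phase they belong to.
    """
    if sub_process is not None and "metadata" in sub_process.lower():
        return "SH"
    return PHASE_TO_COMPONENT[phase]
```

For instance, `component_for("Data collection")` gives `"SSIC"`, while `component_for("Prepare collection", "Build metadata")` gives `"SH"` under the assumption stated above.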

Technology architecture (TA) development: The TA describes the technology components and platforms used as reference in building the GSO IT infrastructure.

The process of deriving the TA involves correlating information that has been built up in the Business, Information and Application Architecture analyses.

The TA develops a Technical Reference Model, a hierarchical model used to categorize and classify available technologies. Each service defines the group of standards and technologies that can be used in the GSO.

Finding and conclusion

1) Establish a Core team of key persons from all the different departments who understand the current business processes and can describe the situation and the problems that need to be improved.

2) Choose a model as reference and customize it. At the time we chose the Eurostat GSBPM; version 5.0 was released in 2013.

3) Require an application that supports all the main steps of the GSBPM, to keep business processes uniform across domains and departments.

4) Standardizing the statistical data is the focus of the work and takes time.

5) The metadata-driven approach is the key to connecting all components and statistical indicators, so that we can share data and communicate more easily.
