CSE
5810
Zhitong Fei
Computer Science & Engineering Department
The University of Connecticut
CSE5810: Introduction to Biomedical Informatics
Professor: Steven A. Demurjian’s
CSE
5810
1. Introduction
2 Background and Motivation
3 Big Data for Biomedical Informatics
4 Information Security and Privacy
5 Conclusion
Introduction
CSE
5810
What is “Cloud Computing”?
A model for enabling ubiquitous, convenient, on-demand access to a shared pool of configurable computing resources.
To provide virtualized IT resources as cloud services by using the Internet technology.
Current Situation in Health Care (IOM 2005):
Limited collaboration, coordination in Health Care.
Outcome (IOM 2007):
High costs and inefficient patient treatment.
Medical errors and increased adverse drug events.
Redundancy of clinical data and medical actions.
3
CSE
5810
Introduction
4
CSE
5810
Why Cloud
Resource outsourcing, utility computing, Large numbers of machines, automated resource management, virtualization and parallel computing.
A style of computing in which dynamically scalable and often virtualized resources are provided as a service over the Internet.
High-Performance Computing(HPC)
Lower cost
Virtualization technology
Parallel characteristics
5
CSE
5810
Cloud Computing components
cloud infrastructure, cloud platform and cloud application
6
CSE
5810
Background & Motivation
Classic Cloud architecture
Elastic Compute Cloud (EC2), Simple Storage
Service (S3), and Simple Queue Service (SQS).
7
Background & Motivation
CSE
5810
Cost
Efficient, Economical, variable cost basis
An comparison experiment :
Amazon cloud vs Local Compute Cluster
8
Background & Motivation
CSE
5810 o Could not use a 3-year amortized cost estimate for the cloud-based system.
o substantial delay required to purchase and install a local cluster was not taken into account.
9
CSE
5810
Big Data for Biomedical Informatics
10
Big Data for Biomedical Informatics
CSE
5810
Big Data: Why Bother?
“Big data are data whose scale, diversity, and complexity require new architecture, techniques, algorithms, and analytics to manage it and extract value and hidden knowledge from it.”
11
Big Data for Biomedical Informatics
CSE
5810
Biomedical Informatics
Genomics, Proteomics, Metabolomics
Eg. 1000 Genomes Project, up to 200 terabytes.
Imaging informatics: at the level of tissues and organs
Breakthroughs in diagnosis, prognosis, and highquality healthcare
12
Big Data for Biomedical Informatics
CSE
5810
Cloud Computing and Big Data technologies
Cloud storages allow large-scale
Virtualisation technology -- Virtual machine
A Hypervisor, a virtualisation management layer
13
Big Data for Biomedical Informatics
CSE
5810
Task Scheduling for Cloud computing
Satisfy cloud users and improve profits of cloud providers
Cluster computing: minimize the completion time
Grid computing: improvement of specific performance metrics
14
CSE
5810
Information Security and Privacy
15
Information Security and Privacy
CSE
5810
Security Risks
Access, Availability, Network load, Integrity, Data
Security, Data Location, Data Segregation.
Authenticated users only
Overload and High network load
Natural disasters
Flexibility and scalability
Data integrity
Cloud Security Control
Deterrent Controls, Prevention Controls,
Detective Controls, Corrective controls.
16
CSE
5810
Information Security and Privacy
17
Information Security and Privacy
CSE
5810
Additional Security in the Cloud
Public encryption key and Private decryption key.
Restricted users accounts
Private clouds, encrypted file systems, and encrypted data volumes
18
CSE
5810
Biomedical Informatics and Cloud Computing
19