“INTEROPERABILITY AMONG NoSQL DATABASES IN CLOUD” KTH Applied Information Security Lab Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad AGENDA 1 Overview 2 Introduction Challenges & Motivations 3 4 5 KTH Applied Information Security Lab 6 7 8 9 Literature Survey Problem Statement Architecture & Workflow Standards & Technologies Roadmap References Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad OVERVIEW OF CLOUD COMPUTING On-demand Self Services Broad Network Access Rapid Elasticity Measured Services Resource Pooling Software-as-aservice KTH Applied Information Security Lab Public Platform-as-aservice Private Hybrid Infrastructureas-a-service Community Reference: http://cloudblueprint.wordpress.com/cloud-taxonomy/ Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad DBaaS “A managed service on cloud, offered on a pay-per-usage basis, that provides on-demand access to a database for the storage of application data”- Cloud Computing: Methodology, Systems, and Applications by Lizhe Wang, Rajiv Ranjan, Jinjun Chen, Boualem Benatallah.. DBMSs are extremely complex to deploy, setup, and maintain Relieves application developers from burden of managing and monitoring performance KTH Applied Information Security Lab Physically launching a virtual machine instance is not needed Installing and maintaining the database by application owners required Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad is not ADVANTAGES Increased Accessibility Automatic Failover Fast Automated Recovery From Failures Automated on-the-go Scaling KTH Applied Information Security Lab EXAMPLES Amazon SimpleDB NoSQL key-value stores Amazon Relational Database Minimal Investment DynamoDB Potentially Better Performance Google’s Big Table Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad CLOUD DBaaS CHALLENGES Data Security Network Security Heterogeneous Environment Multi-tenancy KTH Applied Information Security Lab Outside Attackers Loss of Control Data Replication & Consistency Management Data Sanitization Auditing & Monitoring Scalability Query and Transactional Loads Interoperability Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad DATA INTEROPERABILITY Ability of making systems and organizations to work together (inter-operate)-Interoperability Definition: “A database front-end which communicates with multiple heterogeneous databases and makes them appear as a single homogenous entity with semantic calls”- Encyclopedia KTH Applied Information Security Lab Data and queries of one database system made usable by the users of another database system. Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad NoSQL DATABASES •BIG data. •Simplicity of design (Flexible data model) •Horizontal scaling •Finer control over availability •Economics Column Value Store KTH Applied Information Security Lab Document Oriented Key-value stores Graph Database CATEGORIES Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad INDUSTRIAL SURVEY “It is possible we will see standards begin to emerge, both in on-the-wire protocols and perhaps in query languages, allowing interoperability between NoSQL database technologies similar to the kind of interoperability we’ve seen with SQL and relational database technology.” KTH Applied Information Security Lab Reference: http://www.odbms.org/blog/2011/03/the-evolving-market-for-nosql-databases-interview-with-james-phillips/ Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad “Every week, I talk to people that are considering replacing an existing database infrastructure with the NoSQL databases,” said David Rosenthal, co-founder and engineer at FoundationDB KTH Applied Information Security Lab “Companies have a desire to not operate four or five or six databases” Reference: http://www.infoworld.com/t/nosql/nosqls-real-time-analytics-database-consolidation-entice-enterprises-225396 Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad LITERATURE SURVEY KTH Applied Information Security Lab Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad 1. Cloud Databases: A Paradigm Shift in Databases Cloud computing has become a reality due to its lesser cost, scalability and pay-asyou-go model. NoSQL databases have become synonym for Cloud databases. Challenges to develop Cloud Databases KTH Applied Information Security Lab Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad Amazon S3 /SimpleDB Google App’s Bigtable MongoDB Cloud Databases in Industries CoucheDB KTH Applied Information Security Lab Dynamo MapReduce Hadoop Windows Azure Reference: Cloud Databases: A Paradigm Shift in Databases By Indu Arora1 and Dr. Anu Gupta2, IJCSI, Vol. 9, Issue 4, No 3, July 2012 Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad 2. Hibernating in the Cloud – Implementation and Evaluation of Object-NoSQL -Mapping Problem: Object-relational impedance mismatch KTH Applied Information Security Lab Vulnerability: Not usable for other categories of NoSQL databases. Hibernate along with life cycle management is re-implemented. Solution: Non-relational approach which integrates NoSQL support in hibernate (Key Value-RIAK) Improvement: Hibernate OGM Usable for other NoSQL categories. Reference: Florian Wolf, Heiko Betz, Francis Gropengießer, and Kai-Uwe Sattler,” Hibernating in the Cloud – Implementation and Evaluation of Object-NoSQL-Mapping”, 2011. Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad 3. SOS (Save Our Systems): A uniform programming interface for non-relational systems KTH Applied Information Security Lab Problem: Heterogeneity of languages and interfaces offered by NoSQL systems Solution: Common interface and hides all underlying details of every NoSQL database Vulnerability: Not for Cloud Allows put, delete and get methods only. Only for 3 NoSQL databases Improvement: Hibernate OGM Cloud Usable for other NoSQL databases Reference: Paolo Atzeni, Francesca Bugiotti, Luca Rossi , “SOS (Save Our Systems): A uniform programming interface for non-relational systems”, EDBT Berlin Germany, March 2012. Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad 4. ONDM: an ObjectNoSQL Datastore Mapper KTH Applied Information Security Lab Problem: Heterogeneity of NoSQL databases (Different data models and API’s) Solution: ONDM- Layered architecture , uniform programming interface and transparent access to different NoSQL stores Vulnerability: Not for Cloud, Only for Apache Cassandra, Couchebase, MongoDB, Oracle NoSQL and Redis, Implement separate connector for every data store Improvement: Implement for Cloud, Hibernate OGM , Usable for other NoSQL databases (whole category) Reference: Luca Cabibbo, “ONDM: an ObjectNoSQL Datastore Mapper”, 2013. Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad PROBLEM STATEMENT KTH Applied Information Security Lab • Businesses and other organizations are increasingly adopting DBaaS to handle large data set applications in Cloud. However, there are numerous issues causing hindrance in the successful adoption of DBaaS. These issues are not holistically surveyed and there is a requisite need to properly explore those issues. • Several NoSQL databases exist which are becoming essential in the cloud infrastructure but they are prone to a legitimate problem of heterogeneous languages and interfaces offered to developers and users for accessing database services. This unfortunately cause interoperability issues . Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad MOTIVATION Security Module Common Interface Generic Modeling Approach KTH Applied Information Security Lab Different interfaces (Data models & API’s) Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad PROPOSED SYSTEM ARCHITECTURE APPLICATION (OBJECT MODEL) Interacts with Hibernate API Query Engine Handler KTH Applied Information Security Lab Handler Handler Handler Object NoSQL Mapping NoSQL Data Store Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad CONTRIBUTION KTH Applied Information Security Lab Highlight all the potential issues and limitations and suggest latest research, developments and R&D solutions to benefit the computing community. Address interoperability issue among cross-domain Cloud DBaaS. To give proof of concept, we are aiming to provide a secure solution for transparent access of underlying heterogeneous NoSQL data stores particularly related to column oriented family. Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad TOOLS & TECHNOLOGIES KTH Applied Information Security Lab Java (Eclipse) Hibernate OGM OR Mapping NoSQL System OpenStack/Amazon Java Security Annotations Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad ROADMAP KTH Applied Information Security Lab Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad REFERENCES i. ii. iii. iv. v. vi. vii. viii. KTH Applied Information Security Lab ix. x. xi. http://www.dbta.com/Articles/Editorial/Trends-and-Applications/The-Rise-of-theCloud-Database-88941.aspx http://www.itsabhik.com/cloud-database-advangates-and-disadvantages/ http://en.wikipedia.org/wiki/Cloud_database http://www.itbusinessedge.com/slideshows/show.aspx?c=96438 http://searchitchannel.techtarget.com/tip/DBaaS-pros-and-cons-for-solutionproviders http://blogs.vmware.com/vfabric/2012/08/why-dbaas-6-trends-pushing-database-asa-service.html http://dbaas.wordpress.com/2008/05/14/what-exactly-is-database-as-a-service/ http://www.infoworld.com/d/cloud-computing/data-interoperability-challenge-cloudcomputing-259 http://www-db.ics.uci.edu/pages/research/das/ http://www.disasterresource.com/index.php?option=com_content&view=article&id=335%3Afacing-thechallenge-of-data-interoperability&catid=9%3Acrisis-response&Itemid=15 Yvette E. Gelogo1andSunguk Lee, Pohang, Gyeongbuk, Korea, “Database Management System as a Cloud Service “, International Journal of Future Generation Communication and Networking Vol. 5, No. 2, June 2012 Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad REFERENCES (Cont..) http://www.disaster resource.com/index.php?option=com_content&view=article&id=335%3Afacing-thechallenge-of-data-interoperability&catid=9%3Acrisis-response&Itemid=15 xiii. http://docs.huihoo.com/oracle/enterprisemanager/12c/doc.121/e28814/cloud_db_overview.htm xiv. vFabric Team ,”Why DBaaS? 5 Trends Pushing Database as a Service”, August 24, 2012, Available: http://blogs.vmware.com/vfabric/2012/08/why-dbaas-6-trends-pushingdatabase-as-a-service.html xv. David Linthicum | InfoWorld , “Interoperable Database”, January 12, 2010, Available: http://dictionary.reference.com/browse/interoperable+database. xvi. “The data interoperability challenge for Cloud computing “, Available: http://www.infoworld.com/d/Cloud-computing/data-interoperability-challengeCloud-computing-259. xvii. Bob Buda, http://www.budaconsulting.com/blog/bid/60156/Database-SecurityIssues-in-the-Cloud-Part-1, Mon, Mar 07, 2011 xviii. “Cloud Tweaks, “A Hitchhikers Guide to the Cloud – Database Challenges To Consider”, September 13, 2012. xix. CLOUD SECURITY ALLIANCE, “Top Ten Big Data Security and Privacy Challenges”, November 2012 xii. KTH Applied Information Security Lab Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad REFERENCES (Cont..) xxi. xxii. xxiii. xxiv. xxv. Zhizhong Zhang, Chuan Wu, David W.L. Cheung, “A Survey on Cloud Interoperability: Taxonomies, Standards, and Practice”, Performance Evaluation Review, Vol. 40, No. 4, March 2011, pp. 13-22 Elena Ferrari, “Database as a Service: Challenges and Solutions for Privacy and Security”, IEEE Asia-Pacific Services Computing Conference (IEEE APSCC), 2009, pp. 4651 Imal Sakhi, “Databases Security in Cloud”, 2012 Carrenza, “DATABASE AS A SERVICE”, Available: http://carrenza.com/services/usecases/database-as-a-service/ Zhizhong Zhang, Chuan Wu, David W.L. Cheung, “A Survey on Cloud Interoperability: Taxonomies, Standards, and Practice”, Performance Evaluation Review, Vol. 40, No. 4, March 2011, pp. 13-22 KTH Applied Information Security Lab Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad KTH Applied Information Security Lab Department of Computing, School of Electrical Engineering and Computer Sciences, NUST Islamabad