FAST FORWARD WITH MICROSOFT BIG DATA Vinoo Srinivas M Solutions Specialist Windows Azure (Hadoop, HPC, Media) THE WORLD OF DATA IS CHANGING Cloud CREATING NEW BUSINESS OPPORTUNITIES 1. Increases ad revenue by processing 3.5 billion events per day Measures and ranks online user influence by processing 3 billion signals per day Uses sentiment analysis and web analytics for its internal cloud Massive Volumes Cloud Connectivity Real-Time Insight Processes 464 billion rows per quarter, with average query time under 10 secs. Connects across 13 social networks via the cloud for data and API access Improves operational decision making for IT managers and users Klout Case Study: http://www.microsoft.com/casestudies/Microsoft-SQL-Server-2012-Enterprise/Klout/Data-Services-Firm-Uses-Microsoft-BI-and-Hadoop-toBoost-Insight-into-Big-Data/710000000129 BIG DATA REQUIRES A HOLISTIC APPROACH THAT LEVERAGES TRADITIONAL AND NEW CAPABILITIES TRADITIONAL Relational Database Management System NEW Petabyte-Scale Services MICROSOFT’S APPROACH TO BIG DATA Immersive Insight, Wherever you are Connecting with the World’s Data Any Data, Any Size Anywhere Analyze Big Data with familiar tools Share your data with the world via Azure Marketplace Simplicity and manageability of Windows to Hadoop Enrich with social media data via Social Analytics Extended data warehousing with Hadoop Advanced analytics with Hadoop Scale & elasticity of cloud Immersive insights from any data JavaScript based simple programming Benefits Interaction and analysis of unstructured data in Hadoop from Microsoft Excel Key Features WE DELIVER INSIGHTS TO EVERYONE BY ENABLING BIG DATA ANALYSIS WITH FAMILIAR END USER TOOLS Hive add-in for Excel Benefits UNLOCKING NEW INSIGHTS FROM ALL DATA WITH MICROSOFT BI TOOLS Key Features Familiar BI tools with structured and unstructured data Hive ODBC Driver integrates Hadoop to SQL Server Analysis Services, PowerPivot, and Power View Benefits WHILE DRAMATICALLY SIMPLIFYING PROGRAMMING ON HADOOP WITH JAVASCRIPT MapReduce programs in JavaScript Key Features Simplified programming Simplified deployment of MapReduce jobs JS New JavaScript libraries for Hadoop Deploy JavaScript Hadoop jobs from a simple web browser Key Features Benefits MICROSOFT UNIQUELY CONNECTS HADOOP TO THE WORLD VIA WINDOWS AZURE MARKETPLACE Sharing of data and insights through Windows Azure Marketplace Mashing up of internal and public data sets via Data Explorer Integration with Windows Azure Marketplace Integration with third-party data and services Key Features Benefits ENRICHES ANALYSIS WITH SOCIAL MEDIA DATA VIA SOCIAL ANALYTICS Stronger customer relationships Models augmented with publicly available data from social media sites Integration of social information with business applications Microsoft Codename "Social Analytics" Integration with social media sites Key Features Benefits AND ENHANCES YOUR DATA THROUGH PREDICTIVE ANALYSIS ON HADOOP New business insights with predictive analytics from Microsoft Unlock rare patterns from bespoke data mining models Hive ODBC Driver connects Hadoop to SQL Server Data Mining tools Support for open source predictive analytics tools such as R and Mahout Key Features Benefits MICROSOFT BRINGS THE SIMPLICITY AND MANAGEABILITY OF WINDOWS TO HADOOP Simplified management of Hadoop on Windows Enterprise-class security Easy setup on-premises and in the cloud Smart packaging of Hadoop on premises Integration with Microsoft System Center Integration with Windows Server® Active Directory Fast deployment of Hadoop on Azure WHILE EXTENDING YOUR ENTERPRISE DATA WAREHOUSE WITH HADOOP Benefits Integration with Microsoft Enterprise Data Warehouses Key Features Integration with enterprise BI solutions Microsoft SQL Server connector for Apache Hadoop with SQOOP (SQL to Hadoop) Deeper insights from structured and unstructured data SQL Server Parallel Data Warehouse connector for Apache Hadoop with SQOOP Key Features Benefits AND PROVIDING CHOICE OF DEPLOYMENT OPTIONS Elastic peta-scale analytics on Microsoft’s cloud platform Enterprise-class Big Data platform on-premises Hadoop-based Service on Windows Azure platform Hadoop-based distribution on Windows Server Benefits DELIVERED THROUGH OPEN PLATFORM WITH A RICH PARTNER ECOSYSTEM Key Features 100-percent compatibility with Apache Hadoop Choice of tools from rich ecosystem of Hadoop partners Giving back to Hadoop • Accelerating the delivery of Hadoop for Microsoft • • Hadoop Service for Windows JavaScript libraries Hive ODBC Drivers A HOLISTIC BIG DATA SOLUTION FROM MICROSOFT SPANNING RELATIONAL AND NON-RELATIONAL WORLDS SELF-SERVICE INSIGHTS MOBILE PREDICTIVE COLLABORATIVE DATA ENRICHMENT DISCOVER AND RECOMMEND TRANSFORM AND CLEAN SHARE AND GOVERN DATA MANAGEMENT 1 011 01 RELATIONAL NON-RELATIONAL MULTIDIMENSIONAL STREAMING MARKETPLACE REAL-TIME External Data and Services OPERATIONAL HADOOP ON WINDOWS & AZURE: ROADMAP Excel Integration Preview 2 INSIGHTS DATA ENRICHMENT • Hive Add-in for Excel • PowerPivot Add-in for Excel • Power View for SharePoint Hadoop Connectors Azure Data Market Hive ODBC Driver Preview 2 Azure Labs • Data Explorer • Social Analytics • Data Hub (Private Data Market) Hadoop on Azure GA • Portal Integration & Billing • Azure SDK integration DATA MANAGEMENT Hadoop on Azure Private CTP Hadoop on Server Private TAP • Hadoop Core & Common • JavaScript Framework CY 17 H2 2011 Hadoop on Azure Preview 2 • More capacity • Disaster Recovery for HDFS • Support for Mahout Hadoop on Server GA • JavaScript, PIG, Hive, Hbase • Active Directory Integration • Systems Center Integration 2012 ADDITIONAL RESOURCES LEARN MORE • Microsoft Big Data Solution: www.microsoft.com/bigdata • Windows Azure: www.windowsazure.com/en-us/home/scenarios/big-data TRY NOW • Preview of the Hadoop-based service for Windows Azure: https://www.hadooponazure.com APPENDIX