Real-Time Analytics & Metrics Monitoring (Observability) Platform
Anil Gupta, vExpert
Wavefront Lead – APJ
@legraswindow
+65-92983137
guptaanil@vmware.com
© 2017, Wavefront. All rights reserved. Wavefront Confidential.

Modern Applications Demand a Modern Monitoring Platform
• Rapid software change, the rise of containers and microservices
• Continuous delivery and DevOps go mainstream
• Lack of unified visibility impacts availability of critical services
Confidential │ ©2019 VMware, Inc.

Why Metrics?
• LOGS: large, slower and expensive to store
• APM: large, intrusive, slow and expensive (5x – 10x)
• METRICS: a timestamped numerical measurement; small and fast (less expensive)
Other labels on the slide: Arbitrary Annotations, Duplicates / Conflicts, Invalid, Functional Metaphor, Strongly Connected
Example metric: cpu.usage.system.1m 0.03 1526731054 source=appserv1.wavefront.com #legraswindow (optional field)

What is Wavefront?
A SaaS-delivered, cloud-native, high-performance real-time metrics monitoring and analytics platform that provides 3D visibility through metrics, histograms, and traces:
• Full-stack correlation - health of every cloud, every app
• Identify problems quickly - 10x faster
• Correlate related and unrelated data
• 90% reduction in time taken to isolate & triage
• Enterprise scale
• 5x the adoption of APM
• 200+ OOTB integrations (ServiceNow, Openshift, etc.)
• 18-month data retention with no averaging
• No lock-in, open data collection and transparent consumption pricing

Full Stack Metrics
• Business Metrics/KPIs
• User Metrics
• Transaction Metrics
• Application Code
• Distributed Frameworks (Mesos / Kubernetes)
• Platform Apps, MQ, Proxy, Data Stores
• Containers
• Operating System
• Virtualization / Hypervisor
• Compute, Internet Access, Storage, Network, Power
Cloud First / Cloud Native / Private Cloud / Hybrid Cloud / Multi Cloud
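
The example metric on the "Why Metrics?" slide (cpu.usage.system.1m 0.03 1526731054 source=appserv1.wavefront.com) is a metric name, a value, a timestamp, a source, and optional tags. As a minimal sketch, assuming optional tags are written as key="value" pairs after the source, such a line could be built and read back in Python as follows. The function names format_metric_line and parse_metric_line and the env tag are hypothetical, chosen only for illustration; they are not part of any Wavefront SDK.

```python
import shlex
from typing import Dict, Optional


def format_metric_line(name: str, value: float, source: str,
                       timestamp: Optional[int] = None,
                       tags: Optional[Dict[str, str]] = None) -> str:
    """Build one line: <name> <value> [<timestamp>] source=<source> [key="value" ...]."""
    parts = [name, str(value)]
    if timestamp is not None:
        parts.append(str(int(timestamp)))  # epoch seconds, as on the slide
    parts.append(f"source={source}")
    for key, val in (tags or {}).items():
        # Assumption: optional tags are appended as key="value" pairs.
        parts.append(f'{key}="{val}"')
    return " ".join(parts)


def parse_metric_line(line: str) -> dict:
    """Split a line produced by format_metric_line back into its fields."""
    tokens = shlex.split(line)  # shlex removes the quotes around tag values
    name, value = tokens[0], float(tokens[1])
    rest = tokens[2:]
    timestamp = None
    if rest and "=" not in rest[0]:
        # A bare token after the value is the optional timestamp.
        timestamp = int(rest[0])
        rest = rest[1:]
    fields = dict(token.split("=", 1) for token in rest)
    source = fields.pop("source")
    return {"name": name, "value": value, "timestamp": timestamp,
            "source": source, "tags": fields}


if __name__ == "__main__":
    line = format_metric_line("cpu.usage.system.1m", 0.03,
                              "appserv1.wavefront.com",
                              timestamp=1526731054,
                              tags={"env": "prod"})
    print(line)
    print(parse_metric_line(line))
```

Keeping each point as one short text line is what makes metrics small and cheap relative to logs: every point carries only a name, a number, a time, and a few labels.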

Key Wavefront Differentiators for DevOps and Developers
• Advanced Query-Driven Analytics
• Massive Scale & Availability
• Self-Service Metrics, Customizable & Sharable Dashboards
• Intelligent Alerting for Proactive Monitoring

Many Easy Ways of Sending Metrics into Wavefront
• vCenter
• Telegraf - Collectd
• Logs – TCP or Filebeat
• Wavefront Proxy
• 3rd Party time-series DB
• Prometheus, Nagios, Zabbix, Solarwinds, etc.
• Custom / App Metrics
• Public Cloud / Cloud Services

Wavefront 200+ Integrations: Instant Visibility Across Technology Tiers with Packaged Dashboards
Integration categories: Applications, DevOps Tools, Web, Alert Notification, Containers, REST API, UI, Data Stores, Messaging, OS/Hypervisor, Monitoring Tools, Cloud

Many Use cases = One Solution
Analytics lends itself to more…but let’s start from common ones
• The First Pane of Glass
• Kubernetes & Containerized Applications Monitoring
• Microservices Observability
• CI/CD Pipeline Monitoring
• Public Cloud Monitoring

Use Case 1: The First Pane of Glass - FPOG
“When there is an issue everyone is on webex sharing 15+ tools for hours”
Wavefront Becomes The Master Aggregator of Metrics
• Aggregation of data/metrics from multiple sources
• Simple correlation analysis to ID events and possible causal areas
• Prove innocence, get to the last mile in 2 minutes – hence FPOG
Data sources: Cloud, Infrastructure, Selected** “Logs/ Events”, APM, Business KPI
Aggregate! Analyze! Act!

Wavefront Cloud-Native Analytics and Monitoring Platform Delivers 3D Observability
• Full stack monitoring/observability. Any App. Any Cloud.
• Detect, trend, and alert on anomalies
• Ingest, analyze and visualize metrics, histograms and traces at scale.
• Advanced query language for data analytics
• 200+ integrations with OOTB dashboards
Diagram labels: 𝑓(x), Applications, Any Cloud, Serverless, Microservices, Containers, Infrastructure, K8S/PKS/PCF, IoT

Use Case 3: Microservices Monitoring
Enhanced Application Observability With Unified Metrics and Distributed Tracing
EASILY MONITOR MICROSERVICES
• Out-of-the-box visibility
• OpenTracing/OpenCensus compliant
• Jaeger/Zipkin drop-in replacement
TROUBLESHOOT FASTER
• Quickly pinpoint issues
• Easily find long/errored API calls
• Query and narrow down traces with long spans
• Troubleshoot errored API responses
OPTIMIZE APPLICATION PERFORMANCE
• Find performance bottlenecks
• Find APIs that can be optimized

Use Case 4 - Public Cloud Monitoring
Turn-key visibility across AWS, Azure and Google Cloud Platform
Real-time visibility into AWS, Azure, and GCP helps troubleshoot cloud applications.
• Correlate app and cloud metrics.
• Optimize utilization with analytics-driven insights. Rightsizing.
• Use Wavefront and CloudHealth integration to balance cloud costs with performance.
Amazon EC2, Amazon ECS, Amazon CloudWatch, Amazon EBS, Amazon SQS, AWS Elastic Beanstalk, AWS KMS, Amazon Kinesis Data Firehose, Amazon EFS, Amazon RDS, Amazon EMR, AWS AutoScaling, Azure App Service, Google Cloud App Engine, Amazon EKS, Azure Redis Cache, Google Cloud Datastore, AWS Lambda, Amazon CloudFront, Azure AD, Google Cloud Machine Learning,
Azure SQL Database, Google Cloud Big Query, Amazon DynamoDB, AWS Direct Connect, Amazon Route53, Amazon SNS, Amazon Redshift, AWS KMS, Amazon Cache, AWS OpsWorks, Amazon S3, Azure Application Gateway, Google Cloud Pub/Sub, Azure Functions, Google Cloud Billing, Azure Container Instances, Google Cloud Router

Use case: CI/CD Monitoring
Example App Dev Environment
Pipeline stages: Plan, Code, Commit, Continuous Integration, Test, Artifact Management, Continuous Deployment, Configuration Management, Operations, Feedback
Teams: Dev, Ops, Security
• Plan: Architecture consulting. Dev + Ops decide on sprint stories.
• Code: App Code, Infra-as-Code
• Commit / Continuous Integration: Upload / commit of code automatically triggers app + infra + security code review.
• Test: If code passes review, deploy to production-like env for automated testing.
• Continuous Deployment: Auto deploy to production-like environment for user acceptance testing, STAGE.
• Operations: Both Dev and Ops can monitor.
• Feedback: Continuous learning and cooperation between Dev and Ops.
Infrastructure-as-Code
Fully instrumented pipeline. Analytics feedback loop from every delivery stage.
DevOps Stacks: PlanStack, CodeStack, CommitStack, CIStack, TestStack, ArtifactStack, CDStack, ConfigStack, OpsStack, FeedbackStack
Other labels: Plugins, Issues
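
One way to picture the fully instrumented pipeline on the CI/CD slide is for each delivery stage to emit a duration metric. The Python sketch below is illustrative only: the metric name cicd.stage.duration.seconds, the short stage names (loosely derived from the DevOps Stacks list), the pipeline and stage tags, and the build01 source are all hypothetical. The send_lines helper assumes a Wavefront Proxy accepting newline-terminated metric lines over TCP on localhost port 2878; verify the host and port against your own proxy configuration before using it.

```python
import socket
from typing import Dict, List

# Hypothetical short names for the delivery stages, in pipeline order.
STAGES = ["plan", "code", "commit", "ci", "test",
          "artifact", "cd", "config", "ops", "feedback"]


def stage_metric_lines(durations: Dict[str, float], source: str,
                       timestamp: int, pipeline: str) -> List[str]:
    """Return one metric line per stage that reported a duration."""
    lines = []
    for stage in STAGES:  # iterate in pipeline order, not dict order
        if stage in durations:
            lines.append(
                f"cicd.stage.duration.seconds {durations[stage]} {timestamp} "
                f'source={source} pipeline="{pipeline}" stage="{stage}"'
            )
    return lines


def send_lines(lines: List[str], host: str = "localhost", port: int = 2878) -> None:
    """Send the lines to a proxy over TCP (host and port are assumptions)."""
    payload = "".join(line + "\n" for line in lines).encode("utf-8")
    with socket.create_connection((host, port), timeout=5) as conn:
        conn.sendall(payload)


if __name__ == "__main__":
    lines = stage_metric_lines({"test": 42.5, "ci": 12.0},
                               source="build01",
                               timestamp=1526731054,
                               pipeline="demo-app")
    for line in lines:
        print(line)
    # send_lines(lines)  # uncomment only when a proxy is reachable
```

Tagging every point with the stage lets one dashboard compare stages side by side, which is the feedback loop the slide describes.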

Wavefront Helps Reddit Modernize Cloud-Native Monitoring
#3 most visited US website, #6 in the world
500 million monthly visitors
Challenges
• Graphite could not scale, issues with maintenance
Why Wavefront?
• Microservices monitoring and troubleshooting
• Accelerated anomaly detection
• Visibility across all Kubernetes levels
• Avoiding cardinality issues with Graphite
• Easy adoption across hundreds of developers

Lyft Business Metrics in Wavefront
Passenger metrics
• New user signups / installs / activations
• Current passengers with the app open
Driver metrics
• New driver applications / activations
• Current drivers with the app open
Ride metrics
• Rides requested / accepted / dropped off / canceled / lapsed
• Lyft Line rides dropped off
• Paid vs. Couponed rides dropped off
Marketplace metrics
• Drivers available
• Drivers en route
• Driver utilization %

One Billion Rides with Lyft and Wavefront
Unify Tools -> One Click Away from Resolution
Overview
Lyft is one of the two leading US companies in the field of Transportation as a Service, currently operating in 300+ cities. 10,000+ EC2 instances.
Challenge
• Phenomenal growth of the Lyft cloud service resulted in an enormous amount of metrics
• Graphite could not scale to meet the demand of the observability team – issues with reliability, query performance and maintenance
1,000+ Developers, 200+ Microservices, 18,000+ Alerts, 1000+ Dashboards
Why Wavefront?
• Observability team and developers avoid multiple-system syndrome and expensive context switching
• Fewer tools for triage, better and faster resolution
• Metrics and data from numerous sources available in a single click
• Real-time visibility into the performance of Lyft services
• Smart and highly efficient Alert Engine helps dynamically filter noise and capture veritable anomalies
• Powerful metrics explorer and chart view
Diagram labels: Cloud, Microservices, Applications, Service Mesh, Mobile Clients, Infrastructure, Business Metrics

Over 2.8 million Avengers (/Marvel) movie fans trusted BookMyShow to book their tickets prior to the movie’s release, avoiding long wait times and system delays to reserve their seat well in advance.
Thanks to BookMyShow for their trust in Wavefront as a weapon to find the right doors of anomalies early, maximize uptime and fight back trouble proactively, Avengers-style.

Enter Wavefront: Cloud Services Monitoring at Box Today
• Amazon EC2 insight
• Real-time cost analytics - spot & reserved instances pricing
• OS metrics monitoring - CPU, disk, network
• Application performance monitoring - 200s, 500s, latency
• More than 800,000 metrics/second (peaked at 1,000,000 pps)
• 400 engineers using Wavefront
• 848 Wavefront dashboards as of Oct 2017

Space Ape Ups Time-Sensitive Cloud Applications Performance with AWS Analytics
“Wavefront SaaS analytics lets us troubleshoot very accurately any problem before our cloud game customers get impacted.
We can track our user experience, service, operational and business performance in real time.”
- Louis McCormack, Lead DevOps Engineer, Space Ape Games
• Understanding real user experience with Amazon DynamoDB performance metrics
• Making business & operational decisions from AWS Lambda insight
• Improving game performance with AWS load balancer metrics

The First Pane of Glass for Large European Energy Supplier
• Aggregate monitoring and alerting
• Seamless integration
• Metrics-analytics driven business decisions
UK SRE practitioner blog: https://sysdef.xyz/

Setting up a trial is easy!
1 Wavefront website: wavefront.com
2 Wavefront free-trial: wavefront.com/sign-up
3 Wavefront blog: wavefront.com/blog
Anil Gupta
Wavefront Lead - APJ
guptaanil@vmware.com

DEMO

Thank You
Anil Gupta
Wavefront Lead - APJ
guptaanil@vmware.com