Uploaded by danielbnip

wavefrontbyvmware-june2019-legraswindow-190602165215

Real-Time Analytics &
Metrics Monitoring
(Observability) Platform
Anil Gupta, vExpert
Wavefront Lead – APJ
@legraswindow
+65-92983137
guptaanil@vmware.com
© 2017, Wavefront. All rights reserved. Wavefront Confidential.
Modern Applications demands Modern Monitoring Platform
Rapid software
change, the rise of
containers and
microservices
Continuous delivery
and DevOps goes
mainstream
Lack of unified
visibility impacts
availability of critical
services
Confidential │ ©2019 VMware, Inc.
Why Metrics?
Large, Slower
and expensive to
store
LOGS
A Numerical Measurement
Timestamped
APM
METRICS
Arbitrary Annotations
Duplicates / Conflicts Invalid
Large, Intrusive,
Slow and Expensive
(5x – 10x)
Functional Metaphor
Small and Fast
(less expensive)
Strongly Connected
cpu.usage.system.1m 0.03 1526731054 source=appserv1.wavefront.com #legraswindow(optional field)
Confidential │ ©2019 VMware, Inc.
What is Wavefront?
A SaaS delivered, cloud native high performance
Real Time metrics monitoring and analytics platform
that:
Provides 3D visibility through metrics, histograms,
and traces
•
•
•
•
•
•
•
•
•
Full Stack Correlation - health of every cloud, every app
Identify problems quickly - 10x faster
Correlate related and unrelated data
90% Reduction in time taken to Isolate & Triage
Enterprise scale
5x the Adoption of APM
200+ OOTB integrations (ServiceNow, Openshift, etc)
18 month data retention with no averaging
No Lock-in, Open data collection and transparent consumption pricing
4
Confidential │ ©2019 VMware, Inc.
Full Stack Metrics
Business Metrics/KPIs
User Metrics
Transaction Metrics
Application Code
Distributed Frameworks (Mesos / Kubernetes)
Platform Apps MQ Proxy
Data Stores
Containers
Operating System
Virtualization / Hypervisor
Compute
Internet Access
Storage
Network
Power
Cloud First / Cloud Native / Private Cloud / Hybrid Cloud / Multi Cloud
Confidential │ ©2019 VMware, Inc.
Key Wavefront Differentiators for DevOps and Developers
Advanced Query-Driven
Analytics
Massive Scale &
Availability
Self-Service Metrics,
Customizable &
Sharable Dashboards
Intelligent Alerting for
Proactive Monitoring
CONFIDENTIAL
Confidential │ ©2019 VMware, Inc.
6
Many Easy Ways of Sending Metrics into Wavefront
vCenter
Telegraf - Collectd
Logs – TCP or
Filebeat
Wavefront Proxy
3rd Party time-series DB
Prometheus
Nagios
Zabbix
Solarwinds
Etc..
Custom / App Metrics
Public Cloud /
Cloud Services
Confidential │ ©2019 VMware, Inc.
Wavefront 200+ Integrations:
Instant Visibility Across Technology Tiers with Packaged Dashboards
Applications
DevOps Tools
Web
Alert Notification
Containers
REST API
UI
Data Stores
Messaging
OS/Hypervisor
Monitoring Tools
Cloud
Confidential │ ©2019 VMware, Inc.
Many Use cases = One Solution
Analytics lends itself to more…but let’s start from common ones
The First Pane
of Glass
Kubernetes &
Containerized
Applications
Monitoring
Microservices
Observability
CI/CD Pipeline
Monitoring
Confidential │ ©2018 VMware, Inc.
Public Cloud
Monitoring
9
Confidential │ ©2019 VMware, Inc.
Use Case 1: The First Pane of Glass - FPOG
“When there is an issue everyone is on webex sharing 15+ tools for hours”
Wavefront Becomes The Master
Aggregator of Metrics
 Aggregation of data/metrics from
multiple sources
 Simple correlation analysis to ID
events and possible causal areas
 Prove innocence, get to the last mile
in 2 minutes – hence FPOG
Cloud
Infrastructure
Aggregate!
Selected**
“Logs/
Events”
APM
Analyze!
Business KPI
Act!
Confidential │ ©2019 VMware, Inc.
10
Wavefront Cloud-Native Analytics and Monitoring Platform
Delivers 3D Observability
•
Full stack monitoring/observability. Any App.
Any Cloud.
•
Detect, trend, and alert on anomalies
•
Ingest, analyze and visualize metrics,
histograms and traces at scale.
•
Advanced query language for data
analytics
•
200+ integrations with OOTB dashboards
𝑓(x)
Applications
Any Cloud
Serverless
Microservices
Containers
Infrastructure
K8S/PKS/PCF
Confidential │ ©2019 VMware, Inc.
11
IoT
Use Case 3: Microservices Monitoring
Enhanced Application Observability
With Unified Metrics and Distributed Tracing
EASILY MONITOR
MICROSERVICES
•
Out-of-the-box visibility
• Quickly pinpoint issues
•
OpenTracing/OpenCensus
compliant
• Easily find long/errored API
•
Jaeger/Zipkin drop-in
replacement
OPTIMIZE APPLICATION
PERFORMANCE
TROUBLESHOOT FASTER
calls
•
Find performance
bottlenecks
•
Query and narrow down
traces with long spans
•
Find APIs that can be
optimized
• Troubleshoot errored API
responses
Confidential │ ©2019 VMware, Inc.
12
Use Case 4 - Public Cloud Monitoring
Turn-key visibility across AWS, Azure and Google Cloud Platform
Real-time into AWS, Azure, and
GCP helps troubleshoot cloud
applications. Correlate app and
cloud metrics.
Optimize utilization with
analytics-driven insights.
Rightsizing. Use Wavefront and
CloudHealth integration to
balance cloud costs with
performance.
Amazon EC2
Amazon ECS
Amazon
CloudWatch
Amazon EBS
Amazon SQS
AWS Elastic
Beanstalk
AWS KMS
Amazon Kinesis
Data Firehose
Amazon EFS
Amazon RDS
Amazon EMR AWS AutoScaling
Azure App
Service
Google Cloud
App Engine
Amazon EKS
Azure Redis
Cache
Google Cloud
Datastore
AWS Lambda Amazon CloudFront
Azure AD
Google Cloud
Machine Learning
Confidential │ ©2019 VMware, Inc.
Azure SQL
Database
Google Cloud
Big Query
Amazon DynamoDB
AWS Direct
Connect
Amazon
Route53
Amazon SNS
Amazon
Redshift
AWS KMS
Amazon Cache AWS OpsWorks
Amazon S3
Azure Application
Gateway
Google Cloud
Pub/Sub
13
Azure
Functions
Google Cloud
Billing
Azure Container
Instances
Google Cloud
Router
Use case: CI/CD Monitoring
Example App Dev Environment
Plan
Code
Architecture
consulting
Dev + Ops
decide on
sprint stories
App Code
Infra-as-Code
Commit
Continuous
Integration
Upload / commit
of code
automatically
triggers app +
infra + security
code review
Dev
Test
Artifact
Management
If code passes
review, deploy to
production-like
env for
automated
testing.
Continuous
Deployment
Configuration
Management
Auto deploy to
production-like
environment for
user acceptance
testing, STAGE.
Ops
Security
Operations
Feedback
Both Dev and
Ops can
monitor
Continuous
learning and
cooperation
between Dev
and Ops
Infrastructure-as-Code
DevOps
Stacks
PlanStack
CodeStack
CommitStack
CIStack
TestStack
ArtifactStack
CDStack
ConfigStack
OpsStack
FeedbackStack
Plugins
Issues
Confidential │ ©2019 VMware, Inc.
Use case: CI/CD Monitoring
Example App Dev Environment
Plan
Code
Architecture
consulting
Dev + Ops
decide on
sprint stories
App Code
Infra-as-Code
Commit
Continuous
Integration
Upload / commit
of code
automatically
triggers app +
infra + security
code review
Dev
Test
Artifact
Management
If code passes
review, deploy
to productionlike env for
automated
testing.
Continuous
Deployment
Configuration
Management
Auto deploy to
production-like
environment for
user acceptance
testing, STAGE.
Ops
Security
Operations
Feedback
Both Dev and
Ops can
monitor
Continuous
learning and
cooperation
between Dev
and Ops
Infrastructure-as-Code
Fully Instrumented pipeline. Analytics feedback loop from every delivery stage
DevOps
Stacks
PlanStack
CodeStack
CommitStack
CIStack
TestStack
ArtifactStack
CDStack
ConfigStack
OpsStack
FeedbackStack
Plugins
Issues
Confidential │ ©2019 VMware, Inc.
Wavefront Helps Reddit Modernize
Cloud-Native Monitoring
#3 most visited US website, #6 in the world
500 million monthly visitors
Challenges
Graphite could not scale, issues with maintenance
Why Wavefront?
Microservices monitoring and troubleshooting
Accelerated anomaly detection
Visibility across Kubernetes all levels
Avoiding cardinality issues with Graphite
Easy adoption across hundreds of developers
©2018 VMware, Inc.
16
Confidential │ ©2019 VMware, Inc.
Lyft Business Metrics in Wavefront
Passenger metrics
•
•
New user signups / installs / activations
Current passengers with the app open
Driver metrics
•
•
New driver applications / activations
Current drivers with the app open
Ride metrics
•
•
•
Rides requested / accepted / dropped off / canceled / lapsed
Lyft Line rides dropped off
Paid vs. Couponed rides dropped off
Marketplace metrics
•
•
•
Drivers available
Drivers en route
Driver utilization %
©2018 VMware, Inc.
17
Confidential │ ©2019 VMware, Inc.
One Billions Rides with Lyft and Wavefront
Unify Tools -> One Click Away from Resolution
Overview
Challenge
Lyft is one of the two leading US companies in the
field of Transportation as a Service currently operating
in 300+ cities. 10,000+ EC2 instances.
•
Phenomenal growth of Lyft cloud service resulted in enormous
amount of metrics
•
Graphite could not scale to meet the demand of observability
team – issues with reliability, query performance and
maintenance
1,000+
Developers
200+
18,000+
Microservices Alerts
1000+
Dashboards
Why Wavefront?
@
Cloud
Microservices
Applications
Service Mesh
Mobile Clients
•
Observability team and developers avoid multiple-system syndrome
and expensive context switching
•
Fewer tools for triage, better and faster resolution
•
Metrics and data from numerous sources available in a single click
•
Real-time visibility into the performance of Lyft services
•
Smart and highly efficient Alert Engine helps dynamically filter noise
and capture veritable anomalies
•
Powerful metrics explorer and chart view
Infrastructure
Business Metrics
18
Confidential │ ©2019 VMware, Inc.
Over 2.8 million Avengers (/Marvel) movie
fans trusted BookMyShow to book their
tickets prior to the movie’s release,
avoiding long wait times and system
delays to reserve their seat well in
advance. Thanks to BookMyShow for
visited
their trust in Wavefront as#3a most
weapon
to US website, #6 in the world
500 million
monthly visitors
find the right doors of anomalies
early,
Challenges
maximized uptime and fight
back
Grphite could not scale, issues with maintenance
trouble proactively, Avengers-style.
Why Wavefront?
Microservices monitoring and troubleshooting
Accelerated anomaly detection
Visibility across Kubernetes all levels
Avoiding cardinality issues with Graphite
Easy adoption across hundreds of developers
19
Confidential │ ©2019 VMware, Inc.
Enter Wavefront: Cloud Services Monitoring at Box Today
• Amazon EC2 insight
• Real-time cost analytics - spot & reserved
instances pricing
• OS metrics monitoring - CPU, disk,
network
• Application performance monitoring -
200s, 500s,latency
• More than 800,000 metrics/second
(peaked at 1,000,000 pps)
• 400 engineers using Wavefront
• 848 Wavefront dashboards as of Oct 2017
Confidential │ ©2019 VMware, Inc.
Space Ape Ups Time-Sensitive Cloud Applications Performance with
AWS Analytics
Wavefront SaaS analytics lets us
troubleshoot very accurately any problem
before our cloud game customers get
impacted. We can track our user
experience, service, operational and
business performance in real time.”
 Understanding real user
experience with Amazon
DynamoDB performance
metrics
 Making business &
operational decisions from
AWS lambda insight
 Improving game performance
- Louis McCormack
Lead DevOps Engineer,
Space Ape Games
with AWS load balancer
metrics
Confidential │ ©2019 VMware, Inc.
The First Pane of Glass for Large European Energy Supplier
•
Aggregate monitoring and alerting
•
Seamless integration
•
Metrics-analytics driven business
decisions
UK SRE practitioner blog:
https://sysdef.xyz/
22
Setting up a trial is easy!
1
Wavefront website: wavefront.com
2
Wavefront free-trial: wavefront.com/sign-up
3
Wavefront blog: wavefront.com/blog
Anil Gupta
Wavefront Lead - APJ
guptaanil@vmware.com
Confidential │ ©2019 VMware, Inc.
DEMO
Confidential │ ©2019 VMware, Inc.
24
Thank You
Anil Gupta
Wavefront Lead - APJ
guptaanil@vmware.com
Confidential │ ©2019 VMware, Inc.