Memecached_Webinar-20131026

advertisement
Launch with Confidence
Improving Website
Performance and Scalability
with Memcached
Presented by: Shawn Smiley [Lead Architect]
About Me
LinkedIn: www.linkedin.com/in/shawnsmiley/
Twitter: @shawnsmiley
3
•
Shawn S. Smiley
•
Lead Architect at
Achieve Internet
•
15+ years experience
in Internet software
and hosting
•
5+ years with Drupal
•
Recently married
Agenda
About Achieve

At a Glance

Key Clients

Achieve Services
Intro to Caching

How it Works?
What is Memcached?
When and Why You Should Use It?

Performance and Stats
Minimal Setup

Components of a Memcache Solution

PECL Memcache vs Memcached

Commands
Verifying and Troubleshooting

Minimal Install Verification
Distributed Configurations

When to Use It

Types

Configuration

Tuning Tips
Are There Alternatives?
Conclusion
Q&A
4
Achieve at a Glance
legal name
founded
headquarters
leadership
mission
primary
contact info
Achieve Internet, Inc.
1998
1767 Grand Avenue, Suite #2 | San Diego, CA 92109 | U.S.A.
Ron Huber, Chief Executive Officer
Achieve leverages best-in-class Open Source web application development experience, and tools to create
optimized platforms for enterprise clients.
800.618.8777 | info@achieveinternet.com
Achieve is a leading software development company that leverages Drupal to develop advanced
enterprise web and mobile solutions for an array of clients in varying verticals. Achieve has a singular
goal: To help you launch your advanced digital platforms with confidence.
Our ethos is “it’s not magic, it’s excellence,” and our all-star developers deliver on that promise
with every new project we take on by consistently exceeding client expectations.
5
Key Clients: Media & Entertainment
clients
6
Additional Key Clients
clients
7
Achieve Services
Achieve builds custom enterprise IT management systems to solve the most complex business challenges.
Application
Development
Achieve makes integration easy so you can focus on what is most important to your business – your customers.
Platform
Integration
Optimize the performance of your web site or application to increase productivity and streamline operations.
Infrastructure
Services
Achieve has vast experience in creating global web solutions that far out perform the competition.
Globalization
Services
‘Going Mobile’ is no longer an option. With Achieve you won’t have to compromise design for utility.
Mobility & End User
Services
8
Intro to Caching
9
Intro to Caching
• A cache is a system that stores the results of some
processing for repeated later use.
• Example in Drupal is the page cache.
• Utilized via the cache_set() and cache_get() Drupal API
calls.
• Caches in Drupal are Key/Value pairs.
10
Intro to Caching: How it Works?
REQUEST
CACHE_
GET()
CHECK CACHE
RETURN
CACHED
DATA
11
QUERY CACHE
Intro to Caching: How it Works?
REQUEST
CACHE_
GET()
CHECK CACHE
QUERY CACHE
GENERAT
E DATA
RETURN
CACHED
DATA
CACHE_SET()
12
SAVE CACHE
What is Memcached?
13
What is Memcached?
• “Memcached is an in-memory key-value store for small
chunks of arbitrary data (strings, objects) from results of
database calls, API calls, or page rendering.” –
memcached.org
• A system to cache data in RAM rather than in database.
• Replaces Drupal’s default database cache.
• Just one component of your high performance scalable
architecture.
14
When & Why Should You Use
It?
15
Why Use Memcached?
• Improves cache read/write performance
• Reduces the number of database queries
• Reduces database load
• Can improve page load performance*
• Allows distributing the cache load to separate servers
• Moves the threshold for performance degradation to a higher
number of concurrent users.
* By itself, Memcache.d won’t provide a significant performance improve on low traffic sites. The page load performance
boost comes mostly from offloading work from your database server so that the system can handle more requests.
16
Performance Stats
• Performance & Scalability
• Cache read performance: 4X Faster
• Cache write performance: 60X Faster
• Actual statistics* (Drupal core vs. Memcached)
• 1000 Cache write operations: 12 seconds vs. 0.2 seconds
• 1000 Cache read operations: 0.4 seconds vs. 0.1 seconds
• DB queries: 3028 vs. 13
* Cache Read/Write stats are for performing 2,000 operations (1,000 write, 1,000 read)
* Your results will vary, tests performed on a single AWS instance with default D7 install
17
Performance Stats
Caching Performance Comparison: 2,000 Operations
Execution time (seconds)
12
Core DB
Memcache
1
0.0833333
Cache Set
Cache Get
18
Performance Degradation Threshold
NOTE: This is an idealized graph of the average change in response time across
several projecs.
19
When to (or not) use Memcached
• Use Memcached when:
• Significant portion of site traffic is authenticated users or when
static page caching isn’t an option.
• Have adequate server resources to allocate to Memcached
without negatively impacting other services.
• Have enough traffic to make a difference.
• Don’t use Memcached when:
• Site is completely anonymous traffic
•
Static caching is better for this (such as Varnish or Boost)
• Low traffic sites that are not straining server resources
•
Probably won’t see any noticeable benefit and have increased the
complexity of the system.
• Severely resource constrained single server hosting
•
Can negatively impact performance/scalability in this case.
20
Setup
21
Components of Memcached Solution
• Memcached Daemon (v 1.4.x)
• PHP Extension
• PECL Memcache (> 3.x)
• PECL Memcached (> 2.x)
• TIP: Use the setting “memcache_extension” to select
which library to use if you have both installed.
• Memcache Module
• Drupal 8, Drupal 7.9+, Pressflow 6, or Drupal
6 with core patch
22
PECL Memcache vs. Memcached
PECL Memcache
PECL Memcached
Stable, not as frequently updated
Newer, more features
Easy to install
More complicated install with multiple
dependencies
Lower PHP memory footprint in some
cases.
Can be slightly faster in some cases
Slightly better for single server
Memcached daemon installations.
Slightly better for distributed or
clustered Memcached daemon
installations.
igbinary support
* Generally, we default to PECL Memcache unless a project can take
advantage of the additional features provided by PECL Memcached.
23
Minimal Install: Commands (Ubuntu)
• Enable apt multiverse repositories
• sudo apt-get install php-pear build-essential
• sudo apt-get install memcached
• sudo apt-get install php5-memcache
• drush dl memcache devel
24
Minimal Install: Daemon Configuration
/etc/memcached.conf (or /etc/default/memcached)
• Configuration file for the memcached daemon
• Important settings:
• -m 64
•
Amount of memory in MB to allocate.
• -p 11211
•
TCP/IP Port to listen for cache requests on.
• -l 127.0.0.1
•
IP Address to listen for cache requests on.
25
Determining Memcached size
• Look at the data size of all cache* tables in your DB.
• Use this handy query to get the total size:
SELECT count(*) TABLES,
sum(table_rows) ROWS,
concat(round(sum(data_length)/(1024*1024),2),'MB') DATA,
concat(round(sum(index_length)/(1024*1024),2),'MB') idx,
concat(round(sum(data_length+index_length)/(1024*1024),2),'MB') total_size
FROM information_schema.TABLES
WHERE TABLE_SCHEMA=’dbname' AND TABLE_NAME LIKE "cache%"
• Total memory allocated to Memcached should be at least
the size returned in the “DATA” column.
• I usually like to add at least 50% to this size to start with.
26
Note about Memcached security
• Basically there are no security controls in Memcached.
• Any user or application that can establish a TCP
connection to the Memcached daemon can access all
data stored within Memcached.
• Security Best Practices:
• Always set the daemon to listen on a specific IP
address/network interface.
• Use 127.0.0.1 if there is only 1 Memcached server and it is
running on the same server as the web server.
• Use iptables or other firewall to restrict access to the
Memcached TCP/IP Port(s).
27
Minimal Install: Drupal Settings.php
// Required minimum settings.
$conf['cache_backends'][] = 'sites/all/modules/contrib/memcache/memcache.inc';
$conf['cache_default_class'] = 'MemCacheDrupal';
$conf['cache_class_cache_form'] = 'DrupalDatabaseCache';
// Optional, but recommended settings.
$conf['memcache_key_prefix'] = 'ai_'; // Remember about 250 byte key size limit.
$conf['lock_inc'] = 'sites/all/modules/contrib/memcache/memcache-lock.inc';
$conf['memcache_stampede_protection'] = TRUE;
// The following are required only if Memcached is on a different server.
$conf['memcache_servers’] = array(’10.11.12.13:11211' => 'default');
$conf['memcache_bins’] = array('cache' => 'default');
// Only if both PECL Memcache and Memcached are installed.
$conf['memcache_extension'] = 'Memcache';
28
Minimal Install: Drupal Configuration
Drupal Performance Settings
29
Verifying & Troubleshooting
30
Minimal Install: Verifying
• Memcache Admin Module
• Adds configuration page at Configuration => Memcached
• Enable the option “Show Memcache statistics” to see a log of
all Memcached calls on each page.
• Devel Module
• Enable the Query Logging option.
• Memcache Admin UIs:
• http://livebookmark.net/journal/2008/05/21/memcachephpstats-like-apcphp/
• https://code.google.com/p/phpmemcacheadmin/
31
Minimal Install: Verifying - Memcache
• Enable the module “Memcache Admin”
• Config page at Configuration => Memcache
• Enable the option “Show Memcache statistics” to see a log of
all Memcached calls on each page.
32
Minimal Install: Verifying - Devel
• In Devel Settings, enable “Display query log”
• Reload pages, you’ll see a list of all queries executed
• Ensure there are no calls to any “cache*” tables (other than
cache_form)
• Ensure there are no calls to the “semaphore” table (lock.inc)
33
Minimal Install: Verifying – Admin UIs
Demo
• Drupal Memcache Statistics
• memcache.php
• Memcache Admin
34
Distributed Configurations
35
Distributed Install: Overview
• When to use
• Types of distributed installs
• Configuration Steps
36
Distributed Install: When To Use It
• High traffic sites making more cache requests than a
single instance can efficiently handle.
• The default config of a Memcached daemon will support up to
1,024 simultaneous connection requests.
• Large caches where it makes sense to separate out cache
data by type or update frequency (e.g. cache_menu is
usually pretty static whereas the default cache could be
constantly changing)
• I start looking at this as an option when the memory allocation
for Memcached starts getting over 128MB.
37
Distributed Install: Types
• Multi-server/Clustered
• Distributes cache load across multiple Memcached servers.
• Allows for handling larger numbers of cache operations.
Useful for high traffic sites.
• NOTE: Caches are not replicated between multiple servers by
default.
• Multi-bin
• Segments different types of caches to different servers.
• Useful for very large caches with varying characteristics (e.g.
segmenting mostly read-only caches from caches that are
frequently updated)
• Hybrid
• You can combine both of the above approaches (e.g. have
multiple bins with each bin spread across multiple servers).
38
Distributed Install: Configuration
• Server configuration is same as the basic install, just on
multiple servers.
• Make sure the Memcached daemon is configured to listen on
each servers external IP address (not 127.0.0.1).
• All daemons within a cluster should be configured with the
same memory allocation.
• You can have multiple Memcached daemons running on a
single server. Just use different ports.
39
Distributed Install: Configuration
Additional settings needed in the site settings.php file
// Must be exactly the same on all web servers.
$conf['memcache_servers' = array(
'10.11.12.13:11211' => 'default',
'10.11.12.14:11211' => 'default',
'10.11.12.15:11211' => 'my_readonly_bin',
);
$conf['memcache_bins’] = array(
'cache' => 'default', // must specify if setting this variable.
'cache_bootstrap' => 'my_readonly_bin',
'cache_menu' => 'my_readonly_bin',
'cache_path' => 'my_readonly_bin',
);
// If using PECL Memcached.
$conf['memcache_options'] = array(
Memcached::OPT_DISTRIBUTION => Memcached::DISTRIBUTION_CONSISTENT,
);
40
Tuning Tips
• Watch the “Available Memory”, “Evictions” (==0), and
“reclaimed” (==0) stats to determine if memory allocation
needs to be adjusted.
• Watch “Max Connection Errors” (==0), “Accepting
Connections” (==1), “Connection Yields” (==0), and
“Listen Disabled Number” (==0) to determine when you
need to move from a single memcached server to
multiple servers.
Tip: See this page for a description of various stat counters:
http://www.pal-blog.de/entwicklung/perl/memcached-statistics-statscommand.html
41
Alternatives
42
Are There Alternatives?
Redis (https://drupal.org/project/redis)
• Great for persistent or replicated caches.
• Supports additional caching options such as structured data
caching.
• More efficient support for selective cache delete/expiration.
• Nice comparison to Memcache:
http://noblogdeffound.blogspot.com/2013/04/memcached-vsredis-comparison-name.html
MongoDB Cache (https://drupal.org/project/mongodb)
• Great for extremely large caches or when a large amount of
data must be stored in a single cache key.
43
Conclusion
By adding Memcached to our hosting solution we:
• Reduced the number of database queries.
• Reduced the load on our database servers.
• Reduced the amount of time the site spends accessing
cached data.
• Reduced the rendering time for page requests.
• Improved the overall scalability of the site.
44
Thank You
Shawn Smiley
 Lead Engineer
 shawn.smiley@achieveinternet.com
 Drupal: shawn_smiley
 800.618.8777
Connect with Achieve
If you have any further questions or would like to get in touch with
Achieve you can do so on these various channels:
info@achieveinternet.com
linkedin.com/company/Achieve-Internet
plus.google.com/101805819007866218919
@AchieveInternet
facebook.com/pages/Achieve-Internet
46
Q&A
47
Download