Microsoft® Lync™ Server 2010 High Availability and Resiliency Module 12 Microsoft Corporation Session Objectives and Takeaways Session Objectives: • Resiliency Architecture • Branch Office Resiliency • Data Center Resiliency Takeaways: • Microsoft Lync Server 2010 Architecture provides High Availability for Voice services • Survivable Branch Appliance built by UC partners delivers Voice High Availability for branch users • Metropolitan Data Center Resiliency delivers High Availability for all UC modalities across datacenters separated by high bandwidth & low latency • Voice Data Center Resiliency delivers Voice High Availability across geo-dispersed datacenters 2 Agenda • • • • • Main Concepts High Availability Branch Office Resiliency Data Center Voice Resiliency Metropolitan Data Center Resiliency 3 Main Concepts 4 Survivable Branch Appliance (SBA) A purpose-built appliance optimized to provide resilient multi-modal communication for maximizing branch office user productivity Data Center Edge Server Branch Office Lync Pool SBA WAN PSTN Components Windows Server® 2008 R2 Mediation Server Registrar PSTN Gateway Functionality Normal/Failover mode SIP Registrar SIP Proxy and Routing engine PSTN connectivity Voicemail routing PSTN re-routing Centrally provisioned Up to 1000 user support Go-To Market OEM (Embedded channel) 5 partners • Audiocodes • Dialogic • Ferrari • HP • NET 5 Terminology • DNS Load Balancing (DNS LB) is NOT the same as DNS Round Robin (DNS RR) • DNS RR: DNS Server is configured for Round Robin and returns a list of IP addresses for a Pool FQDN in different order • • • Pool.contoso.com resolves to IP1, IP2, IP3 • DNS Server replies to 1st Query with (IP1, IP2, IP3) • DNS Server replies to 2nd Query with (IP2, IP1, IP3) • DNS Server replies to 3rd Query with (IP3, IP2, IP1) DNS Load balancing • Lync Clients and Servers are able to understand multiple IP addresses (Ex: IP1, IP2, IP3) • Clients failover if connectivity to one IP fails • Servers failover as well as load balance traffic across these IPs DNS RR complements DNS Load Balancing 6 DNS Load Balancing Goals DNS Load Balancing Goals • Simplify Hardware Load Balancer (HLB) Configuration • Reduce dependence on HLB • DNS LB supported for Internal Pool, Director Pool • All Server-Server and Client-Server SIP traffic • All Server-Server HTTP traffic • Media traffic • Support Draining of Applications • Improve Load balancing of server-server traffic (Ex: Access Edge – Director) Eliminating HLB is not a goal • HLB still be required • Client – Server HTTP & HTTP(s) traffic (ABS, DLX, LIS, etc.) • Edge – High availability for PIC and legacy clients 7 Machine DNS A Entries Pool DNS A Entries Domain Name System Load Balancing Sample Configuration For a Lync Server Pool ocspool1.contoso.com with 3 FEs: FE1, FE2, FE3 Pool VIP DNS FQDN DNS A Record IP ocspool1.contoso.com 172.24.32.150 FE FE1.contoso.com 172.24.32.151 FE FE2.contoso.com 172.24.32.152 FE FE3.contoso.com 172.24.32.153 OCS 2007 R2 HLB - DNS Configuration DNS FQDN DNS A Record IP Web VIP lyncweb1.contoso.com 172.24.32.150 Pool lyncpool1.contoso.com 172.24.32.151 Pool lyncpool1.contoso.com 172.24.32.152 Pool lyncpool1.contoso.com 172.24.32.153 FE FE1.contoso.com 172.24.32.151 FE FE2.contoso.com 172.24.32.152 FE FE3.contoso.com 172.24.32.153 Lync Server 2010 DNS LB - DNS Configuration 8 Other Resiliency Enhancements • DNS Based Load Balancing for Internal Pools • • • • Draining: Ability to “drain a server” before taking the server down Session Dialog Resiliency for Conferencing • • Only Client – Server HTTP traffic can be DNS Load Balanced All other traffic CANNOT be DNS Load Balanced • Still require HLB for this traffic (NLB not supported) User can still participate in a conference even if the FE the user is connected to goes down Client caches successful connections to Lync Server 2010 • SIP Registrar, Media Relay Authentication Server (MRAS)/Media Relay (MR) FQDN and IP 9 High Availability 10 High Availability in OCS 2007 / 2007 R2 Office Communications Server (OCS) 2007 and R2 Registration Routing Presence Conferencing HLB required for all traffic Bob’s Lync Bob’s Phone 2010 Architecture: • One monolithic Front End Service • Dependency on single shared backend database (Registration, Routing, Presence, Conferencing) 11 High Availability – Lync Server 2010 Microsoft Lync Server 2010 User Services Database (Presence and Conferencing) Registrar Database (Registration and Routing) Bob’s Lync Bob’s Phone 2010 HLB is optional for SIP traffic (DNS LB is recommended) HLB still required for client-server HTTP Traffic Architecture: • Registrar Role (Registration and Routing). Each registrar has its own SQL Express database • User Services Role (Presence and Conferencing) • Registrar and User Services are collocated in the datacenter (but on different servers) • All user end points register with same Front End • Users are load balanced by Registrars using a Distributed Hash Algorithm • Registrar can be installed in remote locations 12 Resiliency Architecture Branch Office Registrar Data Center - EE Pool 1 Presence Conferencing Active Directory AD & DNS Registrar (Registration & Routing) Bob’s Primary Registrar & User Services = EE Pool 1 Backup Registrar Pool Backup Registrar Pool Alice’s Primary Registrar & User Services = EE Pool 2 SBA Joe’s Primary Registrar = SBA., User Services = EE Pool1 Data Center - EE Pool 2 Active Directory Presence Conferencing AD & DNS Registrar (Registration & Routing) Architecture: 1. Each user has a “Primary Registrar Pool”. Each Registrar Pool can have a “Backup Registrar Pool” 2. User’s client discovers a Registrar Pool through DNS SRV. Directed to “Primary & Backup Registrar Pool” 3. Backup Registrar heart-beats Primary Registrar. If heart-beat not received within Configurable Failover Interval (default = 120 sec for branch offices), Backup starts accepting client registrations 13 Branch Resiliency 14 Branch Resiliency Options Large Branch (>1000 users) Survivable Branch Server (or SE Server) AND Separate Media Gateway 1000 SBA Medium Branch (25-1000 Lync PSTN / WAN users) SBA 500 25 Small Branch (<25 users) No Local Infrastructure Or gateway (GW) Only 15 Users Branch Client Registration Scenarios Lync Server 2010 Pool Data Center Edge Server WAN Data Center Active Directory Lync Server 2010 Pool Edge Server Active Directory Lync Server 2010 Pool PSTN SBA Normal Mode PSTN WAN PSTN WAN Users Register with SBA SBA SBA SBA Signaling WAN Down No change SBA Down Users register with Backup Registrar Pool in the datacenter 16 Client Media and Signaling Paths WAN available Data Center Edge Server WAN Data Center Active Directory Lync Server 2010 Pool Edge Server Active Directory Lync Server 2010 Pool PSTN WAN PSTN WAN PSTN Intra-Branch calls Signaling/media don’t traverse wide area network (WAN) SBA SBA SBA routes the call between the users Media Signaling Inter-Branch/ HQ calls Signaling/media traverse WAN SBA routes the call to the other cluster Branch PSTN calls SBA sends the call to Public Switched Telephone Network (PSTN) No WAN involved 17 Client Voice Media and Signaling Paths Key failure scenarios Data Center Edge Server Data Center Active Directory Lync Server 2010 Pool Gateway Gateway Lync Server 2010 Pool WAN Edge Server Active Directory Lync Server 2010 Pool PSTN WAN PSTN PSTN WAN Intra-Branch calls No impact on two party voice/video /IM/App sharing SBA SBA SBA Audio conferencing possible (through PSTN) No Presence or Video conferencing Media Signaling SBA Down Inter-Branch/ HQ calls SBA initiates PSTN re-routing Clients register with Backup Registrar Pool in datacenter All functionality driven by that pool over the WAN Media is over PSTN (only voice) No loss in functionality 18 User Interface in Normal and Survivability Mode WAN Up All Features Available WAN Down Basic Voice Features Available Features Available when WAN Down PSTN Inbound and Outbound calls Intra-Site calls Inter site calls (PSTN Rerouting) Hold, Retrieve, Transfer Authentication, Authorization Voicemail Deposit (Redirect to Exchange UM in Data Center) Voicemail Retrieve (through PSTN) Call Forwarding, SimulRing Boss-Admin, Team-call Call Detail Records (CDR) All 2 Party Intra Site communications Audio Conferencing through PSTN Contact Search Features Unavailable Inter-site Data (IM, App Sharing, etc.) Conferencing (IM, Video and Web) Presence & DND based routing Modify Presence or Change Call Forwarding Settings Contact List Resilient Lync Server 2010 clients: Lync 2010, Attendant Console, Lync Phone Edition Response Group and Call Park 19 Branch Office: Lync Server 2010 Discovery WAN Branch Office Data Center 1. DNS SRV Query Active Directory Response: (Multiple) Director Pool FQDN AD DS & DNS 2. TLS to Director Pool. SIP Register Response: 401 Redirect to Cert Provisioning 5. SIP Register Bob Response: 301 Redirect Primary: SBA FQDN, Backup: EE Pool FQDN Lync Server 2010 Director 3. https;//PoolFQDN/CertSvc 6. SIP-TLS Register 200 OK (Cert Auth) Client caches SBA FQDN And IP Address SBA IIS - IWA Auth Get Certificate 4. Certificate Replication Lync Server 2010 Enterprise Edition Pool 20 Branch Office – Server Connectivity when WAN Is Down Data Center Branch Office TLS to cached FQDN and IP SIP Register (Cert Auth) SBA Active Directory AD DS & DNS 200 OK Bob WAN Branch Office DHCP Options 120 Query SBA FQDN Alice (new client) TLS to SBA FQDN SIP Register (Cert Auth) Lync Server 2010 Director DHCP Server or Lync DHCP on SBA SBA Lync Server 2010 EE Pool 200 OK 21 Data Center Voice Resiliency 22 Data Center Voice Resiliency (EE) Failover to backup data center North America Data Center Backup Registrar Features Available to Users In Case of Data Center Disaster PSTN Inbound calls (if Carrier offers this resiliency) Europe Data Center PSTN Outbound calls Intra-Site calls and Inter site calls Hold, Retrieve, Transfer Authentication, Authorization Lync Server 2010 Edge1 Lync Server 2010 Pool 1 WAN Lync Server 2010 Pool 2 Lync Server 2010 Edge2 2 Party Intra Site Instant Messaging (IM) and Audio/Video (A/V) Call Detail Records (CDR) Call Forwarding, SimulRing Boss-Admin, Team-call Failover Join conferences scheduled by users homed on other pool Features Unavailable • • • • • • Lync Server 2010 Pool. That Lync Server 2010 Pool directs client to primary and backup SIP registrar Client attempts connect to Primary Registrar Pool, if fails, connects to Backup Limited feature set available on failover Enable/Disable Automatic failover, Configurable Failover interval Automatic Failback, Configurable Failback interval (No manual failback. Workaround: Stop Front End Services on Primary Registrar pool servers) What happens if Primary Data Center cannot be restored? Conferencing Auto Attendant (AA) (through PSTN) Schedule IM, A/V & Web Conferences Presence and Do Not Disturb (DND) based routing Updating Call Forwarding settings Response Group Service & Call Park Voicemail Deposit (Redirect to Exchange UM in the datacenter) Voicemail Retrieve (through PSTN) 23 Data Center Voice Resiliency (SE) Failover to backup data center North America Data Center Backup Registrar Features Available to Users In Case of Data Center Disaster PSTN Inbound calls (if Carrier offers this resiliency) Europe Data Center PSTN Outbound calls Intra-Site calls and Inter site calls Hold, Retrieve, Transfer Authentication, Authorization Lync Server 2010 Edge1 Lync Server 2010 Pool 1 WAN Lync Server 2010 Pool 2 Lync Server 2010 Edge2 2 Party Intra Site Instant Messaging (IM) and Audio/Video (A/V) Call Detail Records (CDR) Call Forwarding, SimulRing Boss-Admin, Team-call Failover Join conferences scheduled by users homed on other pool Features Unavailable • • • • • • • SE Servers operate as separate systems Client DNS SRV request discovers (one or multiple) Lync Server 2010 SE. That Lync Server 2010 SE sever directs client to primary and backup SIP registrar Client attempts connect to Primary Registrar, if fails, connects to Backup Limited feature set available on failover Enable/Disable Automatic failover, Configurable Failover interval Automatic Failback, Configurable Failback interval (No manual failback. Workaround: Stop Front End Services on Primary Registrar servers) If Primary Data Center cannot be restored: • Restore Central management Server in backup datacenter • Restore other services including Presence, Conferencing by “moving” users to other Pool Conferencing Auto Attendant (AA) (through PSTN) Schedule IM, A/V & Web Conferences Presence and Do Not Disturb (DND) based routing Updating Call Forwarding settings Response Group Service & Call Park Voicemail Deposit (Redirect to Exchange UM in the datacenter) 24 Voicemail Retrieve (through PSTN) 24 Data Center Voice Resiliency Failover to backup data center (discovery) North America Data Center Europe Data Center Backup Registrar Lync Server 2010 Edge1 Lync Server 2010 Director Lync Server 2010 Pool Pool 1 WAN Lync Server 2010 Pool 2 Lync Server 2010 Edge2 Active Directory AD DS & DNS (1) 1. 2. 3. 4. 5. 6. (2) (3) (4) (5) (6) Client DNS SRV request. Example: DNS SRV for _sipinternaltls._tcp.contoso.com DNS SRV Response includes • Lync Server 2010 Director Pool.contoso.com:5061 Priority=0, Weight=10 • Pool2.contoso.com:5061 Priority=1 , Weight=10 Client connects via TLS to Lync Server 2010 Director Pool. Sends SIP Register. Authenticates Lync Server 2010 Director Pool redirects client. SIP 301 includes Primary & Backup Registrar pool If Primary Registrar Pool is available, client connects and registers with it Else client connects and registers with Backup Registrar Pool (Lync Server 2010 Pool 2) 25 Metropolitan Data Center Resiliency 26 Metropolitan Data Center Resiliency Lync Server 2010 Pool extended across two data centers NY Data Center NJ Data Center Features Available to Users If One Data Center goes Down Passive SQL Active SQL PSTN Inbound calls PSTN Outbound calls Intra-Site calls and Inter site calls Hold, Retrieve, Transfer Lync Server 2010 Edge FE 1-2 Low-Latency WAN FE 3-4 Lync Server 2010 Edge Authentication, Authorization 2 Party Intra Site IM and A/V Call Detail Records (CDR) Call Forwarding, SimulRing Boss-Admin, Team-call Voice Apps (CAA, Response Group, Call Park) • • • • Lync Server 2010 pools operate as one logical system • Split Front End pool across two datacenters (all FEs active) • SQL Geo cluster for backend (Stretched Virtual Local Area Network (VLAN)) • Data replication is done by storage arrays (Ex: EMC SRDF, HP CLX EVA) • Requires low latency WAN (15 milliseconds) In one site is down, clients are serviced by FEs in other site Nearly all features available • PSTN termination may affect inbound calls Failback has to be manually initiated Conferencing (IM, A/V and Web) Presence and DND based routing Updating Call Forwarding settings Features Available Depending on Ex-UM Deployment Voicemail Deposit Voicemail Retrieve 27 Metropolitan Data Center Resiliency Lync Server 2010 Pool extended across two data centers NY Data Center Features Available to Users If One Data Center goes Down NJ Data Center PSTN Inbound calls Passive SQL Active SQL PSTN Outbound calls Intra-Site calls and Inter site calls Hold, Retrieve, Transfer Authentication, Authorization Lync Server 2010 Edge Low-Latency WAN FE 1-2 FE 3-4 Lync Server 2010 Edge 2 Party Intra Site IM and A/V Call Detail Records (CDR) Call Forwarding, SimulRing Boss-Admin, Team-call Voice Apps (CAA, Response Group, Call Park) Conferencing (IM, A/V and Web) Presence and DND based routing Updating Call Forwarding settings DNS Srv DNS Server Pool.contoso.com Features Available Depending on Ex-UM Deployment Voicemail Deposit Voicemail Retrieve 28 Survivable Branch Appliance 29 Survivable Branch Appliance Centrally Managed Voice high availability for branches SBA Sold and supported by UC partners Appliance form factor with Hardened Windows Server 2008 R2 30 Survivable Branch Appliance Deployment Simple, easy, repeatable AD Computer Lync Server 2010 Administrator (Data Center) Add SBA to Topology Move Users to SBA Data Center • • SBA Drop Shipped w/ Software installed • • Technician connects to SBA-WEB based GUI IP Configuration • Join to Domain • • Branch • Enable Lync Server 2010 Replica SBA downloads topology & config Request/Assign Certs • • Start Lync Server 2010 services Finish GW Config Test PSTN ConnectivityMake/Receive call using Test User account 31 SBA – Central Management Centralizing move, add, changes Lync Server 2010 administrator (Data Center) • Change Normalization Rules • Change Routing Rules • Change User Policies • • • • Lync Server 2010 Central Management Server Store Configuration Replicate to SBA Data Center Survivable Branch Appliance Branch Replicated to the SBA 32 Survivable Branch Appliance Support and service • • SBA is supported and serviced by the SBA partner Windows Server Updates • • • SBA partner can act as the gatekeeper OR Customers can deploy updates Microsoft Lync Server 2010 updates • SBA partner will test and release product notice on whether a particular update can be applied on the SBA Original equipment manufacturer (OEM) notified about impending Lync Server 2010 Updates X-5 Biz Days OEM Downloads and tests the updates Compatibility issues Date X MSFT Update Partner addresses the issue if any Partner notifies the customer on updates compatibility X+30 Days 33 Survivable Branch Appliance Partner Solutions Audiocodes two form factor •M1k –1U, 4E1/T1, redundant power supply •MSBG-Firewall, Routing engine •M2k – 1U, 480 concurrent calls, redundant power supply Audiocodes: Existing M1k and M2k gateways can be converted to SBA Dialogic DMG 4000 •1U, 4 E1/T1 •Redundant power supply Dialogic: Existing DMG4000 Hybrid can be converted to SBA NET UX Series •1U, 8 E1/T1, Redundant power supply •MSBG-Firewall, Routing engine NET: New HW to support SBA. Migration plan for customers wanting to move to UX platform Ferrari •1U, 4 E1/T1, Redundant Power supply HP Procurve •Survivable Branch Appliance Module running on 54xx switch Chassis •Redundant Power supply, 8 E1/T1, Layer 2 functionality Ferrari: Existing R2 Hybrid gateway can be converted into SBA HP: Customers using ProCurve switches can convert it into a SBA by buying the module 34 Summary • Lync Server 2010 Architecture provides High Availability for Voice services • Survivable Branch Appliance built by UC partners delivers Voice High Availability for branch users • Metropolitan Data Center Resiliency delivers High Availability for all UC modalities across datacenters separated by high bandwidth and low latency • Voice Data Center Resiliency delivers Voice High Availability across geographically dispersed datacenters 35 Q&A 36 Appendix – Survivable Branch Appliance Partners 37 Dialogic® 4000 Media Gateway Series Survivable Branch Appliance • Dialogic® 4000 Media Gateway Series (DMG4000) • • • Broad Hybrid Gateway Product Line and Scalability • • • • • Complete Support for Microsoft Unified Communications • Qualified Hybrid Gateway for Office Communications Server 2007 R2 • • Certified for Exchange Server 2010 Unified Messaging Ready for Lync Server 2010 • • Performs ‘Survivable Branch Appliance’ Server Role (Lync Server 2010 Release) ‘Survivable Branch Appliance’ Survivability Features • • • • • Local PSTN Connectivity Local Lync Server 2010 Wave 14 and Analog Endpoint Registration Alternative Routing Rules Active Directory® Domain Services Look Up • • • Support for Advanced Technology Attachments (ATAs) and Foreign Exchange Station (FXS) Gateways from Grandstream and Cisco • • Easy to deploy - 1, 2, 4, 8, 24 port solutions Low cost per port • • • • Conference Phones, Fax Machines, Plain Old Telephone System (POTS), elevator phones, loading docks, checkout registers, paging systems, etc . Ready for Lync Server 2010 RTP: G.711, G.729, sRTP w/ TLS Robust Windows 2008 Server Platform • • Exclusive V.34 Fax / FoIP speed Exceeds Transcoding and Security Requirements • • Interface for any number of Analog Devices • Analog Foreign Exchange Office (FXO), Primary Rte Interface (PRI), Basic Rate Interface (BRI), Integrated Services Digital Network (ISDN), code access security (CAS), Q.SIG protocols SIP (RFC32XX), Microsoft SIP (TCP Transport), Microsoft Secure SIP: TLS T.30 (PSTN) Fax to T.38 (FoIP) • • Flexible, Cost Effective Support for Analog Devices • Complete PBX, PSTN, IP and Fax Protocol Support • Performs Mediation Server Role Field Software Upgrade to Survivable ‘Survivable Branch Appliance’ • • 8 port, 8 channel FXO - Analog to SIP 8 port, 16 channel - BRI to SIP 24/30 channel - T1/E1 to SIP 48/60 channels - dual T1/E1 to SIP 96/120 channels - Quad T1/E1 to SIP Dual Xeon® 3.0 Ghz processors offers more resource processing power for up to 120 UC sessions and more Dual redundant hard disk drives with redundant array of independent disks (RAID)-1 Dual power supply option Additional chassis slots for expanding TDM connectivity Value Added Software Options • • • • Centralized Management System Includes ESTOS CallControlGateway (5 licenses) Local Microsoft Shared Fax Server Included Support for Value Added 3rd Party Applications 38 NET Survivable Branch Appliance 4 T1/E1 WAN Ports USB Ports Mgmt Port Indicator Lights Description QTY Channel DSP modules 1- 6 USB ports 2+2 Line Cards (rear) 2 Gig-E Ports 8+1 mgmt AC power supplies 1 or 2 SFP Optical & Copper Ethernet Ports COM Express Interface (SBA) Gig-E LAN/WAN ports Fans 39 Ferrari Survivable Branch Appliance • Unique hardware approach for Microsoft Lync Server 2010 SBA • • HP Server Hardware Certified for Windows Server 2008 R2 High performance, high availability, Integrated Lights Out (ILO) remote management Integrated Gateway Hardware: 4 analog ports, 4 digital interfaces: 4 BRI or 2 BRI/2 PRI, expandable PCIe Gateway card 100% self contained, using LAN-interface + 40 HP Procurve Survivable Branch Appliance HP ProCurve ONE Services zl Module: • Intel Core 2 Duo • 250G HDD • 2 x 10G Ethernet connections to network fabric PSTN Gateway •Up to 8 T1/E1 ports •Up to 4 FXS or FXO ports Supported in zl series chassis: ProCurve Switch 8212zl • 5400zl for edge and branch (4U/7U) • 8200 with high availability for core and distribution (9U) 41 Audiocodes Survivable Branch Appliance Uniform Functionality, Manageability and Protocol Support • Entry level and Mid Range – Mediant 1000 SBA • Up to 4 E1/T1 + a mix of up to 20 BRI/FXS/FXO in 1U chassis • Up to 120 concurrent PSTN Calls, 1000 Users • Hardware and software scalability option • High End – Mediant 2000 SBA • Up to 16 E1/T1 in 1U chassis • Up to 480 concurrent PSTN Calls, 1000 Users • Software scalability option 42 © 2011 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. This document may contain information related to pre-release software, which may be substantially modified before its first commercial release. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION. Unless otherwise noted, the example companies, organizations, products, domain names, e-mail addresses, logos, people, places and events depicted herein are fictitious, and no association with any real company, organization, product, domain name, email address, logo, person, place or event is intended or should be inferred. 43