Search and Find With SharePoint 2010 Thierry Gasser Technical Specialist Collaboration Platform Thierryg@microsoft.com SharePoint de Contact Agenda •SharePoint 6 pillars Overview •Search versions and Product comparison •Demos •Search Architecture and customization info SharePoint de Contact SharePoint 2010 Pillars • Business Connectivity Services • InfoPath Form Services • External Lists • Workflow • SharePoint Designer • Visual Studio • API Enhancements • REST/ATOM/RSS • • Ribbon UI SharePoint Workspace • SharePoint Mobile Office Client and Office Web App Integration • Standards Support • • Composites Sites • PerformancePoint Services • Excel Services • Chart Web Part • Visio Services • Web Analytics • SQL Server Integration • PowerPivot Insights Communities Search • • Social Relevance Phonetic Search • Navigation • FAST Integration • Enhanced Pipeline SharePoint de Contact Tagging, Tag Cloud, Ratings • Social Bookmarking • Blogs and Wikis • My Sites • Activity Feeds • Profiles and Expertise • Org Browser Content • • • Enterprise Content Types Metadata and Navigation • Document Sets • Multi-stage Disposition Audio and Video Content Types • Remote Blob Storage • List Enhancements Enterprise Search is Transforming Business Connecting people to information, driving better outcomes On a company website On a corporate network [Screenshot] [Screenshot] search helps your customers get what they want search helps your employees get their jobs done increasing revenue cutting costs SharePoint de Contact The Search 2010 Wave Internet Business Productivity FAST Search Stand Alone FAST Search for Internet Sites (ESP) for Internal Apps (ESP) Search Server 2010 Search Express2010 SharePoint for Internet Sites, Enterprise (can be FS4SP or SP2010 search) SharePoint + Enterprise CAL + FS4SP SharePoint Integrated SharePoint for Internet Sites, Standard SharePoint de Contact SharePoint Search + Standard CAL SharePoint Search Better Answers, Faster Knowledge Amplification Enterprise Deployment SharePoint de Contact FAST Search for SharePoint and ESP People & expertise search Presence Ranking via Social distance Expertise Finding Search Profiles User Context Pipeline Customization Editing XML Config files Writing python code Spy Stages GUI configuration PowerShell Managed Code Add-ons Separate + specialized • Featured Content • Unity • GeoSearch Integrated + extensible • Visual Best Bets • SharePoint Federator • Not OOB, done via sortformula() Today: partners or services Text analytics Classification Relationship Extraction Configuring different groups search behavior + more tomorrow: FS for SharePoint is the platform for future innovation SharePoint de Contact Capabilities from F4SP not in ESP (1/2) In FS4SP not in ESP Taxonomy management OOB GUI FAST ESP Only available via partners (such as Teragram/SAS) taxonomy and taxonomy management tools + Partners (Teragram/SAS, Factiva, Concept Searching, SchemaLogic, …) ESP does not ship with a production OOB GUI built from production web GUI typically GUIs are custom designed parts. or developed with a toolkit no-code customization via web parts. People Search Only Index information about people not integrated people and expertise search, relevancy tuned to people, phonetic search OOB, nicknames, etc. Easy Administration ESP has specialized tool for search administration with specific in-class training required Not available with ESP Relevance improves with use FS4SP SharePoint de Contact people and expertise search (source from SP) including phonetic search capabilities. Closely integrated with Active Directory and SharePoint social computing capabilities. Integrated admin UI Scriptable via PowerShell relevance based on clickthrough, provide related queries based on popularity, etc. Capabilities from F4SP not in ESP (2/2) In FS4SP not in ESP FAST ESP FS4SP Business Connectivity Services and BDC ESP has a connector toolkit Special programming skills to create new connectors Custom search analytics Possible via custom projects which Built-in query and click log access export logs, combine with business and extensible and customizable clickstream data, and do BI Claims Authorization security Not available with ESP security mechanism which can be used with federated ID, forms authorization, ADFS, SAML, etc.). 64 bit native app Not available with ESP FS4SP and all of SharePoint is a native 64 Development Tools with built-in tooling Not available with ESP SPD and VS2010 - tooling specifically for SharePoint applications including FS4SP Query Suggestions based on social behavior Not available with ESP search logs based (popular queries on closely related subjects out-of-thebox.) SharePoint de Contact Use BCS as a broad framework Demo Search Live MS search http://msw/searchcenter/Pages/default.aspx -> Fast for SharePoint 2010 -> People search, skills. SharePoint de Contact SharePoint Search Architecture ... SharePoint Sites SSA Database Metadata Database Crawl Database Index Repositories Remote File Shares Send Query to full text index Send Query Query Processor WFEs Get Results Query servers Get Results from full text index Index Servers Custom Repositories Propagate index + metadata to query server & DB Web Sites SharePoint de Contact ... FAST Search for SharePoint Integration Architecture SSA Database Metadata Database Crawl Database Index Repositories using SharePoint gatherer Index Servers Query request to FAST location WFEs Show results in SharePoint Final search results Query Processor Query servers Send Query to FAST backend Query Modified Processing Query Query & Results Search Server Results Results Processing ... SharePoint Sites Remote File Shares Send content to FAST for indexing Custom Repositories Content Search Engine Processin g servers Propagate index to search engine Index SharePoint de Contact Interface Index Repositories using FAST system Web Sites` ... Format Conversion Language Detection Stemming … Property Extraction … Mapper FS4SP – Data Flow Search Center People Search Federation OpenSearch Item Processing Pipeline Content Query Matching Query Processing Item Processing Crawler Indexing Index Column Web Link Analysis Crawl DB User Profiles SharePoint Server(s) FAST Server(s) SharePoint de Contact Other Server(s) FS4SP – Minimum Deployment Single-tier setup Two-tier setup SP2010 Farm FAST Search for SP 2010 Farm SP2010 Farm FAST Search for SP 2010 Farm All roles All roles All roles All roles Crawl DB Search Admin DB SQL 2008 Cluster SharePoint de Contact FS4SP – Large Deployment SP2010 Farm WFE Query SSA WFE Query SSA SP Crawl People Crawl SP Crawl People Crawl SP Crawl Crawl DB Search Admin DB SQL 2008 Cluster FAST Search for SharePoint 2010 Farm FAST-ADM-1 FAST-ADM-2 Admin Content Distributor 2 ConfigServer Web Analyzer Spelltuner 12 Docprocs+ SamAdmin Content Distributor 1 Web Analyzer 12 Docprocs+ FAST-ADM-3 Web Analyzer 12 Docprocs+ FAST-FSTIDX-11 Index (Search) 12 Docprocs+ FAST-FSTIDX-12 Index (Search) 12 Docprocs+ FAST-FSTIDX-13 Index (Search) 12 Docprocs+ FAST-FSTIDX-14 Index (Search) 12 Docprocs+ FAST-FSTIDX-15 Index (Search) 12 Docprocs+ FAST-FSTIDX-16 Index (Search) 12 Docprocs+ FAST-FSTIDX-21 (Index) Search QR Server FAST-FSTIDX-22 (Index) Search QR Server FAST-FSTIDX-23 (Index) Search QR Server FAST-FSTIDX-24 (Index) Search QR Server FAST-FSTIDX-25 (Index) Search QR Server FAST-FSTIDX-26 (Index) Search QR Server SharePoint de Contact FS4SP – Server Calculation Matrix Max item count (in Millions) 1 10 40 100 150 200 500 Adm + WA 0 1 2 3 5 6 10 Indexers (1 row) 1 1 3 6 10 14 34 SharePoint Crawlers 0 1 2 3 5 6 16 Crawl DB Server 0 1 1 1 1 2 2 Redundancy 1 2 3 6 10 14 34 Total 2 6 11 19 31 42 96 Disclaimer: This table is based on early testing and results from an internal dogfood project. The numbers might not be representative for the customer environment and data. Please use caution when using these numbers for sizing. SharePoint de Contact Customization • Search center based on Web parts • XSLT can be used to change the Result • Queries can be developed with XML and/or .NET – >Eg: www.swiss.ch SharePoint de Contact © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION. SharePoint de Contact