INDIA │ 18-20 August 2010 │ Virtual TechDays

FAST Search for SharePoint 2010
Allirajan Ramachandran │ Technology Specialist, Microsoft Corp
arajan@microsoft.com

Microsoft SharePoint 2010
The Business Collaboration Platform for the Enterprise and the Internet
  Sites, Composites, Communities, Insights, Content, Search
  Deliver the Best Productivity Experience
  Cut Costs with a Unified Infrastructure
  Rapidly Respond to Business Needs

User Interface is visual and actionable
Visual and conversational interaction with precise control
  Sort on any field
  Deep Refinement
  Thumbnails
  Similar Results
  Previews

Built on SharePoint Search Center
Visual results connect users with content
  Leverages all of the innovations in SharePoint: open Web Parts, Federation, query suggestions, related queries, "Did you mean?"
  Thumbnails for Word and PowerPoint
  Visual Best Bets highlight premium content
  Preview in browser without leaving the results

Visual Best Bets
Visual cues to highlight essential content
  Visual Notification: Include banners, videos, external websites.
  Built on SharePoint Keywords: Matches keywords and synonyms that are contextually relevant to users.
  Easy and quick to set up: Point and click setup for site admins. Set and forget with content expiration dates.
  Web Part Flexibility:
  Web Parts allow for easy page customization

Deep Refinement
Enables precise control of results
  Enables conversational experience: You will never miss any content, enabling better findability and exploration across the entire result set
  Provides a sorted view: Each refiner is sorted by frequency, from highest to lowest, indicating the importance of each term
  Contains exact counts: Leads to discovering nonobvious relationships, key data trends, and deep analysis of your content
  (Screenshot labels: Sorted by frequency, Exact Counts)

Customize search to meet your business needs
Key ingredients to a great customized search experience
  Search in the language of your business
  Deliver results that are contextually relevant
  Tune relevancy to meet diverse needs
  Process content with advanced linguistics
  Customize the user experience to build engaging applications

Map metadata to Managed Properties
Automatic association of metadata to content
  Crawled Properties: Standard document metadata discovered by the crawler or extracted from the full text by the FAST Content Processing Pipeline
  Managed Properties: Map one or more Crawled Properties to a single field. Enables sorting, refinement, relevance tuning and fielded searching
  Any data can be found and mapped
  Mapped in Central Admin or PowerShell
  Example crawled property values:
    Title: Sales Forecast
    Companies: Contoso, Tailspin Toys, Woodgrove Bank, …
    Expertise: Strategic Consulting, Market Analysis, IT Implementation, …
    Industry: Financial Services, Manufacturing, Technology, …

Index Schema (Type: Managed Properties, followed by Body Text)

  Doc ID | Title        | Author   | Date       | Size   | Keywords     | Companies       | Expertise       | Industry       | ... | Body Text
  123    | Sales Forec… | John Doe | 2010-04-15 | 386 KB | sales; pipe… | Contoso; Tail.. | Strategic Con.. | Financial Ser… | …   | IT Market Gr…
  456    | …            | …        | …          | …      | …            | …               | …               | …              | …   | …
  963    | …            | …        | …          | …      | …            | …               | …               | …              | …   | …

What can I do with a Managed Property?
  Metadata is also used for relevancy tuning, multilevel sorting and advanced search
  Metadata quality is critical to a good search experience
  File Formats, Companies, Products, Concepts, and many more…
  Precise hit counts in deep refiners are computed across the whole result set.

Enables deep refinement
Enables precision relevancy
  Makes search conversational, guiding users to navigate and refine, while summarizing the results that are found
  Managed properties are also used for relevancy tuning and ranking, multi-level sorting, and advanced (or fielded) search

How does it work?
Add refiners to user interface
  Built on a SharePoint List or custom extractor
  Edit the Search Center Results Page
  Modify the shared web part by adding tags to the refinement panel XML
  Create your own labels
  Save and Publish

Custom Collections

Context matters
Users need to access multiple types of content
  Enterprise Content: Marketing, Customer Support, Sales, Manufacturing, R&D, Operations, Professional Services, HR, Legal, Finance
  Depends on role, location, responsibility and task. This can change day to day, or hour to hour.
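As an illustration of the deep refinement behaviour described earlier (exact counts computed across the whole result set, each refiner sorted by frequency), a minimal Python sketch follows. It is not FAST Search code: the in-memory `results` list, the `refiner` and `refine` functions, and the sample documents are hypothetical, with property values borrowed from the Managed Properties example.

```python
from collections import Counter

# Hypothetical result set: each hit carries multi-valued managed properties.
results = [
    {"title": "Sales Forecast", "companies": ["Contoso", "Tailspin Toys"], "industry": ["Financial Services"]},
    {"title": "Account Plan", "companies": ["Contoso", "Woodgrove Bank"], "industry": ["Financial Services"]},
    {"title": "Plant Audit", "companies": ["Contoso"], "industry": ["Manufacturing"]},
    {"title": "IT Roadmap", "companies": ["Woodgrove Bank"], "industry": ["Technology"]},
]


def refiner(hits, prop):
    """Return (value, count) pairs for one managed property, most frequent first."""
    counts = Counter()
    for hit in hits:
        # Count each value once per document, over every hit in the result set.
        counts.update(set(hit.get(prop, [])))
    # Sort by descending count, then alphabetically so ties have a stable order.
    return sorted(counts.items(), key=lambda pair: (-pair[1], pair[0]))


def refine(hits, prop, value):
    """Narrow the result set to hits whose property contains the chosen value."""
    return [hit for hit in hits if value in hit.get(prop, [])]


for value, count in refiner(results, "companies"):
    print(f"{value} ({count})")
```

Selecting a refiner value (for example `refine(results, "companies", "Woodgrove Bank")`) and recomputing the counts on the narrowed set gives the conversational, step-by-step narrowing the slides describe.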
Deliver results that are contextually relevant with search that understands your business and role
  Role-specific relevance
  Targeted Best Bets / Visual Best Bets
  Business driven refinement
  "What should I know about selling ERP?" - Alan Brewer, Sales Lead
  "What should I know about implementing ERP?" - Renee Lo, Consultant

Tune relevancy to meet diverse needs
A flexible solution for your organizations, groups and individuals
  Optimize relevancy for broad intranet use
  Quickly tailor relevancy models
  Deliver the right results to the right people by creating new Rank Profiles
  Documentation, RFPs and SOWs are now promoted
  "I want to know about my customer Woodgrove Bank and customers in Financial Services" - Alan Brewer, Sales Lead
  Query results with the default relevancy
  Same results, different sorting order
  New default: promotes relevant results
  Users can select rank profiles in the sort by box or create their own default views by modifying the web part
  "I want to get right to the technical documents" - Renee Lo, Consultant

Rank Profiles
Tune relevancy without impacting the default algorithm
  Quality: Also known as static rank. Consists of multiple managed properties including site, URL depth (preference for shorter URLs), and relative importance of links to this document.
  Authority: Applies when the query word falls in the link or anchor text.
  Query Authority: Maps the popularity of a document, or the click-through rate when documents are clicked as a result of a query.
  Freshness: Increases the relevancy if a document was recently created or modified, based on the last modified property.
  Proximity: Applies to where query terms fall and how close they are to each other within a document
  Context: Increases the rank of a document if the query term is a managed property associated with that document
  Managed Property: Affects relevancy when a managed property contains a specific value, such as Woodgrove Bank or Financial Services

Out of the box relevancy / Extend the default algorithms
  Out of the box relevancy: Tuned for a great general productivity experience; relevancy improves with click-throughs and link text analysis.
  Extend the default algorithms: Create new default relevancy models. Blend static and dynamic ranking parameters to instantly improve search results.

How to create a Rank Profile
IT Pros are empowered to create new profiles quickly
  Rank Profiles are created in PowerShell by extending the default relevancy algorithm…
  … and are exposed in the user interface by modifying the sorting web part.

Process content with advanced linguistics
Automatic and detailed analysis creates a great search experience
FAST Content Processing Pipeline
  Format Conversion: Converts content from multiple file formats.
  Language and Encoding Detection: Identifies the written language and encoding of the content.
  Tokenization: Applies language-specific rules for identifying words. Can be extended with custom word breakers. Also applies to part numbers or telephone numbers.
  Lemmatization: Maps words to their root for a given language. For English it maps run, runs, running and ran back to a single lemma. Understands language-specific grammar and context.
  Entity Extraction: Extracts important concepts, terms and phrases into predefined categories. Out of the box dictionaries for People, Companies and Locations, but can be extended to any category.
  Date and Time Normalization: Finds dates and times and maps them to a standard representation. For example, knows that 14-Mar-10 is equivalent to March 14, 2010.
  Web Link Analysis: Analyzes hyperlinks and anchor text, which reinforces the authority ranking of a document.
  Document Vector: Creates a unique representation of a document that reflects terms and frequency of occurrence. Can be used to find similar documents.
  Map Crawled Properties: Maps metadata discovered in the content.

  Breaks down content to the smallest addressable chunks to build meaning
  Understands file encoding, data formats, and written languages
  Supports 400+ file formats, 80+ languages

Robust query language
Use FAST Query Language (FQL) for precise query development
  FQL provides a robust and expressive query language
  Wildcard support: *, ?
  Numeric data types (Integer, Float, Decimal, Datetime)
  Operators:
    Direct field access (e.g., title:othello, author:shakespeare)
    Numeric (COUNT, RANGE, <, <=, >, >=)
    Boolean (AND, OR, ANY, NOT)
    Rank (RANK, XRANK)
    Proximity (NEAR, ONEAR)
    String (operator support for strings)
    Boundary (starts-with, ends-with, equals)
    Filter

FAST Search Server Architecture
Common platform scales with your business need
  User Experience: SharePoint Search, FAST Search for SharePoint
  Information: FAST Search Server 2010

Summary of architectural components

Content
Secure, unified access to information
Index or federate with content, applications, and services
  OpenSearch Federation, Search Index, User Experience
  Enterprise Content, Business Information, Applications, Services

Indexing Connectors
Connect to all of your enterprise content
Extend your reach with Business Connectivity Services
  Use a consistent framework to quickly connect both inside and outside of SharePoint, including content management systems, web services, databases, and line of business systems.
  Quickly connect to content: Use SharePoint Designer to configure the data model and connect to SharePoint.
  Build custom connectors: Connect to WCF services, or create your own .NET assembly connector with Visual Studio.
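The FQL operators listed on the query language slide can be combined into a single query string. The Python sketch below only assembles such strings and does not talk to a FAST Search Server. The helper names (`phrase`, `field`, `operator`) are hypothetical, the managed property names `title`, `industry` and `companies` are assumed from the earlier index schema example, and the rendered syntax (prefix operators, `string(..., mode="phrase")`, `xrank(..., boost=...)`, `near(..., N=...)`) is an assumption to verify against the FQL reference before use.

```python
def phrase(text):
    """Wrap text as an FQL phrase string; embedded double quotes are rejected."""
    if '"' in text:
        raise ValueError("double quotes are not handled in this sketch")
    return f'string("{text}", mode="phrase")'


def field(name, expression):
    """Scope an expression to one managed property (direct field access)."""
    return f"{name}:{expression}"


def operator(name, *operands, **parameters):
    """Render name(operand, ..., key=value) in prefix notation."""
    parts = list(operands) + [f"{key}={value}" for key, value in parameters.items()]
    return f"{name}({', '.join(parts)})"


# Boolean AND over two fielded phrases (assumed managed property names).
base = operator(
    "and",
    field("title", phrase("sales forecast")),
    field("industry", phrase("Financial Services")),
)

# XRANK: same matches as `base`, with hits mentioning Woodgrove Bank ranked higher.
boosted = operator("xrank", base, field("companies", phrase("Woodgrove Bank")), boost=500)

# NEAR: the two terms must occur within a few tokens of each other.
nearby = operator("near", "sales", "pipeline", N=5)

print(boosted)
print(nearby)
```

Keeping the matching part (`and`) separate from the ranking part (`xrank`) mirrors the idea behind rank profiles: the result set stays the same while the order changes.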
Simplified, powerful administration
A high-end enterprise search solution that’s easy to deploy and manage

FAST Search for SharePoint Scale-out
Back-end with extreme and flexible scale-out options
  No theoretical upper bounds!
  Scale out in multiple “dimensions”: Query Volume, Content Volume, Indexing freshness, Redundancy options
  Query and Result Processing, Search and Indexing, Crawling and Content Processing
  Performance targets*: 30M docs/node, 50 QPS/node, 35 docs/sec
  *Depends on content and hardware specifics

RESOURCES
  www.sharepoint.microsoft.com
  www.microsoft.com/enterprisesearch
  Channel9.msdn.com/posts/matthijs/fast-search-for-sharepoint-2010capabilities-deep-dive

Thank You