Visual results connect users with content

INDIA │ 18-20 august 2010
virtual techdays
FAST Search for SharePoint 2010
Allirajan Ramachandran │ Technology Specialist, Microsoft Corp
arajan@microsoft.com
Microsoft SharePoint 2010
The Business Collaboration Platform for the Enterprise and the Internet
Sites, Communities, Content, Search, Insights, Composites
Deliver the Best Productivity Experience
Cut Costs with a Unified Infrastructure
Rapidly Respond to Business Needs
User Interface is visual and actionable
Visual and conversational interaction with precise control
Sort on any field
Deep Refinement
Thumbnails
Similar Results
Previews
Built on SharePoint Search Center
Visual results connect users with content
Leverages all of the innovations in SharePoint
Open Web Parts, Federation, query suggestions, related queries, Did you mean?
Thumbnails for Word and PowerPoint
Visual Best Bets highlight premium content
Preview in browser without leaving the results
Visual Best Bets
Visual cues to highlight essential content
Visual Notification
Web Part Flexibility
Built on SharePoint Keywords
Matches keywords and synonyms that are contextually relevant to users. Include banners, videos, external websites.
Easy and quick to set up
Point and click setup for site admins. Set and forget with content expiration dates. Web Parts allow for easy page customization.
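The keyword, synonym and expiration behaviour described above can be sketched in a few lines. This is an illustrative model only, not the SharePoint Keywords implementation; the `BEST_BETS` structure, the banner name and the `match_best_bets` helper are assumptions made for the example.

```python
from datetime import date

# Hypothetical best-bet definitions: a keyword, its synonyms,
# the content to highlight, and a content expiration date.
BEST_BETS = [
    {
        "keyword": "erp",
        "synonyms": {"enterprise resource planning"},
        "banner": "ERP launch kit",  # illustrative banner title
        "expires": date(2010, 12, 31),
    },
]

def match_best_bets(query, today, best_bets=BEST_BETS):
    """Return banners whose keyword or synonym equals the query and has not expired."""
    normalized = query.strip().lower()
    hits = []
    for bet in best_bets:
        terms = {bet["keyword"]} | bet["synonyms"]
        if normalized in terms and today <= bet["expires"]:
            hits.append(bet["banner"])
    return hits
```

Once the expiration date passes, the banner silently stops matching, which is the "set and forget" behaviour the slide refers to.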
Deep Refinement
Enables precise control of results
Sorted by frequency
Enables conversational experience
You will never miss any content, enabling better findability and exploration across the entire result set
Provides a sorted view
Each refiner is sorted by frequency, from highest to lowest, indicating the importance of each term
Contains exact counts
Leads to discovering non-obvious relationships, key data trends, and deep analysis of your content
Exact Counts
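A minimal sketch of what "exact counts, sorted by frequency" means in practice: every value of a field is counted across the whole result set (not only the first page) and listed from most to least frequent. The sample documents and the `build_refiner` helper are assumptions for illustration, not FAST code.

```python
from collections import Counter

def build_refiner(results, field):
    """Count each value of `field` over the entire result set, most frequent first."""
    counts = Counter()
    for doc in results:
        for value in doc.get(field, []):
            counts[value] += 1
    return counts.most_common()

# Illustrative result set; titles and companies are sample data.
results = [
    {"title": "Sales Forecast", "companies": ["Contoso", "Tailspin Toys"]},
    {"title": "Account Plan", "companies": ["Contoso"]},
    {"title": "Market Review", "companies": ["Woodgrove Bank", "Contoso"]},
]
refiner = build_refiner(results, "companies")
```

Here `refiner` starts with Contoso, which appears in all three documents, followed by the companies that appear once.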
Customize search to meet your business needs
Key ingredients to a great customized search experience
Search in the language of your business
Deliver results that are contextually relevant
Tune relevancy to meet diverse needs
Process content with advanced linguistics
Customize the user experience to build engaging applications
Map metadata to Managed Properties
Automatic association of metadata to content
Crawled Properties
Standard document metadata discovered by the crawler or extracted from the full text by the FAST Content Processing Pipeline
Title: Sales Forecast
Companies: Contoso, Tailspin Toys, Woodgrove Bank, …
Expertise: Strategic Consulting, Market Analysis, IT Implementation, …
Industry: Financial Services, Manufacturing, Technology, ...
Any data can be found and mapped
Managed Properties
Map one or more Crawled Properties to a single field
Enables sorting, refinement, relevance tuning and fielded searching
Mapped in Central Admin or PowerShell
Index Schema:
Type: Managed Properties
Doc ID | Title | Author | Date | Size | Keywords | Companies | Expertise | Industry | ... | Body Text
123 | Sales Forec… | John Doe | 2010-04-15 | 386 KB | sales; pipe… | Contoso; Tail.. | Strategic Con.. | Financial Ser… | … | IT Market Gr…
456 | … | … | … | … | … | … | … | … | … | …
963 | … | … | … | … | … | … | … | … | … | …
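The idea of mapping one or more crawled properties to a single managed property can be sketched as a small merge step. This is a conceptual model, not the Central Admin or PowerShell mapping itself; the crawled property names (`ows_Company`, `extracted_companies`, `ows_Title`) and the `map_to_managed` helper are hypothetical.

```python
def map_to_managed(crawled, mapping):
    """Merge the values of several crawled properties into each managed property."""
    managed = {}
    for managed_name, crawled_names in mapping.items():
        values = []
        for name in crawled_names:
            value = crawled.get(name)
            if value is None:
                continue
            items = value if isinstance(value, list) else [value]
            for item in items:
                if item not in values:  # keep first occurrence, drop duplicates
                    values.append(item)
        if values:
            managed[managed_name] = values
    return managed

# Hypothetical crawled properties for one document.
crawled = {
    "ows_Company": ["Contoso", "Tailspin Toys"],            # document metadata
    "extracted_companies": ["Contoso", "Woodgrove Bank"],   # found in the full text
    "ows_Title": "Sales Forecast",
}
# Hypothetical mapping: managed property -> crawled properties feeding it.
mapping = {
    "Companies": ["ows_Company", "extracted_companies"],
    "Title": ["ows_Title"],
}
doc = map_to_managed(crawled, mapping)
```

The resulting `Companies` field holds a single de-duplicated list that sorting, refinement and fielded search can then operate on.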
What can I do with a Managed Property?
Metadata quality is critical to a good search experience
Enables deep refinement
Makes search conversational, guiding users to navigate and refine, while summarizing the results that are found
File Formats, Companies, Products, Concepts, and many more…
Precise hit counts in deep refiners are computed across the whole result set.
Enables precision relevancy
Managed properties are also used for relevancy tuning and ranking, multi-level sorting, and advanced (or fielded) search
How does it work?
Add refiners to user interface
Built on a SharePoint List or custom extractor
Edit the Search Center Results Page
Modify the shared web part by adding tags to the refinement panel XML
Create your own labels
Save and Publish
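The step of adding a tag to the refinement panel XML can be illustrated with a small script. The element and attribute names used here (`FilterCategories`, `Category`, `Title`, `MappedProperty`) are assumptions for the sketch; check the XML exported from your own refinement panel web part for the exact schema before editing it.

```python
import xml.etree.ElementTree as ET

def add_refiner(panel_xml, title, managed_property):
    """Append one refiner entry to the panel XML and return the updated XML text."""
    root = ET.fromstring(panel_xml)
    category = ET.SubElement(root, "Category")    # assumed element name
    category.set("Title", title)                  # the label shown to users
    category.set("MappedProperty", managed_property)
    return ET.tostring(root, encoding="unicode")

# Illustrative starting XML with a single existing refiner.
panel = '<FilterCategories><Category Title="Author" MappedProperty="author" /></FilterCategories>'
updated = add_refiner(panel, "Companies", "companies")
```

The `Title` value is where "Create your own labels" applies: it is the display name, while the mapped property identifies the managed property being refined on.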
Custom Collections
Context matters
Users need to access multiple types of content
Enterprise Content: Marketing, Customer Support, Sales, Manufacturing, R&D, Operations, Professional Services, HR, Legal, Finance
Depends on role, location, responsibility and task.
This can change day to day, or hour to hour.
...
Deliver results that are contextually relevant
with search that understands your business and role
Role-specific relevance
Targeted Best Bets / Visual Best Bets
Business driven refinement
"What should I know about selling ERP?" - Alan Brewer, Sales Lead
"What should I know about implementing ERP?" - Renee Lo, Consultant
Tune relevancy to meet diverse needs
A flexible solution for your organization, groups and individuals
Optimize Relevancy models
Quickly tailor relevancy for broad intranet use
Deliver the right results to the right people by creating new Rank Profiles
"I want to know about my customer Woodgrove Bank and customers in Financial Services" - Alan Brewer, Sales Lead
Query results with the default relevancy
New default sorting order: same results, different order
Promotes relevant Results
Documentation, RFPs and SOWs are now promoted
Users can select rank profiles in the sort by box or create their own default views by modifying the web part
"I want to get right to the technical documents" - Renee Lo, Consultant
Rank Profiles
Tune relevancy without impacting the default algorithm
Quality
Also known as static rank, consists of multiple managed properties
including site, URL depth (preference for shorter URLs), and
relative importance of links to this document.
Authority
Applies when the query word falls in the link or anchor text.
Query Authority
Maps the popularity of a document, or the click-through rate when documents are clicked as a result of a query
Freshness
Increases the relevancy if a document was recently created or
modified, based on the last modified property.
Proximity
Applies to where query terms fall and how close they are to each
other within a document
Context
Increases the rank of a document if the query term is a managed
property associated with that document
Managed Property
Affects relevancy when a managed property contains a specific value, such as Woodgrove Bank or Financial Services
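The way a rank profile changes ordering without touching the default algorithm can be sketched as a weighted sum plus managed property boosts. This is a conceptual illustration, not the FAST ranking formula; the weights, signal values, document ids and helper names are all assumptions.

```python
# Hypothetical default weights for the rank components listed above.
DEFAULT_WEIGHTS = {
    "quality": 1.0, "authority": 1.0, "freshness": 1.0,
    "proximity": 1.0, "context": 1.0,
}

def make_rank_profile(name, weight_overrides=None, property_boosts=None):
    """Copy the default weights, then apply overrides; the default stays unchanged."""
    weights = dict(DEFAULT_WEIGHTS)
    weights.update(weight_overrides or {})
    return {"name": name, "weights": weights, "boosts": dict(property_boosts or {})}

def score(doc, profile):
    """Weighted sum of signals, plus a boost per matching managed property value."""
    total = sum(w * doc["signals"].get(k, 0.0) for k, w in profile["weights"].items())
    for (prop, value), boost in profile["boosts"].items():
        if value in doc["properties"].get(prop, []):
            total += boost
    return total

def rank(docs, profile):
    return sorted(docs, key=lambda d: score(d, profile), reverse=True)

# Illustrative documents with made-up signal values.
docs = [
    {"id": "woodgrove-account-plan",
     "signals": {"quality": 0.5, "freshness": 0.2},
     "properties": {"Industry": ["Financial Services"]}},
    {"id": "factory-newsletter",
     "signals": {"quality": 0.6, "freshness": 0.3},
     "properties": {"Industry": ["Manufacturing"]}},
]
default_profile = make_rank_profile("default")
sales_profile = make_rank_profile(
    "sales", property_boosts={("Industry", "Financial Services"): 1.0})
```

With the default profile the newsletter ranks first; with the sales profile the Financial Services document is promoted, while the default weights remain as they were.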
Out of the box relevancy
Tuned for great general productivity experience, relevancy improves with click-throughs and link text analysis.
Extend the default algorithms
Create new default relevancy models. Blend static and dynamic ranking parameters to instantly improve search results.
How to create a Rank Profile
IT Pros are empowered to create new profiles quickly
Rank Profiles are created in PowerShell by extending the default relevancy algorithm…
… and are exposed in the user interface by modifying the sorting web part.
Process content with advanced linguistics
Automatic and detailed analysis creates a great search experience
FAST Content Processing Pipeline
Format Conversion: Converts documents in native file formats to plain text.
Language and Encoding Detection: Identifies the language and encoding that a document was written in.
Tokenization: Applies language-specific rules for identifying words, concepts, idioms and phrases.
Lemmatization: Finds the root of a given word. For English, it maps run, runs, running and ran back to a single lemma. Understands language-specific grammar and context.
Entity Extraction: Extracts terms from content that reflect important categories. Out of the box support for People, Companies and Locations, but can be extended to any category.
Date Time Normalization: Maps dates and times to a standard representation and handles locale-specific representations. For example, knows that 14-Mar-10 is equivalent to March 14, 2010.
Web Link Analysis: Analyzes the hyperlinks and anchor text, which reinforces the authority ranking of a document.
Document Vector: Creates a representation of the unique terms of a document and their frequency of occurrence. Used to find similar documents.
Map Crawled Properties: Maps discovered metadata.
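The date and time normalization example from the pipeline (14-Mar-10 being equivalent to March 14, 2010) can be sketched with the standard library. The list of accepted formats and the `normalize_date` helper are assumptions for illustration; the actual pipeline stage is not described in this deck.

```python
from datetime import datetime

# Assumed input formats; month names rely on an English (or C) locale.
FORMATS = ("%d-%b-%y", "%B %d, %Y", "%Y-%m-%d")

def normalize_date(text):
    """Return the date as ISO 8601 (YYYY-MM-DD), or None if no format matches."""
    for fmt in FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date().isoformat()
        except ValueError:
            continue  # try the next known representation
    return None
```

Both `normalize_date("14-Mar-10")` and `normalize_date("March 14, 2010")` map to the same standard representation, so the two spellings become equivalent for sorting and range queries.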
Breaks down content to the smallest addressable chunks to build meaning
Understands file encoding, data formats, and written languages
Supports 400+ file formats, 80+ languages
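The Document Vector stage in the pipeline above records a document's unique terms and their frequency so that similar documents can be found. A minimal sketch of that idea, assuming a naive tokenizer and cosine similarity (the deck does not say which similarity measure FAST uses):

```python
import math
import re
from collections import Counter

def document_vector(text):
    """Unique terms of a document with their frequency of occurrence."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two term-frequency vectors (0.0 when either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

# Illustrative documents.
forecast = document_vector("sales forecast for Contoso")
update = document_vector("Contoso sales forecast update")
party = document_vector("holiday party schedule")
```

Here `cosine(forecast, update)` is well above `cosine(forecast, party)`, which is the signal a "Similar Results" feature would build on.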
Robust query language
Use FAST Query Language (FQL) for precise query development
FQL provides a robust and expressive query language
Wildcard support - *, ?
Numeric Data types (Integer, Float, Decimal, Datetime)
Direct field access (e.g., title:othello, author:shakespeare)
Operators
Numeric (COUNT, RANGE, <, <=, >, >=)
Boolean (AND, OR, ANY, NOT)
Rank (RANK, XRANK)
Proximity (NEAR, ONEAR)
String (operator support for strings)
Boundary (starts-with, ends-with, equals)
Filter
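Queries that combine field access with the operators listed above can be composed programmatically as strings. The sketch below assumes the function-style FQL notation, such as `and(...)` and `near(..., N=3)`; verify operator spellings and quoting rules against the FQL reference before relying on them. The `term` and `op` helpers are hypothetical.

```python
def term(field, value):
    """Fielded term such as title:"othello"; rejects embedded quotes to keep the sketch simple."""
    if '"' in value:
        raise ValueError("quotes are not handled in this sketch")
    return f'{field}:"{value}"'

def op(name, *operands, **params):
    """Compose an operator call from operands and named parameters."""
    parts = list(operands) + [f"{key}={val}" for key, val in params.items()]
    return f"{name}({', '.join(parts)})"

# Direct field access combined with a Boolean operator.
play_query = op("and", term("title", "othello"), term("author", "shakespeare"))

# Proximity: both words within an assumed distance of 3 terms.
near_query = op("near", '"fast"', '"search"', N=3)
```

`play_query` evaluates to `and(title:"othello", author:"shakespeare")`, mirroring the direct field access example on the slide.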
FAST Search Server Architecture
Common platform scales with your business need
SharePoint Search
FAST Search for SharePoint
FAST Search Server 2010
User Experience
Information
Summary of architectural components
Content
Secure, unified access to information
Index or federate with content, applications, and services
OpenSearch Federation
Search Index
User Experience
Enterprise Content
Business Applications
Information Services
Indexing Connectors
Connect to all of your enterprise content
Extend your reach with Business Connectivity Services
Quickly connect to content
Use a consistent framework to quickly connect both inside and outside of SharePoint including content management systems, web services, databases, and line of business systems
Build custom connectors
Use SharePoint Designer to configure data model and connect to SharePoint. Connect to WCF services, or create your own .NET assembly connector with Visual Studio.
Simplified, powerful administration
A high-end enterprise search solution that’s easy to deploy and manage
FAST Search for SharePoint Scaleout
Back-end with extreme and flexible scale out options
No theoretical upper bounds!
Scale-out in multiple “dimensions”
Query Volume
Content Volume
Indexing freshness
Redundancy options
Query and Result Processing
Search and Indexing
Crawling and Content Processing
Performance targets*
30M Docs/node
50 QPS/node
35 docs/sec
*Depends on content and hardware specifics
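The per-node targets above allow a rough back-of-the-envelope sizing along the content and query dimensions. The sketch assumes content volume and query volume scale independently and that the targets hold for your content and hardware, which the footnote warns may not be the case; `size_farm` and its output keys are hypothetical.

```python
import math

DOCS_PER_NODE = 30_000_000  # target from the slide: 30M docs/node
QPS_PER_NODE = 50           # target from the slide: 50 QPS/node

def size_farm(total_docs, peak_qps):
    """Rough node estimate: partitions for content volume, replicas for query volume."""
    content_partitions = max(1, math.ceil(total_docs / DOCS_PER_NODE))
    query_replicas = max(1, math.ceil(peak_qps / QPS_PER_NODE))
    return {
        "content_partitions": content_partitions,
        "query_replicas": query_replicas,
        "search_nodes": content_partitions * query_replicas,
    }

estimate = size_farm(total_docs=100_000_000, peak_qps=120)
```

For 100 million documents at 120 queries per second this gives 4 content partitions and 3 query replicas, or 12 search nodes, before adding any nodes for redundancy or indexing freshness.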
RESOURCES
www.sharepoint.microsoft.com
www.microsoft.com/enterprisesearch
Channel9.msdn.com/posts/matthijs/fast-search-for-sharepoint-2010capabilities-deep-dive
Thank You