4/5/2017 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.

Slides:



Advertisements
Similar presentations

Advertisements

SPS Nashville 2014 Dynamic Content using SharePoint Search SHAREPOINT SATURDAY NASHVILLE– APRIL 5, 2014 MIKE ORYSZAK BLOG: TWITTER:
Enterprise Search with FAST Rick McDannel Manager of Information Technology.
Major Sponsors Minor Sponsors. Overview What is a Search Driven Site?
Microsoft SharePoint 2010 technology for Developers
Goal One-to-OneOne-to-ManyMany-to-Many Communication- Centric Content- Centric.
S HARE P OINT 2013 S EARCH – A D EVELOPER ’ S P ERSPECTIVE Ryan McIntyre MCITP, MCPD Director, Portals & Collaboration.
Marketing Assistant Sales & Marketing Attorney Legal.
“ Leveraging SharePoint 2010 Search Technologies ” With: Ivan Neganov.
SharePoint 2007: Search Deep Dive Erik Mau Inetium.
   HTTP File shares SharePoint User profiles Lotus Notes Documentum Exchange folders Custom - BCS SharePoint SP Apps Devices Non-SP UX Search.
Windows SharePoint Services: Advancements In Document, Content, And Data Storage Dustin Friesenhahn OFF409 Program Manager Microsoft Corporation.
T Sponsors Sameer Chabungbam Principal Program Manager, Microsoft Connector API Apps BizTalk Summit 2015 – London ExCeL London | April 13th & 14th.
SharePoint de Contact Search and Find With SharePoint 2010 Thierry Gasser Technical Specialist Collaboration Platform
Technical Overview of FAST Search Server 2010 for SharePoint Sezai Komur SharePoint Solutions Architect CSG.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
You can do it! Quick, easy, powerful search (for free!) Complete intranet search High-end search delivered through SharePoint Basic search Intranet-wide.
Thank you SPSKC15 sponsors!. SharePoint 2013 Search Service Application (SSA) Ambar Nirgudkar Software Engineer
4/19/2017 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.
SharePoint Server 2013 Architecture and Identity
Ease of use with familiar tools for creating and consuming content Create, reuse and consume content for any device and language Surface the right.
SPC Developer 4/19/2017 © 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
ACL Solutions for Continuous Auditing and Monitoring John Verver CA, CISA, CMC Vice President, Professional Services & Product Strategy ACL Services Ltd.
Title Information First Lastname, Title August 2012 Software Assurance Planning Services.
Russ Houberg Senior Technical Architect, MCM KnowledgeLake, Inc.
Search Admin Content UX Crawl Content Processing Index Query ProcessingWFE Analytics Processing FAST Search Index SharePoint SP Apps Devices Non-SP.
Query Rules Item
4/20/2017 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.
Search Admin Content UX Crawl Content Processing Index Query ProcessingWFE Analytics Processing FAST Search Index Content Enrichment Web Service.
Quiz: Which is the CSWP? CATALOG DOCUMENTS ARTICLE S ASSET LIBRARY NAVIGATION Search Usage.
SPC095 Search Admin Content UX Crawl Content Processing Index Query ProcessingWFE Analytics Processing FAST Search Index SharePoint SP Apps Devices.
Windows.Net Programming Series Preview. Course Schedule CourseDate Microsoft.Net Fundamentals 01/13/2014 Microsoft Windows/Web Fundamentals 01/20/2014.
Search Topology and Optimization April 12, 2013 Mike Maadarani SharePoint Architect.
2 Copyright © 2009, Oracle. All rights reserved. Getting Started with Warehouse Builder.
First Look Clinic: What’s New for IT Professionals in Microsoft® SharePoint® Server 2013 Sayed Ali (MCTS, MCITP, MCT, MCSA, MCSE )
> The average person receives 63,000 words of new information every day. Average length of a novel: words If you wanted to read everything.
Oracle Application Express (Oracle APEX), formerly called HTML DB, is a Free rapid web application development tool for the Oracle database.
Building Search Portals With SP2013 Search. 2 SharePoint 2013 Search  Introduction  Changes in the Architecture  Result Sources  Query Rules/Result.
Project Overview Bibliographic merging, Endeca, and Web application.
Search 2013 Thierry Gasser Technical Solution Professional (TSP)
HTTP File shares SharePoint User profiles Lotus Notes Documentum Exchange folders Custom - BCS SharePoint SP Apps Devices Non-SP UX Search Admin Content.
SharePoint 2010 Search Architecture The Connector Framework Enhancing the Search User Interface Creating Custom Ranking Models.
Search Admin Content UX Crawl Content Processing Index Query ProcessingWFE API Analytics Processing Crawl Search Admin Link Analytics Reporting FAST.
User Experience Takes user input, displays results Search Engine Builds index, returns results Content Processing Retrieves content, prepares for indexing.
CaDSR Freestyle Search June 11, caDSR Freestyle Search Overview Architecture Implementation Dependencies Futures 2.
True PS C:\SPC231> Start-SPEnterpriseSearchCrawlContentSource.ps1 Recipes -Wait Wednesday, November 7, :47:08 AM Entered CrawlStarting.
Searching Business Data with MOSS 2007 Enterprise Search Presenter: Corey Roth Enterprise Consultant Stonebridge Blog:
© 2015 Ascendum Solutions. All rights reserved. Welcome To Create Dazzling End-user applications using SharePoint Search Speaker: Bill Crider #sharepointcincy2015.
Module 10 Administering and Configuring SharePoint Search.
SES310. Search Admin Content UX Crawl Content Processing Index Query ProcessingWFE API Analytics Processing Crawl Search Admin Link Analytics.
DB´s Properties Analytics Index Timer Jobs CrawlerLogs Events Configuration PowerShell analysisenginepssnapin.
0 SharePoint Search 2013 Rafael de la Cruz SharePoint Developer Seneca Resources twitter.com/delacruz_rafael
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
Microsoft Office SharePoint Server 2007 Enterprise Search Enterprise Search Overview.
Unplugged FAST meets SharePoint (FS4SP)
Búsqueda en SharePoint 2010: una introducción. Quick, easy, powerful search (for free!) Complete intranet search High-end search delivered through SharePoint.
Search Admin Content UX Crawl Content Processing Index Query ProcessingWFE API Analytics Processing Crawl Search Admin Link Analytics Reporting.
WINDOW SEARCH SERVER Topics  Topology  High-level Architecture  Performance  WSS vs. MOSS Search Comparison  Search Server 2008.
Search Admin Content UX Crawl Content Processing Index Query ProcessingWFE Analytics Processing FAST Search Index Public API Unit of scale/role.
Search Client Search _api/search SharePoint (Online) Team Sites SharePoint App JavaScript REST.NET CSOM OR.
Leveraging SharePoint Search In SharePoint 2013 Jameson Bozeman.
Configuring SQL Server for a successful SharePoint Server Deployment Haaron Gonzalez Solution Architect & Consultant Microsoft MVP SharePoint Server
Introduction to Enterprise Search Corey Roth Blog: Twitter: twitter.com/coreyrothtwitter.com/coreyroth.
Voyager Search. INTRODUCTION › Established in 2008 › Self-funded and privately owned › Geospatial search and data management › Leverages Open Source technology.
Architecting Search in SharePoint 2016
Microsoft Office SharePoint Server 2007 Enterprise Search
Business Connectivity Services in SharePoint 2010 and Office 2010
Search and Find With SharePoint 2010
11/16/2018 4:57 AM Creating Great End-User Experiences with Fast Search for SharePoint 2010 OSP322 Mark Stone Dan Benson © 2007 Microsoft Corporation.
Orchestration and data movement with Azure Data Factory v2
Presentation transcript:

4/5/2017 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Search Content Enrichment and Extensibility in SharePoint 2013 4/5/2017 SPC414 Search Content Enrichment and Extensibility in SharePoint 2013 Brent Groom Senior PFE Microsoft Sreedhar Mallangi Senior Consultant Microsoft © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Session Objectives Identify content extensibility points Tech Ready 15 4/5/2017 Session Objectives Identify content extensibility points Learn about custom connectors Learn the basics of content enrichment Advanced content enrichment Learn about two community Toolkits Almost all “on-prem” © 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Architecture overview TechReady 18 4/5/2017 Agenda Architecture overview Content ingestion Content enrichment Advanced enrichment © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

SharePoint 2013 Search Architecture Public API Query Features Extensibility Points Unit of scale/role boundary Content Crawl Content Processing Index FAST Search Index Query Processing WFE UX API Analytics Processing Crawl Link Search Admin Analytics Reporting Search Admin

SharePoint 2013 Search Architecture Public API Content Enrichment Web Service Extensibility Points Unit of scale/role boundary Content Crawl Content Processing Index FAST Search Index Query Processing WFE UX API Analytics Processing Custom Connectors Crawl Link Search Admin Analytics Reporting Search Admin

Crawl Component ... OOB connectors Extensible through BCS HTTP OOB connectors Extensible through BCS Local disk cache Crawled items tracked in Crawl database Configurations stored in Admin database Crawl modes Full Crawl Incremental Crawl Continuous Crawl File Shares SharePoint mssearch.exe Crawl Content Processing Index FAST Search Index User Profiles Exchange Lotus Notes Documentum ... Custom Crawl Search Admin Admin

Extending content processing Tech Ready 15 4/5/2017 Extending content processing Crawler You can customize the search experience through the extensibility points in the content processing flow Content Processing Component Delete Index Delete Links Update Security Descriptors Insert Parse Documents Register crawled properties Map to managed properties Security Descriptors Detect language Web Service Callout Web Service Callout Web Service Ifilter sandbox Phonetic name variations Word breaking Custom Entity Extraction Metadata Extration Document summary Analytics © 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Agenda Content ingestion Architecture overview Content enrichment TechReady 18 4/5/2017 Agenda Architecture overview Content ingestion Content enrichment Advanced enrichment © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Why? Enterprises have many different data sources TechReady 18 4/5/2017 Why? Enterprises have many different data sources We are building Enterprise Search Platforms Allow users to find the content they are looking for - all sources in one place Increase productivity No Search Content API anymore FAST ESP had a push based content API © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

OK! What do we have? Connector Default Solutions Custom solutions TechReady 18 4/5/2017 OK! What do we have? Connector Default Solutions Custom solutions Protocol Handlers File share SharePoint Website People Profile BCS Connector Framework Lotus Notes Exchange public folder Documentum BCS © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

What is Business Connectivity Services? SPC2012 - Developer 4/5/2017 What is Business Connectivity Services? Connects external data sources to SharePoint Can be used as a search source Has several flavors No-Code OData SQL Code WCF .NET Assembly B311@TechEd 2013 © 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Search Indexing Toolkit - SIT TechReady 18 4/5/2017 Search Indexing Toolkit - SIT A generic implementation of a Custom SharePoint Indexing Connector Generic Data Model File Implements all the complexities of Batching – for scalability Crawling – Full and Incremental Security Trimming – Both Active Directory security and Custom Claims security Hides all of that behind one single interface © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

What’s in the package? AdventureWorks Product DB Indexing Connector TechReady 18 4/5/2017 What’s in the package? XML Files Indexing Connector AdventureWorks Product DB Indexing Connector Implementing the ISearchConnector interface With a detailed How-To Guide Search Indexing Toolkit SIT Core Library SITModel.xml © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

SIT XML file connector Index Any XML File Flexible High Performance TechReady 18 4/5/2017 SIT XML file connector Index Any XML File The connector can split items on a configurable xml element Flexible All sub elements are submitted as crawled properties, no need to configure High Performance Testing has shown 100 DPS even on a laptop Scalable Crawl million of XML files © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Demo Indexing Wikipedia Abstracts TechReady 18 4/5/2017 Demo Indexing Wikipedia Abstracts Search Indexing Toolkit © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

SIT ISearchConnector interface 4/5/2017 SIT ISearchConnector interface ContentSource SIT Core Your Connector Initialize GetAllItems [id1,id2,id3..] offset crawlType changeToken changeTokenUpdate GetSpecificItem(id1) [id1’s properties] GetSpecificItemData(id1) id1’s data GetSecurityDescriptorForSpecificItem itemId, aclmeta, usesPluggableAuth id1’s security descriptor © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Item level security Tag each document with an NTLM security descriptor TechReady 18 4/5/2017 Item level security Tag each document with an NTLM security descriptor Content source supports NTLM? Pass-through the security descriptor Otherwise… Need to map to NTLM and create security descriptors If no NTLM available, use Custom claims Implement Custom claims provider or security trimmer © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Live Use cases Crawling XML files generated from 3rd party sources. TechReady 18 4/5/2017 Live Use cases Crawling XML files generated from 3rd party sources. SQL Server with security trimming SQL Server with related BLOB on file share © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

SIT back and relax! SIT Takeaways SIT reduces the complexity to create SharePoint Search connectors Enhance the Search experience SIT back and relax!

Agenda Content enrichment Architecture overview Content ingestion TechReady 18 4/5/2017 Agenda Architecture overview Content ingestion Content enrichment Advanced enrichment © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Business Use Cases What are your customers trying to do? 4/5/2017 Business Use Cases What are your customers trying to do? What would your customers like to do? Add DB or ERP meta-data into search results Clean-up or reformat existing properties to facilitate search Tag documents that violate corporate policy Label documents that contain known patterns Copy data from one managed property to another (including a type change) © 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Content Enrichment Web Service (CEWS) TechReady 16 4/5/2017 Content Enrichment Web Service (CEWS) Web service hosted outside of SharePoint Replaces SharePoint 2010 Pipeline Extensibility executable Optimized for performance (no need to read/write XML files, start a new process, etc) Input/output managed properties Web Service ProcessItem(Item) ProcessedItem Crawler Content Processing Index © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

SPC2012 - Developer 4/5/2017 CEWS Configuration Register with Search Service Application via PowerShell Endpoint URL of web service Input properties Managed properties passed in Output properties Managed properties that can be returned Include raw data? Optionally include raw data (read only) Debug mode Sends all input properties, ignores all output properties Error mode Warning or Error. In Error mode, failing items are dropped Trigger Test to determine if enrichment should be called (per document) © 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Average number of milliseconds spent on content enrichment

6 more things you need to know about CEWS TechReady 16 4/5/2017 6 more things you need to know about CEWS Properties must exist when you register Property names are case sensitive Cannot use property aliases Some standard properties can be confusing DisplayAuthors vs Author Some properties are read-only (body!) Single web service per Search Application © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Agenda Advanced enrichment Challenges and techniques TechReady 18 4/5/2017 Agenda Architecture overview Content ingestion Content enrichment Advanced enrichment Challenges and techniques © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Doing it in production: the challenges TechReady 16 4/5/2017 Doing it in production: the challenges Scale-out Increase capacity to match farm Large topology ≈ 144 flow instances Fault tolerance Survive hardware failures without loss of functionality Service aggregation Multiple enrichment tasks to support disparate content sources © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Doing it in production: techniques TechReady 16 4/5/2017 Doing it in production: techniques WCF Routing Introduced in .NET 4.0 100% declarative, configured in Web.config xml Applies Xpath filters against request to determine destination endpoint Supports backup destination endpoints to achieve Fault Tolerance Load Balancing Hide multiple end points behind a load balancer to provide Scale and Fault Tolerance “Localhost” Register web service on localhost and run instance on each content processing node Scales with content processing Provides Fault Tolerance with that content processing node http://aka.ms/Pqkjjj © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Agenda Advanced enrichment CEWS Pipeline Toolkit Architecture overview TechReady 18 4/5/2017 Agenda Architecture overview Content ingestion Content enrichment Advanced enrichment CEWS Pipeline Toolkit © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

CEWS Pipeline Toolkit Enhance Search Index Architecture Document markup Entity extraction Architecture WCF XML config Hides the complexities of Scalability Service aggregation Conditional processing Powerful framework for content enrichment

CEWS Pipeline Toolkit – What does it do? Extract entities String matching Regular Expressions Dictionary-based Normalize Access external repositories Manipulate strings

CEWS Pipeline Toolkit – What’s in the package? TechReady 18 4/5/2017 CEWS Pipeline Toolkit – What’s in the package? Framework for document analysis Solves majority of customer business use cases Packaged with over 55 pipeline stages Configurable document routing Platform support SharePoint 2013 Enterprise Search FAST Search For SharePoint 2010 Stand-alone Easy to install, Easy to Customize Visual Studio 2012 & .NET 4.5 Framework Inherit from AbstractDocumentProcessor class Detailed documentation on TechNet Wiki – Help the community © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

CEWS Pipeline Toolkit architecture Initialize Web Service Pipeline config xml ProcessItem(Item) ProcessedItem Crawler Content Processing Index

Demo Wikipedia category Total population CEWS Pipeline Toolkit

Future – Community Effort TechReady 18 4/5/2017 Future – Community Effort Data Wikipedia Fileshare DB – Adventure Works Web Services Deploy Demo POC Dev QA Production Display Custom Search Center Search App © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

CEWS and SIT – Join the community effort Canned prototypes for search POCs Several sample scenarios to leverage in your project Simple to deploy and use Production ready

MCS Contact Premier Contact Public Available Date TechReady 18 4/5/2017 How to get these tools MCS Contact Premier Contact Public Available Date Going through the legal process. Will be made available publicly once approved. © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

In Review: Session Objectives Tech Ready 15 4/5/2017 In Review: Session Objectives Identified extensibility points in content acquisition Saw how to customize the content processing pipeline via code callout. Learned how to use SIT. Dove into advanced content enrichment topics (CEWS Pipeline Toolkit) © 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Search Related Sessions Room Time Develop Advanced Search-Driven SharePoint 2013 Apps SPC402 Palazzo I, J Tue 1:45pm Best practices for Hybrid Search deployments SPC306 Veronese 2401 Tue 5:00pm SharePoint 2013 Search Analytics SPC340 Palazzo M, N Wed 9:00am How to manage and troubleshoot Search: A practical guide SPC375 Wed 10:45am 6 Proven Steps to Get the Best Out of Search in SharePoint 2013 SPC265 Delphino 4001 Wed 1:45pm Best practices for Information Architecture and Enterprise Search SPC207 Search content enrichment and extensibility in SharePoint 2013 SCP414 Palazzo K, L Customizing Search experiences with Azure Hosted Data and Bing Maps SPC321 Wed 3:15pm Futuristic Search applications using Kinect and Yammer! SPC405 Palazzo M, N Search architecture and sizing in SharePoint 2013 SPC336 Titian 2201 Wed 5:00pm Effective Search deployment and operations in SharePoint 2013 SPC360 Thu 9:00am SharePoint 2013 Search display templates and query rules SPC322 Managing Search Relevance in SharePoint 2013 and O365 SPC382 Thu 12:00pm Search Related Sessions See you at the Search booth’s & Search tables at Asks the Experts WED @6:15!

connect. reimagine. transform. 4/5/2017 Sponsored by MySPC Evaluate sessions on MySPC using your laptop or mobile device: myspc.sharepointconference.com connect. reimagine. transform. © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

4/5/2017 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION. © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.