Making the Most of Your Content.



About the Speaker
Rem Purushothaman
• 19+ years in information access
• 9+ years in enterprise search
• 16 million unique visitors per day
• Among highest search volumes on the web
• Areas of focus
  • Intranet, Internet, and Extranet projects across all major verticals

Some Knowledge of Document Management in SharePoint

ECM in 2013 – Convergence & Usability
Individual, Team, Organization

ECM and Search
Sources: http://searchpatterns.org, www.information-management.com

Common Challenges
• Missing or poor metadata
• Duplicate documents
• Siloed information
• Repositories with unknown content
• Incorrect security or retention

Haroon Suleman, enterprise search architect at Mercer, deploys search across 40 million critical documents stored in file systems, SharePoint, Livelink, and the corporate personnel directory. Noting that a query for “(client name) plus proposal” will get thousands of hits, he concentrates on deduplication and entity extraction and “lets the search engine do the hard work wherever possible.”
Source: Forrester Research
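The deduplication idea mentioned on this slide can be illustrated with a content fingerprint. This is a minimal Python sketch, not the approach used at Mercer: it only catches exact duplicates after whitespace and case normalization, and the function names and sample paths are hypothetical.

```python
import hashlib
from collections import defaultdict

def fingerprint(text: str) -> str:
    """Hash the text after collapsing whitespace and lowercasing it."""
    normalized = " ".join(text.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

def find_duplicates(docs: dict) -> list:
    """Return groups of document paths whose normalized text is identical."""
    groups = defaultdict(list)
    for path, text in docs.items():
        groups[fingerprint(text)].append(path)
    return [sorted(paths) for paths in groups.values() if len(paths) > 1]

# Hypothetical documents spread across three repositories.
docs = {
    "fileshare/proposal_v1.txt": "Proposal for  Contoso\nQ3 pricing",
    "sharepoint/proposal.txt": "proposal for contoso q3 pricing",
    "livelink/notes.txt": "Meeting notes",
}
print(find_duplicates(docs))
```

Near-duplicate detection (for example, two versions of a proposal that differ by one paragraph) would need a similarity measure rather than an exact hash.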

The Long Tail of ECM
[Figure: long-tail diagram. Labels: Traditional ECM, Holistic Approach, Managed Content, Unmanaged Content, User Participation, Content Types]
• Reduce cost and complexity
• Scale to 100% of users and content under management

Metadata is Essential
• List Columns
• Managed Metadata
• Local term sets
• Global term sets
• Managed in the context
• Managed out of context
• Static
• Dynamic
• Managed by admins
• Managed by owners
• Can be imported from CSV

Without Metadata, it Gets Really Hard
<HEAD>
<TITLE>Stamp Collecting World</TITLE>
<META name="description" content="Everything you wanted to know about stamps, from prices to history.">
<META name="keywords" content="stamps, stamp collecting, stamp history, prices, stamps for sale">
</HEAD>
Enterprise Search
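To show why the explicit tags in this example make a page easy to index, here is a minimal Python sketch that pulls the title, description, and keywords out of the slide's HTML using the standard library's `html.parser`. The `MetaExtractor` class name is an assumption made for illustration.

```python
from html.parser import HTMLParser

class MetaExtractor(HTMLParser):
    """Collect the <title> text and every <meta name=... content=...> pair."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        attrs = dict(attrs)
        name = attrs.get("name")
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and name:
            self.meta[name.lower()] = attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data

page = """<HEAD>
<TITLE>Stamp Collecting World</TITLE>
<META name="description" content="Everything you wanted to know about stamps, from prices to history.">
<META name="keywords" content="stamps, stamp collecting, stamp history, prices, stamps for sale">
</HEAD>"""

parser = MetaExtractor()
parser.feed(page)
keywords = [k.strip() for k in parser.meta["keywords"].split(",")]
print(parser.title, keywords)
```

Most enterprise documents carry no such tags, which is the slide's point: the metadata has to come from somewhere else.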

Metadata in ECM and Search DEMO 

2 Components to Machine-Made Metadata
Content tagging:
• Concepts (vector based information)
• Entities (noun phrases)
• Author specific tags (explicit content tags)
Content classification:
• Hierarchy (taxonomy)
• Object to object classification (sets)
• Rules (linguistics, semantic, etc.)

Entity Extraction vs. Auto Classification
Entity extraction:
• Closed vocabulary: projects (OOB from term store)
• Open vocabulary: organizations (OOB in SP2013)
• OOB, custom, and 3rd party
• People, places, domain-specific (proteins, courts, …)
• Each entity is detected each time (can provide counts)
Auto classification:
• Hierarchical classifiers (not OOB, add-on)
• Works on hierarchies
• Europe->France->Paris vs. North America->USA->Maine->Paris
• Tags once for whole document
“Process in place” vs. “Process during indexing”
Example (from the slide's figure):
• Title: Sales Forecast
• Companies: Contoso, Tailspin Toys, Woodgrove Bank, …
• Expertise: Strategic Consulting, Market Analysis, IT Implementation, …
• Industry: Financial Services, Manufacturing, Technology, ...
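The difference between the two techniques on this slide can be sketched in a few lines of Python. Entity extraction reports every hit of a vocabulary term (with counts), while classification assigns one taxonomy path to the whole document. The vocabulary, the keyword rules, and both function names are hypothetical; real classifiers are far more sophisticated than keyword voting.

```python
import re
from collections import Counter

# Hypothetical closed vocabulary (on the slide this would come from the term store).
COMPANIES = ["Contoso", "Tailspin Toys", "Woodgrove Bank"]

# Hypothetical rules: keywords that vote for one node of a hierarchy.
TAXONOMY_RULES = {
    ("Industry", "Financial Services"): ["bank", "loan", "forecast"],
    ("Industry", "Manufacturing"): ["factory", "assembly", "toys"],
}

def extract_entities(text, vocabulary):
    """Entity extraction: count every occurrence of each vocabulary term."""
    counts = Counter()
    for term in vocabulary:
        pattern = r"\b" + re.escape(term) + r"\b"
        hits = len(re.findall(pattern, text, flags=re.IGNORECASE))
        if hits:
            counts[term] = hits
    return counts

def classify(text, rules):
    """Classification: pick a single taxonomy path for the whole document."""
    words = Counter(re.findall(r"[a-z]+", text.lower()))
    scores = {path: sum(words[k] for k in kws) for path, kws in rules.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None

text = ("Sales forecast: Contoso expects a loan from Woodgrove Bank. "
        "Contoso and Woodgrove Bank will review the forecast in May.")

print(dict(extract_entities(text, COMPANIES)))
print(classify(text, TAXONOMY_RULES))
```

Here the extractor returns two companies with a count of two each, while the classifier returns exactly one path, which mirrors "each entity is detected each time" versus "tags once for whole document".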

The “Term Store”
Service management
Term Store -> Group -> Term Set -> Term
• 30k terms per term set (max 1 million total)
• Many terms per group
• Description • Translations • Custom properties
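The hierarchy and limits on this slide can be modeled as a small data structure. This is a plain Python illustration, not the SharePoint taxonomy API: the class and method names are assumptions, nested terms are ignored, and the two limits are simply the figures quoted on the slide.

```python
from dataclasses import dataclass, field

MAX_TERMS_PER_SET = 30_000    # limit quoted on the slide
MAX_TERMS_TOTAL = 1_000_000   # limit quoted on the slide

@dataclass
class Term:
    name: str
    description: str = ""
    translations: dict = field(default_factory=dict)       # e.g. {"fr": "..."}
    custom_properties: dict = field(default_factory=dict)

@dataclass
class TermSet:
    name: str
    terms: list = field(default_factory=list)

@dataclass
class Group:
    name: str
    term_sets: list = field(default_factory=list)

@dataclass
class TermStore:
    groups: list = field(default_factory=list)

    def total_terms(self):
        """Count terms across every term set in every group."""
        return sum(len(ts.terms) for g in self.groups for ts in g.term_sets)

    def add_term(self, term_set, term):
        """Append a term, enforcing the per-set and store-wide limits."""
        if len(term_set.terms) >= MAX_TERMS_PER_SET:
            raise ValueError("term set is full")
        if self.total_terms() >= MAX_TERMS_TOTAL:
            raise ValueError("term store is full")
        term_set.terms.append(term)

# Hypothetical content: one group holding one term set with two terms.
store = TermStore()
group = Group("Corporate")
industries = TermSet("Industry")
group.term_sets.append(industries)
store.groups.append(group)
store.add_term(industries, Term("Financial Services",
                                translations={"fr": "Services financiers"}))
store.add_term(industries, Term("Manufacturing"))
print(store.total_terms())
```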

Longitude AutoClassifier
1. Create taxonomy from example resources and documents
2. Automatically classify documents to taxonomies
3. Use taxonomy to find and explore content
[Figure: architecture diagram. Labels: Taxonomy, Content Repositor(ies), Content Connectors and Crawler Manager, Term Store (SharePoint), Taxonomy Service, Search Index, Annotator, Indexer]

Creating Metadata by Machine DEMO 

Meta-data Driven Scenarios
• Scope ranges from local to global
• Metadata “control” ranges from formal managed taxonomies through social tags

Content

How Much do They Actually Index?
[Figure values: 1%, 5.6%, 1.7%]
Size of the web: > 1,000,000,000,000 unique links to pages found
Source: Google Blog

What do I Index First?
Prioritize data sources
Favor:
• Highly authoritative
• Important to largest audience
• Less complexity
Avoid:
• Low authority
• Small audience
• Highly complex
[Figure: data sources numbered 1 to 7, plotted by authority (High, Medium, Low), size of audience (High, Average, Low), and complexity (low, normal, high); corner labels "High business value" and "Difficult ROI"]
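The favor/avoid guidance on this slide can be turned into a rough ranking. The Python sketch below assumes equal weights for the three factors and uses made-up source names; both are illustrative choices, not something the slide specifies.

```python
from dataclasses import dataclass

LEVELS = {"low": 1, "medium": 2, "high": 3}

@dataclass
class Source:
    name: str
    authority: str    # "low", "medium", or "high"
    audience: str     # size of audience
    complexity: str   # effort needed to connect and crawl

def priority(source):
    """Higher is better: reward authority and audience, penalize complexity."""
    return (LEVELS[source.authority]
            + LEVELS[source.audience]
            - LEVELS[source.complexity])

# Hypothetical candidate sources.
sources = [
    Source("Legacy file share", "low", "low", "medium"),
    Source("Engineering PLM system", "high", "low", "high"),
    Source("HR policies (SharePoint)", "high", "high", "low"),
]

ranked = sorted(sources, key=priority, reverse=True)
for s in ranked:
    print(priority(s), s.name)
```

In practice the weights would be tuned to the organization, but even a crude score like this makes the "index first" decision explicit and repeatable.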

A Note on Indexing…
• If it’s not indexed, you can’t find it!!
• You can index content from anywhere, not just SharePoint

Configuring and Extending Crawling
• Configure OOB connectors and crawls
  • Scheduling! Be aware of your sources and full/incrementals
• Build your own connector
  • Built from SPD for simple databases and web services
  • Built connectors shared across SharePoint Search and FAST search
• Buy prebuilt 3rd party connector
  • Knowledge of source
  • Handle complex security, metadata enhancement
• Leverage framework and build on it
  • Managed .NET assembly BCS connector or custom BCS connector
  • 3rd party frameworks

Writing Your Own Connector
Tailoring crawls by getting into code
Capabilities beyond OOB connectors:
• Time stamp based incremental crawl
• Change log + delete log crawl
• Support for attachments
• Item level security
• Associated content
SPC213: Content Acquisition for Search in SharePoint 2010
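The first two capabilities in this list can be sketched together: re-index only items modified since the last crawl, and remove items named in a delete log. This Python sketch is independent of the BCS connector framework; the function name, the dictionary-based index, and the `(item_id, deleted_at)` log format are all assumptions made for illustration.

```python
from datetime import datetime, timezone

def incremental_crawl(source_items, delete_log, index, last_crawl):
    """Update `index` in place and return (updated_ids, removed_ids)."""
    updated = []
    for item_id, item in source_items.items():
        # Time stamp check: skip anything unchanged since the last crawl.
        if item["modified"] > last_crawl:
            index[item_id] = item["body"]
            updated.append(item_id)

    removed = []
    for item_id, deleted_at in delete_log:
        # Delete log: drop items the source reports as deleted.
        if deleted_at > last_crawl and item_id in index:
            del index[item_id]
            removed.append(item_id)
    return updated, removed

def day(n):
    return datetime(2013, 1, n, tzinfo=timezone.utc)

# Hypothetical state: the index was last refreshed on January 10.
index = {"doc1": "old proposal", "doc2": "draft", "doc3": "obsolete"}
source_items = {
    "doc1": {"modified": day(5), "body": "old proposal"},
    "doc2": {"modified": day(12), "body": "final"},
    "doc4": {"modified": day(15), "body": "new contract"},
}
delete_log = [("doc3", day(11))]

updated, removed = incremental_crawl(source_items, delete_log, index, day(10))
print(updated, removed, index)
```

Without a delete log, a time stamp based crawl cannot tell that `doc3` is gone, which is one reason periodic full crawls are still scheduled alongside incrementals.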

Connectors and Enterprise Content
[Figure: connector architecture. Labels: Unified Search Index, Security Mapping, Metadata Enrichment, Change Log, Targets, Search Optimized API, SharePoint, SharePoint Libraries & Lists]

Working Across Multiple Repositories DEMO

Summary

http://domorewithsearch.com rem@bainsight.com @RemSearchPro http://www.linkedin.com/in/rempurushothaman