GlobalWisdom Software Bravo TM Reviewer for Online Editors Abhijit Patil.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Content Aggregation & Distribution: Abbotts Custom Solution Dawn Lynn Research Information Scientist, Abbott Labs.
Beth Golden Manager, Editorial Services Factiva Intelligent Indexing SLA 2004.
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Business Development Suit Presented by Thomas Mathews.
Making Search Relevant SchemaLogic Gary Carlson Chief Taxonomist
© Copyright 2012 STI INNSBRUCK Apache Lucene Ioan Toma based on slides from Aaron Bannert
BUSINESS DRIVEN TECHNOLOGY Enhancing Collaborative Partnerships
Taxonomies of Knowledge: Building a Corporate Taxonomy Wendi Pohs, Iris Associates
1 © 2010 SAGA Worldwide, LLC. All Rights Reserved.
Information Retrieval in Practice
The Hierarchy of Data Bit (a binary digit): a circuit that is either on or off Byte: 8 bits Character: each byte represents a character; the basic building.
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
Microsoft ® Official Course Interacting with the Search Service Microsoft SharePoint 2013 SharePoint Practice.
Knowledge Portals and Knowledge Management Tools
Libraries and Institutional Content Management Systems
Implementing Metadata Marjorie M K Hlava, President Access Innovations, Inc. Albuquerque, NM
Overview of Search Engines
Lecture-8/ T. Nouf Almujally
An innovative platform to allow translation and indexing of internet sites Localization World
A Product of Enterprise Content Management System (CMS) Web & Portal Content Management Systems for faster web publishing Copyright.
Our Commitment to Your Success: Global eTraining.
SharePoint Portal Server 2003 JAMES WEIMHOLT WEIDER HAO JUAN TURCIOS BILL HUERTA BRANDON BROWN JAMES WEIMHOLT INTRODUCTION OVERVIEW IMPLEMENTATION CASE.
IBE312: Ch15 Building an IA Team & Ch16 Tools & Software 2013.
Yuri de Lugt Collexis Karin Clavel TU Delft Library.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
The Internetworked E-Business Enterprise
A National Resource Working in the Public Interest © 2006 The MITRE Corporation. All rights reserved. KM at MITRE Jean Tatalias KM TEM, December 2007.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
1 8/24/2002 Why Upgrade to DS3.0? Xerox and the DocuShare Business Unit continue to invest in DocuShare. We want our customers to continue to invest in.
1 Chapter 11 Implementation. 2 System implementation issues Acquisition techniques Site implementation tools Content management and updating System changeover.
Rutherford Appleton Laboratory SKOS Ecoterm 2006 Alistair Miles CCLRC Rutherford Appleton Laboratory Semantic Web Best Practices and Deployment.
Ask A Librarian and QuestionPoint: Integrating Collaborative Digital Reference in the Real World (and in a really big library) Linda J. White Digital Project.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Presentation Outline (hidden slide) Technical Level: 100 Intended Audience: TDMs, ITPros, ITDMs, BI specialists Objectives (what do you want the audience.
Knowledge Management in a fast changing world Kate Elphick
Northern Light Technology, LLC Empowering the Organization.
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
Question Answering.  Goal  Automatically answer questions submitted by humans in a natural language form  Approaches  Rely on techniques from diverse.
KMS Products By Justin Saunders. Overview This presentation will discuss the following: –A list of KMS products selected for review –The typical components.
7-1 Management Information Systems for the Information Age Copyright 2004 The McGraw-Hill Companies, Inc. All rights reserved Chapter 7 IT Infrastructures.
14 October 2010 Leveraging Technical Expertise via Boeing Library Services* Diane Brenes, Librarian, Boeing Library & Learning Center Services.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Implementation Experiences METIS – April 2006 Russell Penlington & Lars Thygesen - OECD v 1.0.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Case Study SummaryChallenges Cisco WebEx, the world market leader in online web conference, has been working with Link Translation since 2009 to support.
Welcome! Deborah Baron VP Product Management and Marketing, Sentius KM World 2000 “eGlobalization” RichLink™
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
Text Analytics in Action: Using Text Analytics as a Toolset TBC 4:15 p.m. - 5:00 p.m. Marjorie Hlava Semantic enrichment / Semantic Fingerprinting.
Modularization Also Known As: reuse, single sourcing, structured authoring What is it? Breaking down content into smaller chunks that can be reused in.
1 © Xchanging 2010 no part of this document may be circulated, quoted or reproduced without prior written approval of Xchanging. MOSS Training – UI customization.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
1 CS 8803 AIAD (Spring 2008) Project Group#22 Ajay Choudhari, Avik Sinharoy, Min Zhang, Mohit Jain Smart Seek.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Web Content And Customer Relationship Management Solution. Transforming web sites into a customer-focused, revenue generating channel with less stress.
Information Retrieval in Practice
Search Engine Architecture
Simple and intuitive fare conditions
Anatomy of a modern data-driven content product
QKS Classifier Overview
Presentation transcript:

GlobalWisdom Software Bravo TM Reviewer for Online Editors Abhijit Patil

Company Overview GlobalWisdom, Inc. sells software that dramatically increases the ROI (Return Of Investment) of enterprise information management systems by capturing, organizing and sharing individual and collective knowledge. Software enhances content and knowledge management systems by leveraging existing workflows and continuously adapting information structure to foster critical thinking Software is not smart; people are Software that captures the distributed expertise of people working with content.

Bravo TM Reviewer for Online Editors Workflow-based solution for the online publishing market. Effectively leverages the expertise of editors in charge of adding metadata to content. Online publishers depend on high quality metadata to make sure their readers find the information they need. Bravo makes the process of adding metadata more efficient and effective, so publishers can maximize their return on investment when producing value added content.

The Value of Bravo™, Software that Learns Producing high-quality metadata is expensive. The best metadata comes from –Editorial oversight by employees who understand the content –The taxonomy (i.e. subject matter hierarchy) being applied to the content –The customers whose business needs are being served. Auto-classification –Reduces cost –Without human oversight and control, create metadata that makes more sense to an algorithm than a human being.

The Value of Bravo™, Software that Learns (Cont.) The Bravo combines –The efficiency of auto-classification –The expertise of in-house editorial staff Allowing editors to train the auto-classification software within their ordinary workflow. This improves the performance of the auto-classification algorithms, so that business can achieve a more cost-effective balance.

Bravo TM System

How ? Auto-classifier recommends topics as metadata tags, which reviewers can quickly approve or change. This allows publishers to off-load the rote work to the software Gives reviewers tools to fine-tune the taxonomy while reviewing content. Can add negative or positive examples to the training set with a click of the mouse Easily create new topics or subtopics. These tools allow to produce better quality metadata

Bravo TM Modules The Bravo Module Concept Based Search –The difference between search and retrieval Content Classifier Content Indexer K42 Topic Map Engine

The Bravo Module (Features) Profiling Based on user/system interaction & direct input –Gathers feedback from how people already work with information Identification of subject area experts Recommendation of relevant topics or documents –Global Wisdom's search module is concept based - results are returned by the ideas expressed in a document, not simply by keywords Information alerts on preset topics, or relevant to the work at hand Automatic, accurate content routing

The Bravo Module (Features) Search Search by selecting example, from a sentence to a full document Merging two or more examples Free text System Unlimited hierarchy depth Augments performance of existing content management systems Integrates seamlessly with legacy data storage systems

Concept-based Search Module™ Search Module is concept-based, not keyword- based. Can find highly relevant documents in which query terms do not appear. Users can phrase queries in the ways that make sense to them, rather than adapting to the limitations of the search engine. Based on patented Latent Semantic Indexing (LSI), which has demonstrated a 30% increase in accuracy (Telcordia, 1999) over keyword techniques. Leverages user feedback to become fully adaptive.

Concept-based Search Module™ (Cont.) Sample Applications Corporate intranet - ensure that employees across the organization can find the information they need. Maximize reuse of information, minimize duplication of effort. Workgroup collaboration - improve support for research, analysis, project management, and more. Publishers - ensure that customers can locate the valuable content they need. Corporate extranet - improve accessibility to timely, accurate information for suppliers, end users and partners

Concept-based Search Module™ (Cont.) Features : Search by example. –Select a piece of text, from a sentence to an entire document, to serve as a search query. Search by merging examples. –Combine two examples to find documents that are relevant to each. 100% automatic. –Does not require thesaurus or controlled vocabulary (works with multiple languages, without the need for translation). Concept-based relevancy ranking. Highly effective with Optical Character Recognition (OCR). Retains high level of accuracy with small amount of text, e.g., photo captions.

Latent Semantic Indexing Uses singular-value decomposition. We take a large matrix of term-document association data and construct a "semantic" space wherein terms and documents that are closely associated are placed near one another. Singular-value decomposition allows the arrangement of the space to reflect the major associative patterns in the data, and ignore the smaller, less important influences. As a result, terms that did not actually appear in a document may still end up close to the document, if that is consistent with the major patterns of association in the data. Position in the space then serves as the new kind of semantic indexing, and retrieval proceeds by using the terms in a query to identify a point in the space, and documents in its neighborhood are returned to the user.

D-GPS™ Content Classification Module Classifier uses a model-based approach Current algorithms simply analyze the text in a document. But different people will interpret the meaning of that information in their own way. Patent-pending algorithm, D-GPS, improves accuracy and relevance by simultaneously analyzing the text and the hierarchy, or hierarchies, that reflect the understanding your business brings to information. Scalability – Handles large document sets with ease Integrated with existing hierarchy or with custom one

GlobalWisdom Indexer Essential component of the bravo™ engine, which leverages user feedback to deliver fully adaptive enterprise workflow solutions for publishers, content delivery, knowledge management and enterprise portals. Features : –Fast, reliable and accurate indexing of content using Java and SQL DBMS. –100% Java. –Parallel threads using multiple DBMS connections. –All common formats, including text, HTML, XML, etc. New formats available on request. –Optional Crawler API, with support for focused crawl

K42 Topic Map Engine Using the Topic Map standard, K42 captures the relationships and associations that connect your data in within a "knowledge layer.“ Allows a greater level of meaning to be represented within a content or knowledge management system, improving the performance of critical information applications. –For example, Topic Maps can represent that pending FDA regulations have influenced a biotech company to launch a new marketing campaign, and that both have contributed to a rise in stock prices. The knowledge layer is a neutral format that works on top of any proprietary system, easily bridging multiple formats and sources. When integrated with the bravo™ engine, K42 leverages user feedback to become fully adaptive.

References Project Home Page – – Indexing by Latent Semantic Analysis – –