An Alfresco Apache Stanbol Integration (port of OpenCalais Integration) Steve Reiner CTO Integrated Semantics.

Slides:



Advertisements
Similar presentations
Fusing Online Commerce and Social Network: Enhance Social Shopping Experience via Desktop Application A Master Project Presented By Ning Song.
Advertisements

NCBO-I2B2 Collaboration Overview and Use Cases Nigam Shah
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Go to ‘Site Actions’ ‘View All Site Content ‘View All Site Content’
1 Actuate Corporation © 2010 THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE.
Kentico CMS 5.5 R2 What’s New. Highlights Intranet Solution Document management package – WebDAV support – Project & task management – Document libraries.
CAPTURE SOFTWARE Please take a few moments to review the following slides. Please take a few moments to review the following slides. The filing of documents.
© Copyright 2012 STI INNSBRUCK Apache Stanbol.
A Product of Enterprise Content Management System (CMS) Web & Portal Content Management Systems for faster web publishing Copyright.
CRSX plug-in development. Prerequisites Software and Libraries Eclipse RCP (3.5 or higher) –Go –Select.
Text Analytics And Text Mining Best of Text and Data
A step-by-step tutorial by Henry Liu Auckland City Libraries Make a start Chinese Digital Community.
Search Search Drupal with Apache Solr with CERN Web Communications Group – Copyright 2013.
®® Microsoft Windows 7 Windows Tutorial 6 Searching for Information and Collaborating with Others.
Nathan McMinn, Technical Consultant with Alfresco
What Can Do for You! Fabian Christ
Alfresco – An Open Source Content Management System - Bindu Nayar, Bhavana Mohanraj.
Interoperability with CMIS and Apache Chemistry
Transforming the Way We Work Logistics Community of Practice Jill Garcia Defense Acquisition University 14 July 2006.
DATAVERSE FOR JOURNALS Mercè Crosas, Ph.D. Director of Data Science IQSS, Harvard Society for Scholarly Publishing 37 th Meeting,
Introduction to Android. Android as a system, is a java based operating system that runs on the Linux kernel. The system is very lightweight and full.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
Building Search Portals With SP2013 Search. 2 SharePoint 2013 Search  Introduction  Changes in the Architecture  Result Sources  Query Rules/Result.
WCM Platform Improvements ECM and Enterprise Metadata Advanced Routing and Document Sets In Place Records Management.
Omeka Creating Exhibits. Select “Create an Exhibit” Log in to Omeka at:
TypeScript for Alfresco and CMIS Steve Reiner CTO Integrated Semantics.
Replace OpenText with Alfresco in a SAP environment
Getting Started Telligent or SharePoint (or Hybrid)?
Making the Most of a Hybrid Alfresco Solution From Genesys Telecommunications: Michael Katten, Director of Technical Publications Joe McMonagle, Manager.
Share Enhancements David Webster. Introduction Me: David Webster Alfresco Engineer Joined April 2010 UI The Session: Share Enhancements:
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
#SummitNow SharePoint to Alfresco Migration Mark Lugert of Simflofy Inc.
Explore Various Options for Bulk File Transfer out of Alfresco Craig Tan Technical Account Manager.
How to use Drupal Awdhesh Kumar (Team Leader) Presentation Topic.
Microsoft Office System 2007: Records Management Wes Preston Inetium.
Chapter 1 Getting Started with ASP.NET Objectives Why ASP? To get familiar with our IDE (Integrated Development Environment ), Visual Studio. Understand.
The Palantir Platform… …Changes in 2.3
Chapter 17 The Need for HTML 5.
ECM Subsystems Component View
SharePoint 2007 Business Intelligence
ISRAEL – September, 12th 2017.
Weebly Elements, Continued
Development with Eclipse
UW-Superior V10.7 for Instructors
Getting Started with Alfresco Development
Veritas Content Syndication 2017
CONTENT MANAGEMENT SYSTEM CSIR-NISCAIR, New Delhi
Elasticsearch Magento 2 Extension
Bare bones notes.
Steering Group Member, Link Digital
Flipster App for iPad and iPhone
CONTENT MANAGEMENT SYSTEM CSIR-NISCAIR, New Delhi
Metadata Editor Introduction
Getting started with Alfresco Development
Accessing Spatial Information from MaineDOT
Table of Contents: Part B
Adding Post Type Archive in WordPress Navigation Menus Guided By: wpglobalsupportwpglobalsupport.
Stephen Faig to provide the introduction Mike Ruane, President / CEO
B2B Portal Training Materials
The SADE mini-project of the EGI DARIAH Competence Centre
Patricia NXT.
eSciDoc Development Schedule
Bare bones notes.
Basic Web Page Creation
Guided Research: Intelligent Contextual Task Support for Mails
B2B Portal Training Materials
Web archives as a research subject
Ridgehead K-Fuze CMS Overview
ICOM TC Charter TC’s Scope Out of TC’s Scope Call for Participation
AssetWise Operational Analytics
Presentation transcript:

An Alfresco Apache Stanbol Integration (port of OpenCalais Integration) Steve Reiner CTO Integrated Semantics

OpenCalais Integration Features Share, FlexSpaces, and Explorer UI Auto tagging action (manual and rules) in all List semantic tags in details in all Share, FlexSpaces: Semantic Tag Clouds, Geo-Tagged Map FlexSpaces: Suggest Tags, Add / Remove Tags on Doc Open Source

OpenCalais Share Integration Features Auto tag menu in Doc Lib, Repo, Details Semantic Tag Cloud Dashlets with category drop-down Geo-tagged map dashlet Dashlets work both on site and overall dashboards Search results list when click tags in these dashlets

OpenCalais Advantages / Disadvantages Advantages: Good Recognition Results on Names, Cities, Companies Good for news, public website text Disadvantages: Doc Size limit on All versions (100k bytes) Daily submission items limits on Free OpenCalais (50k) and Calais Professional (100k) Keep metadata extracted Focused on English, some support for French,Spanish Not Customizable in Taxonomy or in recognition code Not Open Source

Apache Stanbol Disadvantages: OpenNLP Recognition of Names, Cities, Companies not as good as OpenCalais (can chain other engines/services including OpenCalais) Advantages: No doc size or submission item limits Multi language focused Customizable in Taxonomy and in recognition code Open Source

Apache Stanbol More of a full semantic platform, not just text enhancement Focused on semantic content management Could be used for a more general semantic platform Componentized, OSGi based Enhancer, Enhancement Engines, Entity Hub, ContentHub, Ontology Mgr, Rules, Reasoners, CMS Adapter, FactStore

Port of OpenCalais Integration to Apache Stanbol Prototype download available now Open Source All previous features in Share and Explorer are available Alfresco extension (4.x) and Share extension (4.0 and 4.2) Share auto-tag menus, semantic tag clouds dashlet, geo-tagged dashlet, semantic tags listed in details Action can be used in content rules to auto-tag all submissions to a folder, etc. Auto tag action also available in Explorer, semantic tags listed on details page Suggest tag webscript not complete FlexSpaces doesn’t have support yet (need to add additional calls to different webscript URLs and add preference options of to use OpenCalais or Leveraged a Java client API library contributed by Zaizi to Stanbol that makes REST calls to Stanbol

Apache Stanbol Integration Features Roadmap Finish Suggest Tag WebScript and add support to FlexSpaces for Stanbol Display of dbpedia info / webpage on entity next to search results list on page displayed after semantic tag click Add using Stanbol contenthub instead of stateless entityhub to retain semantic enhancement of docs If Zaizi Stanbol integration is not made available as open source, will add some things such as Solr Facets search UI of semantic categories / entities Other things considering SKOS taxonomy editor Semantic Categories Graph (single doc, multiple docs) Tie in Alfresco as the content mgr of versions of Protégé GWT Web UI ontology editor / tie in with Stanbol Stanbol support for enhancing any CMIS repository Stanbol as platform semantic data integration of structured data in addition to unstructured

Links to Find out more blogwww.integratedsemantics.org