#SummitNow Super Size Your Search 14 th November 2013 Fran Alvarez (Zaizi)

Slides:



Advertisements
Similar presentations
Implementing Tableau Server in an Enterprise Environment
Advertisements

…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
© Copyright 2012 STI INNSBRUCK Apache Lucene Ioan Toma based on slides from Aaron Bannert
Building A Digital Asset Management System With And Around Fedora 4 Stefano Cossu, Director of Application Services, The Art Institute of Chicago DC Fedora.
How to Use LucidWorks Search
“ Leveraging SharePoint 2010 Search Technologies ” With: Ivan Neganov.
June 22-23, 2005 Technology Infusion Team Committee1 High Performance Parallel Lucene search (for an OAI federation) K. Maly, and M. Zubair Department.
Rob Tice Vocabulary Management Group The Aspect VBE.
SharePoint 2013 Catalog Sites Brian Culver ● SharePoint Saturday DFW ● March 7, 2015 Build a SharePoint 2013 Search Driven.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
WMU GNL Automation How to make my IT life easier CHRISTOPHER KEYAERT CONSULTANT AT INOVATIV CLOUD AND DATACENTER MANAGEMENT MVP.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
IBM User Technology March 2004 | Dynamic Navigation in DITA © 2004 IBM Corporation Dynamic Navigation in DITA Erik Hennum and Robert Anderson.
Federated Searching Pre-Conference Workshop - The federated searching cookbook Qin Zhu HP Labs Research Library February 18, 2007.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
ECPRD seminar on the net IX”, Brussels, 2011 Faceted Search Some examples of applied faceted search on websites developed by the EP Jerry.
Search Search Drupal with Apache Solr with CERN Web Communications Group – Copyright 2013.
ManifoldCF for Content Acquisition
Real World Examples – Part II 7/26/2013Miro Remias, Sr. Solution Architect.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
Data Science for DTIC Data Ecosystem Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Revolutionizing enterprise web development Searching with Solr.
BI Funcasts The Mac-Guyver Techniques BI - The Mac-Guyver Techniques : Office Sharepoint Excel Services Gunter Staes –
Introduction to Nutch CSCI 572: Information Retrieval and Search Engines Summer 2010.
Kelly Boccia Abi Natarajan Konstantin Livitski Senthil Anand Subbanan Meyyappan 1.
SharePoint 2010 Search Architecture The Connector Framework Enhancing the Search User Interface Creating Custom Ranking Models.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
Solutions using Microsoft Content Management Server 2002 Connector for SharePoint Technologies Sue Corke Mark Harrison Microsoft UK.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Advanced Search Solutions for SharePoint Christopher Even BA-Insight.
ON YOUR TERMS Business needs * Enhanced by upcoming Azure IAAS features GoodBetterBest * * GoodBetterBestGoodBetterBestGoodBetterBestGoodBetterBestGoodBetterBest.
Automatic Metadata Discovery from Non-cooperative Digital Libraries By Ron Shi, Kurt Maly, Mohammad Zubair IADIS International Conference May 2003.
Text Analytics A Tool for Taxonomy Development Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture.
An answer to your common XACML dilemmas Asela Pathberiya Senior Software Engineer.
A Technical Overview Bill Branan DuraCloud Technical Lead.
Apache Solr Dima Ionut Daniel. Contents What is Apache Solr? Architecture Features Core Solr Concepts Configuration Conclusions Bibliography.
#SummitNow Super Size Your Search 14 th November 2013 Fran Alvarez (Zaizi)
#SummitNow Managing a Billion Object Repository November 13, 2013 Munwar Shariff CTO, CIGNEX Datamatics
@toniblyx at #SummitNow Alfresco Backup and Recovery Tool: a real world backup solution November 2013 Toni de la Fuente – Alfresco Senior Solutions Engineer.
How to build a tailored and unified ECM platform? The recipe for success, from the field Maxime ORAIN Head of European Alfresco Skills Centre Rémi MOEBS.
#SummitNow A Day in the Life of an Alfresco Admin November 2013 Antonio Soler Premier Support Engineer Alfresco Software Ltd.
Audit & Reporting with Alfresco & NoSQL architecture Lucas Patingre Alfresco consultant and technical lead at Zaizi.
Sitecore.net Training, Oct ECM 2.1 UPDATE 2 PART 1 CRAWL BEFORE YOU WALK.
Replace OpenText with Alfresco in a SAP environment
Modern Development Technologies in SharePoint SHAREPOINT SATURDAY OMAHA APRIL, 2016.
Alfresco Scalability Benchmarking Before telling how cool Alfresco is, you better prove it!
MasterCard Global Marketing Center An Alfresco Case Study Jay Mandel, MasterCard International Mike Vertal, Rivet Logic Corporation 15 November 2012.
Making the Most of a Hybrid Alfresco Solution From Genesys Telecommunications: Michael Katten, Director of Technical Publications Joe McMonagle, Manager.
#SummitNow Managing Documents on the Web Using Drupal, Alfresco & Cloud November Ian Norton – Senior Web Architect at Alfresco.
Thinking Long Term - Archive Strategies for Alfresco Nathan McMinn Remote Service Engineer Alfresco Chetan Lalye Senior Software Architect Agilent Technologies.
#SummitNow Using Alfresco and Cloud Apps in Harmony 5 th November 2013 Santiago Rodríguez Antonio D. Pérez.
Crafter case: European Bank Piergiorgio Lucidi Open Source ECM Specialist Certified Alfresco Instructor and Engineer Alfresco Wiki Gardener and Forum Moderator.
Search can be Your Best Friend You just Need to Know How to Talk to it IW 306 Ágnes Molnár.
Using Alfresco and Cloud Apps in Harmony
Business Directory REST API
Crafter case: European Bank
VI-SEEM Data Discovery Service
AMGA Web Interface Salvatore Scifo INFN sez. Catania
Senior Solutions Architect, MongoDB Inc.
Building Search Systems for Digital Library Collections
SharePoint Information Architecture
Multi-Farm, Cross-Continent SharePoint Architecture
EPIC INFOTECH CONSULTING GROUP
AMGA Web Interface Vincenzo Milazzo
NoSQL Overview + Elasticsearch Quick Dive
9/8/ :03 PM © 2006 Microsoft Corporation. All rights reserved.
Presentation transcript:

#SummitNow Super Size Your Search 14 th November 2013 Fran Alvarez (Zaizi)

#SummitNow Agenda Myself & My company Background Our Solution Scenario Demo Conclusions

#SummitNow About me Director of Zaizi Iberia and Chief Architect Alfresco Certified Engineer Responsible of large Alfresco architectures Semantic Consultant for Sensefy Alfresco Meetups Organizer

#SummitNow We are an Open Source Development Company that helps people work together more effectively HQ: London (UK) Seville (Spain) Colombo (Sri Lanka) Singapore

#SummitNow What we offer Open Source System Integrator Specialist in ECM Platinum Alfresco partner Best Systems Integrator Partner EMEA 2012 Best Systems Integrator Partner EMEA 2013 Million $ Club in 2013 Support 24/7

#SummitNow Background Let’s put a bit of context

#SummitNow Those Old Days… Only Lucene in Alfresco 3.4- Indexes were managed within Alfresco context Permissions were checked after Lucene returned all results

#SummitNow Present Solr as Search Subsystem Indexes are managed outside Alfresco context Permissions are checked at query time No in-transaction index

#SummitNow Alfresco 4 is… Common Enemies Find a single document Return large data sets Filter by permissions Be fast! “Sometimes one superhero is not enough”

#SummitNow Alfresco + Solr Approach Quite a good architecture Takes care of both performance and usability Flexibility in deployment and installations However… Sometimes we just need to use something else

#SummitNow Future Don’t freak out dude! We can arrange something

#SummitNow Our solution Use Apache ManifoldCF Decoupled from Alfresco Can be integrated with either Alfresco or any other repository vendor Preserve security and permissions within results It’s included in our Semantic solution: Sensefy! API to manage Manifold Services API for searching, decoupling Search engine chosen Simple Bundled UI Lots of Manifold Customization

#SummitNow Apache ManifoldCF Open Source Apache SF Project Get content from repos Push content on search services Based on “Connector” and “Job” concept Crawling model (add, change, delete) And respect permissions, bitch!

#SummitNow ManifoldCF Overview Repository 1 Repository 3 Repository 4 Repository 2 Apache ManifoldCF Search Server 1 Search Server 2 Search Server 3 Authority Service Authority 1 Authority 2 user specific search results

#SummitNow ManifoldCF – Architecture Repository Job Search Server ACLs

#SummitNow ManifoldCF – Architecture Repository Job Search Server ACLs Repository Connector

#SummitNow ManifoldCF – Architecture Repository Job Search Server ACLs Repository ConnectorOutput Connector

#SummitNow ManifoldCF – Architecture Repository Job Search Server ACLs Repository ConnectorOutput Connector Authority Connector

#SummitNow ManifoldCF – Architecture Repository Job Search Server ACLs Repository Connector query to retrieve contents Output Connector Authority Connector

#SummitNow ManifoldCF – Architecture Repository Job Search Server ACLs Repository Connector query to retrieve contents Output Connector metadata mapping content ingestion Authority Connector

#SummitNow ManifoldCF – Architecture Repository Job Search Server ACLs Repository Connector query to retrieve contents Output Connector metadata mapping content ingestion Authority Connector retrieve content ACEs

#SummitNow ManifoldCF – Architecture Repository Job Search Server ACLs Repository Connector query to retrieve contents Output Connector metadata mapping content ingestion Authority Connector retrieve content ACEs verbal description crawling model scheduling

#SummitNow Our ManifoldCF Contribution Alfresco Repository Connector: New implementation Amazon Cloud Search Output Connector Alfresco Authority Connector: Design & Development

#SummitNow Some of our most famous villains

#SummitNow Several Alfresco instances Current Alfresco instances don’t share indexes Indexes can’t be merged Can’t have federated search No good approach for presenting results to users

#SummitNow Several Alfresco instances Our solution Once index to rule them all Data origin is irrelevant (or not if we don’t) Single search across repositories You choose your search engine!

#SummitNow Alfresco + Other data providers Current Alfresco Search subsystem != Other provider Search services Alfresco can’t reach external data No way to merge results uniformly to end users

#SummitNow Alfresco + Other data providers Our solution Search engine is shared All of them speak ‘our language’ Alfresco can reach external data through Results are present and accessible between data providers

#SummitNow Alfresco + O(TB) data Current Alfresco Search subsystem Single or clustered Solr Every Solr instance manage its own index No chance to apply scale techniques Huge server are required and performance might be compromised

#SummitNow Alfresco + O(TB) data Our Solution Alfresco uses our index Indexing techniques can be applied according to use cases Sharding, Replication… Search strategy can be adopted with best suitable search solution

#SummitNow Other benefits Extract, index and map information from any other sources Putting them together in a single index Permissions are checked just once Search capabilities: facets, highlighting… Red Link Apache ManifoldCF Search Server Authority Service Alfresco Permissions Alfresco

#SummitNow Demo

#SummitNow Demo : Architecture

#SummitNow Demo: Who are these guys? Christian Bale, Actor Christopher Nolan’s Batman Gareth Bale, footballer Real Madrid latest star

#SummitNow Conclusions Searching & Indexing in most popular Cloud Search solutions Retrieving information from most popular repositories and data providers altogether Manage permission and security for data Fully supported by us!

#SummitNow Conclusions

#SummitNow What’s coming How can we improve it, dude? - Powerful UI - New connectors - Large data volume benchmarking - Share integration

#SummitNow We are not Batman But we can be your Superhero Zaizi Ltd.Fran Álvarez (+44) (+34)

#SummitNow Thank you! May you want to help us with this one?