Semantics, Syndication and Social Networks: Mechanisms for Future Structured Information Spaces Hamish Cunningham (University of Sheffield) Werner Haas.

Slides:



Advertisements
Similar presentations
David Dawson Head of Digital Futures MINERVA and MICHAEL: Where do we go to?
Advertisements

Action Plan David Dawson Head of Digital Futures Museums, Libraries and Archives Council.
Thesaurus speed dating conclusions. The ideal thesaurus… …is tailor-made for the special needs of its user community. In other words, it is different.
A centre of expertise in digital information management UKOLN is supported by: Memory institutions and the social fabric of the Web Dr.
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
Computational Paradigms in the Humanities – eHumanities and their role and impact in transdisciplinary research Gerhard Budin University of Vienna.
CS 431 The Semester in Elevator Speak Carl Lagoze – Cornell University May 5, 2004.
Digital Content Solutions Digital content management technology has transformed the way to manage content and knowledge, in this knowledge era. Research.
Where the Web Went Wrong Hamish Cunningham Dept. Computer Science, University.
Representation without Reason: Slow Progress toward the Semantic Web Jim Greer ARIES Laboratory Computer Science, University of Saskatchewan.
How to survive the document & data tsunami? Lambda Verdonckt Business Analyst TenForce.
Galia Angelova Institute for Parallel Processing, Bulgarian Academy of Sciences Visualisation and Semantic Structuring of Content (some.
GATE, SWAN and Semantic TV Hamish Cunningham Department of Computer Science, University of Sheffield.
1 Dr Alexiei Dingli Introduction to Web Science Conclusion.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Evaulating Learning Objects Across Boundaries: The semantics of localization Bahadır Karabina Ayşe Sümeyye Güven.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Using Information Extraction for Question Answering Done by Rani Qumsiyeh.
‘european digital library’ (EDL) Julie Verleyen TEL-ME-MOR / M-CAST Seminar on Subject Access Prague, 24 November 2006.
School of Computing and Mathematics, University of Huddersfield Knowledge Engineering: Issues for the Planning Community Lee McCluskey Department of Computing.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
Library Automation and Digital Libraries Class #5 LBSC 690 Information Technology.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Web Archives, IDEAL, and PBL Overview Edward A. Fox Digital Library Research Laboratory Dept. of Computer Science Virginia Tech Blacksburg, VA, USA 21.
Logic Programming for Natural Language Processing Menyoung Lee TJHSST Computer Systems Lab Mentor: Matt Parker Analytic Services, Inc.
What’s the difference between Tony Blair and Mother Theresa? (Human Language Technology for Preservation return on investment)
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Universität Stuttgart Universitätsbibliothek Information Retrieval on the Grid? Results and suggestions from Project GRACE Werner Stephan Stuttgart University.
1 Building Semantic Applications Paul Warren
1 The BT Digital Library A case study in intelligent content management Paul Warren
HEALTH DEVELOPMENT AGENCY ONLINE INFORMATION RESOURCES Heidi Livingstone Marta Calonge Contreras.
EXCS Sept Knowledge Engineering Meets Software Engineering Hele-Mai Haav Institute of Cybernetics at TUT Software department.
Metadata, the CARARE Aggregation service and 3D ICONS Kate Fernie, MDR Partners, UK.
Semantic Search: different meanings. Semantic search: different meanings Definition 1: Semantic search as the problem of searching documents beyond the.
The PrestoSpace Project Valentin Tablan. 2 Sheffield NLP Group, January 24 th 2006 Project Mission The 20th Century was the first with an audiovisual.
Themes Architecture Content Metadata Interoperability Standards Knowledge Organisation Systems Use and Users Legal and Economic Issues The Future.
Uniting Libraries And Archives: How An Integrated Metadata Strategy Can Produce a Common Research Environment Richard Gartner, King's College London.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Reading Discussions Metcalfe’s Law paper What is metcalfe’s Law? Examples from the Web? How can we utilize it? How semantics contribute to social networks,
Preservation of Interoperability and Interoperability of Preservation DL.org Autumn School – Athens, 3-8 October 2010 Seamus Ross, University of Toronto.
Artificial Intelligence By Michelle Witcofsky And Evan Flanagan.
Tetherless World Constellation Open Government Data Jim Hendler Tetherless World Professor of Computer and Cognitive Science Assistant Dean of Information.
Building Knowledge Societies Abdul Waheed Khan Assistant Director-General for Communication and Information Durban ::: 19 August 2007 E-Learning: Universities.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
WSIS Action Plan: 1 Bamako, Mali, 6-7 May 2005 Multilingualism for Cultural Diversity and Participation for All in Cyberspace Technology solutions for.
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
OWL Representing Information Using the Web Ontology Language.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Summary Knowledge Bases from Web are Real, Big & Useful: Entities, Classes & Relations Key Asset for Intelligent Applications: Semantic Search, Question.
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Information Retrieval
1 The Design of Multimedia Database Systems, for Use as Multidisciplinary, Cross-sector and Cross-cultural Educational Resources CAL2003, Belfast 9th April,
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. DERI Galway David O‘Sullivan, Tomas Vitvar, Hamish Cunningham.
Virtual Information and Knowledge Environments Workshop on Knowledge Technologies within the 6th Framework Programme -- Luxembourg, May 2002 Dr.-Ing.
Cornucopia: the UK database of museum collections Peter Winsor Resource: The Council for Museums, Archives and Libraries CD Focus - 14 May 2002.
Tetherless World Constellation Open Government Data Jim Hendler Tetherless World Professor of Computer and Cognitive Science Assistant Dean of Information.
A Ubiquitous Permeable Web: requirements for the next generation semantic internet Hamish Cunningham Department of Computer Science, University of Sheffield.
1 ARKive-ERA Project Lessons and Thoughts Semantic Web for Scientific and Cultural Organisations Convitto della Calza 17 th June 2003 Paul Shabajee (ILRT,
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Big Data: Every Word Managing Data Data Mining TerminologyData Collection CrowdsourcingSecurity & Validation Universal Translation Monolingual Dictionaries.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
Using Human Language Technology for Automatic Annotation and Indexing of Digital Library Content Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio.
GATE and the Semantic Web
COMP62342: Ontology Engineering for the Semantic Web
Web archives as a research subject
Presentation transcript:

Semantics, Syndication and Social Networks: Mechanisms for Future Structured Information Spaces Hamish Cunningham (University of Sheffield) Werner Haas (Johaneum Research) Ant Miller (BBC) Libby Miller (University of Bristol) Ralph Traphoener (Empolis / Bertelsmann) Paul Warren (British Telecom)

What’s the difference between Mother Theresa and Tony Bliar? Hamish Cunningham Dept. Computer Science, University of Sheffield

3 Why semantic metadata? 1.Different types of metadata allow different types of search (but also incur different costs and have different limits) full text: "find me Nevsky in Bulgaria" taxonomy / thesaurus / semantic annotation / ontology: "find me churches in Eastern Europe" E.g. BBC's INFAX taxonomic system: 66% of searches would fail if only full text 2.The web promotes diversity but also fragmentation; there's too much of it; less and less impact for curated data In face of this cultural memory institutions need Syndication and mediation (to pool outlets and multiply impact); this means presentation-independent, multipurpose content Users as assistants (to cut the cost of metadata); this can mean shared conceptualisations of content How do we get there?

4 The semantic web and why you can't have it (yet) The semantic web is about a semantic layer for interoperability, machine-readability, inference – ideal for semantic libraries? Problems: 1.Construction and maintenance of shared taxonomies, terminologies & ontologies is expensive 2.Annotation of content relative to them is v. expensive 3.How does a machine tell the difference between "Mother Theresa is a Saint" and "Tony Blair is a Saint"? (Beyond the shallow and the general we get into typical AI problems, the contextual and shifting nature of meaning, etc.)

5 Four promising directions 1.Use recommender systems to make the users into curators’ assistants (who tells Google which page is important? other web users do, by linking; also Amazon) 2.Allow curators and users to DIY simple specific ontologies and KBs (targetted adjuncts to general models like CIDOC) 3.Use Information Extraction (IE) to populate semantic models 4.Ride the next wave of social software and on-line communities (Wikis, Bloggs, OSN, file sharing / P2P, RSS/ATOM)

6 IT context: the Knowledge Economy and Human Language Gartner, December 2002: taxonomic and hierachical knowledge mapping and indexing will be prevalent in almost all information-rich applications through 2012 more than 95% of human-to-computer information input will involve textual language A contradiction: to deal with the information deluge we need formal knowledge in semantics-based systems our archived history is in informal and ambiguous natural language The challenge: to reconcile these two phenomena

7 Human Language Formal Knowledge (ontologies and instance bases) (A)IE CLIE (M)NLG Controlled Language OIE Semantic Web; Semantic Grid; Semantic Web Services KEY MNLG: Multilingual Natural Language Generation OIE: Ontology-aware Information Extraction AIE: Adaptive IE CLIE: Controlled Language IE HLT: Closing the Loop

8 Information Extraction Information Extraction (IE) pulls facts and structured information from the content of large text collections. Contrast IE and Information Retrieval NLP history: from NLU to IE Progress driven by quantitative measures MUC: Message Understanding Conferences ACE: Advanced Content Extraction General Architecture for Text Engineering (GATE):

9 “The shiny red rocket was fired on Tuesday. It is the brainchild of Dr. Big Head. Dr. Head is a staff scientist at We Build Rockets Inc.” IE Example ST: rocket launch event with various participants NE: "rocket", "Tuesday", "Dr. Head“, "We Build Rockets" CO:"it" = rocket; "Dr. Head" = "Dr. Big Head" TE: the rocket is "shiny red" and Head's "brainchild". TR: Dr. Head works for We Build Rockets Inc.

10 Ontology-based IE XYZ was established on 03 November 1978 in London. It opened a plant in Bulgaria in … Ontology & KB Company type HQ establOn CityCountry Location partOf type “03/11/1978” XYZ London UK Bulgaria HQ partOf

11 A Necessary Trade-Off Domain specificity vs. task complexity: complexity specificity acceptable accuracy domain specific bag-of-words events general simple complex relations entities

12 Open information, defended communities Trend 1: seconds out, round 5: file sharing is about to go social Trend 2: the living room is about to be computerised What will happen when all your living room devices fold into a single PC? Bill Gates hopes you'll be running Windoze, but Consumer Electronics firms bet on Linux & stable hardware (no viruses, no crashes, cheap,...) What if these two trends combine? Ubiquitous on-line communities centred on shared content, with a model of trust What if memory institutions provide means of organising, explaining, interlinking the cross-over between modern popular culture and the curated memory? Important because DRM is the beginning of the end of civilisation as we know it (controls how you consume media you buy; has the potential to be linked with censorship and with invasive behaviour logging) you can't make digital objects behave like physical objects - unless you totally control the hardware and the operating system if someone has control, then we may end up finding that someone has given the contract for preserving our culture to Haliburton

13 Memory is not a luxury C21 st : all the C20 th mistakes but bigger & better? If you don’t know where you’ve been, how can you know where you’re going? Libraries, museums, archives: ammunition in the war on ignorance (more dangerous than “terror”?) Ammunition is useless if you can’t find it: new technology must make our history accessible to all, for all our futures

14 Summary Cultural memory can benefit from semantic metadata, presentation-independence and repurposing Semantic web technology: –no: it won’t make machines intelligent –perhaps: simple specific models can work Four ways to cross the AI bridge: DIY models; recommenders; IE; OSN + P2P This talk: More: ● Related projects: