GATE and the Semantic Web

Slides:



Advertisements
Similar presentations
Language Technologies Reality and Promise in AKT Yorick Wilks and Fabio Ciravegna Department of Computer Science, University of Sheffield.
Advertisements

1 OOA-HR Workshop, 11 October 2006 Semantic Metadata Extraction using GATE Diana Maynard Natural Language Processing Group University of Sheffield, UK.
Technical and design issues in implementation Dr. Mohamed Ally Director and Professor Centre for Distance Education Athabasca University Canada New Zealand.
An Introduction to GATE
26/10/2008 SWESE'08 1 Enhanced Semantic Access to Software Artefacts Danica Damljanović and Kalina Bontcheva.
1 University of Namur, Belgium PReCISE Research Center Using context to improve data semantic mediation in web services composition Michaël Mrissa (spokesman)
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
GATE, Human Language and Machine Learning Hamish Cunningham, Valentin.
1(18) GATE: A Unicode-based Infrastructure Supporting Multilingual Information Extraction Kalina Bontcheva, Diana Maynard, Valentin Tablan, Hamish Cunningham.
The Semantic Web and Language Technology BT Exact, Martlesham Hamish Cunningham Department of Computer Science, University of Sheffield Friday October.
Interoperability of Distributed Component Systems Bryan Bentz, Jason Hayden, Upsorn Praphamontripong, Paul Vandal.
Chapter 2. Slide 1 CULTURAL SUBJECT GATEWAYS CULTURAL SUBJECT GATEWAYS Subject Gateways  Started as links of lists  Continued as Web directories  Culminated.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
1(21) HLT, Data Sparsity and Semantic Tagging Louise Guthrie (University of Sheffield) Roberto Basili (University of Tor Vergata, Rome) Hamish Cunningham.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
1 SWE Introduction to Software Engineering Lecture 22 – Architectural Design (Chapter 13)
Asst.Prof.Dr.Ahmet Ünveren SPRING Computer Engineering Department Asst.Prof.Dr.Ahmet Ünveren SPRING Computer Engineering Department.
Ontology-Aware Information Extraction Hamish Cunningham, Kalina Bontcheva Department of Computer Science, University of Sheffield OntoWeb.
Controlled Language for Ontology Editing Adam Funk, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, Brian Davis, Siegfried Handschuh.
GATE, a General Architecture for Text Engineering Hamish Cunningham, Kalina Bontcheva Department of Computer Science, University of Sheffield Wednesday.
GATE technical workshop: introduction Hamish Cunningham Sheffield, March.
Adapting Legacy Computational Software for XMSF 1 © 2003 White & Pullen, GMU03F-SIW-112 Adapting Legacy Computational Software for XMSF Elizabeth L. White.
1 Software Engineering: A Practitioner’s Approach, 6/e Chapter 1 Software and Software Engineering Software Engineering: A Practitioner’s Approach, 6/e.
GATE, a General Architecture for Text Engineering Hamish Cunningham Department.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
1 Building Semantic Applications Paul Warren
Language Technology for the Semantic Web OntoWeb5,Florida,October 17 th,2003 WP12: Language Technology Overview SIG5 Paul Buitelaar.
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
Mobile Topic Maps for e-Learning John McDonald & Darina Dicheva Intelligent Information Systems Group Computer Science Department Winston-Salem State University,
Chapter 1 소프트웨어의 본질 The Nature of Software 임현승 강원대학교
1 Peter Fox Xinformatics 4400/6400 Week 11, April 16, 2013 Information Audit and dealing with Unstructured Information.
University of Sheffield NLP Teamware: A Collaborative, Web-based Annotation Environment Kalina Bontcheva, Milan Agatonovic University of Sheffield.
E-Science Data Information and Knowledge Transformation Edikt : e-Science Data, Information and Knowledge Transformation E-Science Centres of Excellence.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
GATE, a General Architecture for Text Engineering Hamish Cunningham, Kalina Bontcheva Department of Computer Science, University of.
Introduction to GATE Developer Ian Roberts. University of Sheffield NLP Overview The GATE component model (CREOLE) Documents, annotations and corpora.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Scenarios for a Learning GRID Online Educa Nov 30 – Dec 2, 2005, Berlin, Germany Nicola Capuano, Agathe Merceron, PierLuigi Ritrovato
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
©2003 Paula Matuszek Taken primarily from a presentation by Lin Lin. CSC 9010: Text Mining Applications.
Workshop on Human Language Technology for the Semantic Web and Web Services 2nd International Semantic Web Conference October 20th 2003, Sanibel Island,
ICCS 2008, CracowJune 23-25, Towards Large Scale Semantic Annotation Built on MapReduce Architecture Michal Laclavík, Martin Šeleng, Ladislav Hluchý.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Software Engineering for Business Information Systems (sebis) Department of Informatics Technische Universität München, Germany wwwmatthes.in.tum.de A.
GATE, a General Architecture for Text Engineering Hamish Cunningham, Kalina Bontcheva, Valentin Tablan, Diana Maynard, Yorick Wilks.
A Unicode-based Environment for the Creation and use of LRs Valentin Tablan, Cristian Ursu, Kalina Bontcheva, Hamish Cunningham, Diana Maynard, Oana Hamza,
Modern Programming Language. Web Container & Web Applications Web applications are server side applications The most essential requirement.
Chapter – 8 Software Tools.
1 WSMO Studio – an Integrated Service Environment for WSMO Marin Dimitrov OntoText Lab. / Sirma WIW 2005, Innsbruck
A Ubiquitous Permeable Web: requirements for the next generation semantic internet Hamish Cunningham Department of Computer Science, University of Sheffield.
Using Human Language Technology for Automatic Annotation and Indexing of Digital Library Content Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio.
Chapter 1 The Nature of Software
Utility Evaluation of Tools for Collaborative Development
Institute of Informatics & Telecommunications NCSR “Demokritos”
Unit – 5 JAVA Web Services
SNS College of Engineering Coimbatore
Lecture 2 of Computer Science II
Chapter 1 The Nature of Software
Chapter 1 The Nature of Software
CMPE419 Mobile Application Development
Textbook Engineering Web Applications by Sven Casteleyn et. al. Springer Note: (Electronic version is available online) These slides are designed.
Design and Maintenance of Web Applications in J2EE
For University Use Only
Chapter 7 –Implementation Issues
Using Uneven Margins SVM and Perceptron for IE
Lightweight tools for on-line course development
NextGRID: From Compute Grids to Grid SOAs and beyond
New Tools In Education Minjun Wang
CMPE419 Mobile Application Development
Presentation transcript:

GATE and the Semantic Web                                                                                                                             GATE and the Semantic Web Hamish Cunningham, Kalina Bontcheva, Wim Peters, Marin Dimitrov1, Atanas Kiryakov1, Department of Computer Science, University of Sheffield 1OntoText Lab, Sirma AI Ltd. Brief intro to GATE (a General Architecture for Text Engineering), Hand waving about LT and the Semantic Web, Demo 1(7)

A Ubiquitous Permeable Web                                                                                                                             A Ubiquitous Permeable Web The next generation of the web must be: ubiquitous: semantics for every device, every organisation, every individual; permeable: allow contextual data to penetrate and persist; companionable: able to engage with us via multiple natural modalities. Roles for Language Technology: discovery of semantics (ubiquity); mediating between context and personal semantic memories (permeability); conversing with people and the semantic web (companionableness). 2(7)

Critical Mass for the Semantic Web                                                                                                                             Critical Mass for the Semantic Web The SW: machine processable, repurposable data to compliment hypertext But: semantics = 0.0000000...% of the Web How to achieve critical mass? Huge scale automatic annotation. Requirements: Huge scale: – freely available to all EU citizens – distributed (over a Grid) – re-purposeable (delivered as Web Services) Portability and robustness via: – simple and therefore shallow HLT methods – +ve and –ve learning – analogs of IPSEs for computer-literate users 3(7)

                                                                                                                            GATE is: An architecture A macro-level organisational picture for LE software systems. A framework For programmers, GATE is an object-oriented class library that implements the architecture. A development environment For language engineers, computational linguists et al, GATE is a graphical development environment bundled with a set of tools for doing e.g. Information Extraction. Some free components... ...and wrappers for other people's components Tools for: evaluation; visualisation/edit; persistence; IR; IE; dialogue; ontologies; etc. Free software (LGPL). Download at http://gate.ac.uk/download/ 4(5)

Architectural principles                                                                                                                             Architectural principles Non-prescriptive, theory neutral (strength and weakness) Re-use, interoperation, not reimplementation (e.g. v1 used LT-NSL for SGML input; v2 talks to other XML-based systems, APIs and standards) (Almost) everything is a component, and component sets are user-extendable Component-based development An OO way of chunking software: Java Beans GATE components: CREOLE = modified Java Beans (Collection of REusable Objects for Language Engineering) The minimal component = 10 lines of Java, 10 lines of XML, 1 URL. 5(7)

Displaying Multilingual Data                                                                                                                             Displaying Multilingual Data All the visualisation and editing tools for ML LRs use enhanced Java facilities: 6(7)

GATE demo Components and the main UI; the resources tree                                                                                                                             GATE demo Components and the main UI; the resources tree Document formats, databases IE, IR, annotation, evaluation, WordNet Ontologies, OntoGazetteer, Protégé, DAML export 7(7)