Download presentation
Presentation is loading. Please wait.
1
1 Ontology Based Extraction of RDF Data from the World Wide Web Tim Chartrand A Thesis Proposal Research Supported By NSF
2
2 Introduction World Wide Web Has a huge amount of existing information Designed primarily for human consumption Semantic Web Is an extension of WWW Gives information a well-defined meaning Allows automation of tasks DEG contribution – Extract data from the WWW Proposed solution Extract Semantic Web data from the WWW Superimpose extracted data
3
3 Extraction Ontology Extraction Engine HTML Page Relational Data Overview of Proposed Research Extraction Ontology DAML Ontology User Extraction Engine HTML Page Relational Data RDF Data RDF Browser
4
4 RDF – What is it? Resource Description Framework Language of the Semantic Web Set of triples “25” Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data mailto:tim@cs.byu.edu 25 genealogy:age mailto:tyler@thechartrands.com genealogy:fatherOf
5
5 RDFS & DAML Core Concepts Classes daml:class – defines a class rdfs:subClassOf – specifies the generalization of a class Properties daml:property – defines a binary relation, has a value rdfs:domain – specifies class to which a property applies rdfs:range – specifies possible values of a property Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data
6
6 Example Ontology Program Size...... Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data
7
7 DAML OSM Classes Non-lexical object sets Properties Binary relationship sets between object sets Literal properties Binary relationship sets between non-lexical and lexical object sets Cardinality restrictions Participation constraints Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data
8
8 DAML OSM Program Size...... Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data
9
9 Data Frames Lexical object sets need data frame. Use data-frame library Match lexical object sets with data frames Compare names Stemming Levenshtein edit distance Soundex Longest Common Subsequence Choose most similar data frame Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data
10
10 User Modification Cardinality Constraints Provide graphical ontology editor Allow the user to edit participation constraints Disallow the user to modify ontology structure Data Frames Allow user to edit mapping Provide data frame editor Allow user to edit or add data frames Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data
11
11 Extracting the Data Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data
12
12 http://www.downloads.com/Program1001 software:Program Stick Death1.0Windows 3.x/95/98/Me/NT/2000/X 2.66MB rdf:type software:name software:version software:OperatingSystem software:ProgSize software:SizeVal software:SizeUnit software:Size rdf:type Convert to RDF Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data
13
13 Superimposed Data Extraction Ontology DAML Ontology User Extraction Engine HTML Relational Data RDF Data
14
14 Contributions Advancement of Semantic Web Application of Information Extraction to building Semantic Web Semantic Web data as superimposed information Algorithm for ontology conversion
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.