INFM 700: Session 4 Metadata Jimmy Lin The iSchool University of Maryland Monday, February 18, 2008 This work is licensed under a Creative Commons Attribution-Noncommercial-Share.

Slides:



Advertisements
Similar presentations
CC 2007, 2011 attribution - R.B. Allen Knowledge Representation and Documents.
Advertisements

Presentation on Artificial Intelligence
INFM 700: Session 6 Taxonomies and Metadata Paul Jacobs The iSchool University of Maryland Wednesday, Feb. 29, 2012 This work is licensed under a Creative.
Classification in the curriculum- “Dew(e)y” eyed or Semantically speaking? Christine Urquhart.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Ontology From Wikipedia, the free encyclopedia In philosophy, ontology (from the Greek oν, genitive oντος: of being (part. of εiναι: to be) and –λογία:
Introduction to Databases
Basics of Knowledge Management ICOM5047 – Design Project in Computer Engineering ECE Department J. Fernando Vega Riveros, Ph.D.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
“Rumours of my demise have been greatly exaggerated”
Text Operations: Preprocessing. Introduction Document preprocessing –to improve the precision of documents retrieved –lexical analysis, stopwords elimination,
Ontology Notes are from:
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
1 Chapter 2 Database Environment Transparencies © Pearson Education Limited 1995, 2005.
Chapter 2 Database Environment.
File Systems and Databases
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
Chapter 2 Database Environment Pearson Education © 2014.
Lecture Two Database Environment Based on Chapter Two of this book:
Alternatives to Metadata IMT 589 February 25, 2006.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
IBE312: Ch15 Building an IA Team & Ch16 Tools & Software 2013.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Database Environment 1.  Purpose of three-level database architecture.  Contents of external, conceptual, and internal levels.  Purpose of external/conceptual.
IBE312: Information Architecture 2013 Ch. 9 – Metadata Many of the slides in this slideset are reproduced and/or modified content from publically available.
Query Relevance Feedback and Ontologies How to Make Queries Better.
INFM 700 Course Review Paul Jacobs The iSchool University of Maryland Tuesday, May 5, 2009 This work is licensed under a Creative Commons Attribution-Noncommercial-Share.
Ontology Development in the Sciences Some Fundamental Considerations Ontolytics LLC Topics:  Possible uses of ontologies  Ontologies vs. terminologies.
CSC271 Database Systems Lecture # 4.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Making the Most of CVs Ontology Scope Note: Branch of philosophy that deals with being Branch of philosophy that deals with being Broader Term: Metaphysic.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
IL Step 3: Using Bibliographic Databases Information Literacy 1.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Taken from Schulze-Kremer Steffen Ontologies - What, why and how? Cartic Ramakrishnan LSDIS lab University of Georgia.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Building a Topic Map Repository Xia Lin Drexel University Philadelphia, PA Jian Qin Syracuse University Syracuse, NY * Presented at Knowledge Technologies.
Bayu Adhi Tama, M.T.I 1 © Pearson Education Limited 1995, 2005.
Database Environment Session 2 Course Name: Database System Year : 2013.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Topic Maps introduction Peter-Paul Kruijsen CTO, Morpheus software ISOC seminar, april 5 th 2005.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Controlled Vocabulary & Thesaurus Design Associative Relationships & Thesauri.
Chapter 7 K NOWLEDGE R EPRESENTATION, O NTOLOGICAL E NGINEERING, AND T OPIC M APS L EO O BRST AND H OWARD L IU.
School of Information, Fall 2007 University of Texas A. Fleming Seay Information Architecture Class Four.
Straits Knowledge 2007From Taxonomies to Tagging From Taxonomies to Tagging: developing taxonomy strategies that are appropriate.
Chapter 2 Database Environment.
Data Models. 2 The Importance of Data Models Data models –Relatively simple representations, usually graphical, of complex real-world data structures.
The Agricultural Ontology Server (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Food and Agriculture Organization.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
COMP6215 Semantic Web Technologies
Knowledge Representation Part I Ontology
ece 627 intelligent web: ontology and beyond
Federated & Meta Search
Taxonomies & Classification for Organizing Content
Chapter 2 Database Environment Pearson Education © 2009.
Database.
Introduction to Semantic Metadata & Semantic Web
Database Environment Transparencies
INFM 700: Session 6 Taxonomies and Metadata
Taxonomies and Classification for Organizing Content
Presentation transcript:

INFM 700: Session 4 Metadata Jimmy Lin The iSchool University of Maryland Monday, February 18, 2008 This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States See for detailshttp://creativecommons.org/licenses/by-nc-sa/3.0/us/

iSchool Today’s Topics What is metadata? Taxonomies Thesauri Ontologies Putting everything together Metadata Taxonomies Thesauri Ontologies Integration

iSchool Metadata Literally “data about data” “a set of data that describes and gives information about other data” ― Oxford English Dictionary In practical terms: Metadata helps users interpret content Metadata helps in organization, navigation, etc. Metadata Taxonomies Thesauri Ontologies Integration

iSchool Data without Metadata… Who: authored it? to contact about data? What: are contents of database? When: was it collected? processed? finalized? Where: was the study done? Why: was the data collected? How: were data collected? processed? Verified? … can be pretty useless! Metadata Taxonomies Thesauri Ontologies Integration

iSchool Early Example of Metadata Metadata Taxonomies Thesauri Ontologies Integration

iSchool Types of Organizations Taxonomies Anything organized in some sort of structure Thesauri Addition of relations between terms Emergence of “concepts” Ontologies Model of a domain Machine-readable Increasing complexity and richness Metadata Taxonomies Thesauri Ontologies Integration

iSchool Menagerie of Terms Classification Hierarchies Directories Controlled vocabularies Knowledge representations Let’s focus on significant differences. Let’s focus on advantages/disadvantages. Let’s focus on how each is useful. Let’s not quibble over what to exactly call each. Metadata Taxonomies Thesauri Ontologies Integration

iSchool Taxonomies Organization of objects according to some principle Familiar examples: Linnaean taxonomy (for living organisms) Web directories (e.g., Yahoo or ODP) Corporate directories Organization charts Organizational structures previously discussed Metadata Taxonomies Thesauri Ontologies Integration

iSchool Thesauri: Motivation “Semantic gap” between concepts and words Words are used to evoke concepts Concrete objects: MacBook Pro, iPhone Abstract ideas: freedom, peace Concepts Words Ideas Meaning Metadata Taxonomies Thesauri Ontologies Integration

iSchool To name that thing… The semantic gap: What’s the problem? Synonymy Polysemy Thesauri represent attempts to better organize mappings between words and concepts Do these present precision or recall problems? Metadata Taxonomies Thesauri Ontologies Integration

iSchool A slight detour… What’s a concept? Multiple perspectives Literature Philosophy Computer science (artificial intelligence) Cognitive science Harder to define than you think! What’s a chair? What’s a bird? Who’s a mother? Metadata Taxonomies Thesauri Ontologies Integration

iSchool Two Attempts First try: necessary and sufficient conditions Second try: prototypes Metadata Taxonomies Thesauri Ontologies Integration

iSchool Radial Categories A category with a central prototype… But has many cases deviating in different dimensions Example: “Mother” Central case: Other cases: A mother who is and always has been female, and who gave birth to the child, supplied her half of the child's genes, nurtured the child, is married to the father, is one generation older than the child, and is the child's legal guardian. George Lakoff. (1987) Women, Fire and Dangerous Things: What Categories Reveal about the Mind. Chicago: University of Chicago Press. Metadata Taxonomies Thesauri Ontologies Integration biological mother, birth mother, surrogate mother, genetic mother, stepmother, adoptive mother, foster mother, unwed mother, etc…

iSchool Basic Level Categories Two opposing principles in categorization Desire for rich structure, ability to discriminate differences Reduction of cognitive load Basic level: the balance point People learn basic level categories first Eleanor Rosch. (1977) Classification of Real-World Objects: Origins and Representation of Cognition. Johnson-Laird and Wason, eds., Thinking. Superordinate Basic LevelSubordinate Furniture Chair Table Dining chair, lawn chair, armchair, etc. Dining table, folding table, kitchen table, etc. Metadata Taxonomies Thesauri Ontologies Integration

iSchool Relation to IA Any organization system must be sensitive to users’ understanding of different concepts Examples: What’s the difference between laptop, PDA, phone, and convergence device? What documents should the system retrieval when “mother” is the query? When a user browses a furniture catalog for chairs, do you show them ottomans and footstools? Metadata Taxonomies Thesauri Ontologies Integration

iSchool Standard Thesaurus Structure Computer Notebook Laptop Desktop Replacement UltraportableTablet PC IS-A AKA Synonyms (variants) Narrower Terms Broader Terms Preferred Metadata Taxonomies Thesauri Ontologies Integration

iSchool Other Thesaurus Concepts Concepts vs. Instances ~ metadata vs. content Various relations (formal names) Synonyms Hyponyms/Hypernyms Meronym/Holonym … Metadata Taxonomies Thesauri Ontologies Integration

iSchool Uses of Thesauri For organization For navigation For indexing content For searching Metadata Taxonomies Thesauri Ontologies Integration

iSchool Poly-Hierarchies Concepts can have multiple parents Example: Cracow (Poland : Voivodship) Auschwitz II-Birkenau (Poland : Death Camp) Block 25 (Auschwitz II-Birkenau) German death camps Kanada (Auschwitz II-Birkenau) From Shoah Foundation’s thesaurus of holocaust terms Metadata Taxonomies Thesauri Ontologies Integration

iSchool Poly-Hierarchies What are the advantages and disadvantages? What’s the relationship to polysemy? Metadata Taxonomies Thesauri Ontologies Integration

iSchool Faceted Hierarchies Alternative to single and poly-hierarchies Basic idea: Describe objects along multiple facets Each facet has its associated hierarchy Issues: What’s a facet? How do you navigate faceted hierarchies? Metadata Taxonomies Thesauri Ontologies Integration

iSchool Faceted Browsing Example Metadata Taxonomies Thesauri Ontologies Integration

iSchool Faceted Browsing Example Metadata Taxonomies Thesauri Ontologies Integration

iSchool Faceted Browsing Example Demo: Metadata Taxonomies Thesauri Ontologies Integration

iSchool Advantages of Facets Integrates searching and browsing Easy to build complex queries Easy to narrow, broaden, shift focus Helps users avoid getting lost Helps to prevent “categorization wars” Metadata Taxonomies Thesauri Ontologies Integration

iSchool Ontologies First, a philosophical discipline: A branch of philosophy that deals with the nature and the organization of reality What characterizes being? What is being? More recently, computer science perspective Arose out of desire to build smarter machines Related concepts: knowledge representation, knowledge engineering Metadata Taxonomies Thesauri Ontologies Integration

iSchool What is an ontology? An computational artifact: Symbols describing relevant concepts in a domain Explicit assumptions regarding the meaning and usage of the symbols A formal specification of a particular domain: Represents shared understanding of that domain Must be capable of manipulation by a computer Metadata Taxonomies Thesauri Ontologies Integration

iSchool What’s in an ontology? Symbols representing concepts arranged according to relevant relations Rules or constraints governing relations between concepts Metadata Taxonomies Thesauri Ontologies Integration

iSchool Relationship to IA? Database Web Server Application Server Network Ontologies are implicitly “hidden” here!!! Flight Trip From: Part-of Airplane Equipment To: Departure Time: Arrival Time: Origin: Destination: Type: Capacity: Rule: Arrival Time is always after Departure Time Rule: Distance from Origin to Destination typical > 100 miles Metadata Taxonomies Thesauri Ontologies Integration

iSchool Grand Vision Ontology 2 General Purpose Reasoning Engine Ontology 3 Ontology 1 … Really, really, really smart machines! Metadata Taxonomies Thesauri Ontologies Integration

iSchool Putting it all together… Database Web Server Application Server Network Database Web Server Network Two-Layer Architecture Three-Layer Architecture Apache mySQL PHP Metadata Taxonomies Thesauri Ontologies Integration

iSchool Popular Implementation Content Metadata Presentation SQL Database PHP/HTML Metadata Taxonomies Thesauri Ontologies Integration

iSchool Encoding Hierarchies A BC DEF GH ChildParent BA CA DC EC FC GD HD Table: Hierarchy Finding children of A: Select child from Hierarchy where parent = ‘A’  B, C Finding parent of G: Select parent from Hierarchy where child = ‘G’  D Finding siblings of D: find parent, and then find its children Store in RDBMS Metadata Taxonomies Thesauri Ontologies Integration

iSchool Encoding Metadata A BC DEF GH IDAttributes …Label 0001B 0002B 0003C 0004D 0005D 0006E …… Table: Items Metadata Taxonomies Thesauri Ontologies Integration

iSchool Content  Presentation A BC DEF GH You are here: A > C > D Contents at D Related - D - E Hierarchy(child, parent)Content(id, attribute 1, attribute 2, attribute 3, …) Metadata Taxonomies Thesauri Ontologies Integration

iSchool Faceted Browsing Matching Results Filter by - Facet 1 (possible values) - Facet 2 (possible values) Hierarchy(child, parent)Content(id, attribute 1, attribute 2, attribute 3, …) Metadata Taxonomies Thesauri Ontologies Integration

iSchool Today’s Topics What is metadata? Taxonomies Thesauri Ontologies Putting everything together Metadata Taxonomies Thesauri Ontologies Integration