David M. Mark NCGIA-Buffalo

Slides:



Advertisements
Similar presentations
Cultural and Linguistic Variation in (Geo)Spatial Conceptualization David M. Mark University at Buffalo.
Advertisements

Landscape Terms, Place Names, and the Question of Formal Ontology.
Center for Modeling & Simulation.  A Map is the most effective shorthand to show locations of objects with attributes, which can be physical or cultural.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Ontology From Wikipedia, the free encyclopedia In philosophy, ontology (from the Greek oν, genitive oντος: of being (part. of εiναι: to be) and –λογία:
Deriving Semantic Description Using Conceptual Schemas Embedded into a Geographic Context Centre for Computing Research, IPN Geoprocessing Laboratory Miguel.
Human Cognition in the Geographic Domain David M. Mark Department of Geography, UB.
©2005 Austin Troy. All rights reserved Lecture 3: Introduction to GIS Part 1. Understanding Spatial Data Structures by Austin Troy, University of Vermont.
Conceptual modelling. Overview - what is the aim of the article? ”We build conceptual models in our heads to solve problems in our everyday life”… ”By.
Foundations This chapter lays down the fundamental ideas and choices on which our approach is based. First, it identifies the needs of architects in the.
Studying Geography The Big Idea
WORLD GEOGRAPHY Sept. 5, Today - Basic geographical concepts - Introduction to human geography.
Why is the World Bumpy? By: Meghan Padial.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
1 Introduction to Modeling Languages Striving for Engineering Precision in Information Systems Jim Carpenter Bureau of Labor Statistics, and President,
Geography of the Americas ____________________: The study of people, their environment, and their resources ________________: An account of what has happened.
Knowledge representation
–combines elements of computer science –database design –software design geography –map projections –geographic reasoning mathematics –mathematical topology.
LANGUAGE & CULTURE: Are there Differences in the Conceptualization of Landscape and Space? David M. Mark, University at Buffalo.
Point to Ponder “I think there is a world market for maybe five computers.” »Thomas Watson, chairman of IBM, 1943.
Whenever you see me, click on me to learn more!
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Christopher Wellen M.Sc. Candidate McGill University On Cognition and Computation: An Introduction to Spatial Ontologies.
Conceptual Data Modelling for Digital Preservation Planets and PREMIS Angela Dappert.
The American Journey (History of Our Nation) A Meeting of Different Worlds UNIT 1: Geography, Early Americans, Exploration, and Colonization.
Tag Clouds Presented By: Laura F. Bright February 27th, 2006 INF385T: Semantic Web Spring 2006 / Turnbull.
Levels of Linguistic Analysis
A Geographer’s World Section 1. Think about where you live. What does the land look like? Are there tall mountains nearby, or is the land so flat that.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
Chapter 1 – A Geographer’s World
Understanding Social Studies Concepts Chapter 1 Section 1.
Business System Development
Ch 1 A Geographer’s World
Land and Water Formations
Chapter 7 Verbal Intercultural Communication
COMP6215 Semantic Web Technologies
ece 627 intelligent web: ontology and beyond
Ontology From Wikipedia, the free encyclopedia
DATA MODELS.
Chapter 7: Entity-Relationship Model
PCB 3043L - General Ecology Data Analysis.
Helping Children Learn
Fundamentals/ICY: Databases 2010/11 WEEK 1
Ideas and words to use in map skills questions….
Landforms and Biomes.
Incorporating Ancillary Data for Classification
Whenever you see me, click on me to learn more!
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment.
Chapter 1 – Uncovering the Past
You Need: A pencil Your History Notebook Colored Pencils Glue Stick
Introduction Artificial Intelligent.
From Knowledge Organization (KO) to Knowledge Representation (KR)
Module 8 – Database Design Using the E-R Model
“Studying Geography" GEOGRAPHY Understanding Social Studies Concepts
Do we directly perceive objects? (25 marks)
THE FIELD OF DISCOURSE ANALYSIS
Key Issues How do geographers describe where things are? Why is each point on Earth unique? Why are different places similar? Why are some human actions.
WJEC GCE Geography Guidance for Teachers: Assessment at AS/A level.
SDMX Information Model: An Introduction
Schema translation and data quality Sven Schade
Levels of Linguistic Analysis
Four-day Course on Teaching Science Education KLA in the English Medium Dr Stella Kong.
Requirements Document
Traditional Grammar VS. Generative Grammar
A Geographer’s World Chapter 1.
Ch. 1, L2 The Geographer’s Tools
Continental Landforms
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 6b: Database Design Using the E-R Model
Presentation transcript:

David M. Mark NCGIA-Buffalo Semantics of Landscape: Providing Information About the Landscape via The Semantic Web David M. Mark NCGIA-Buffalo

Semantics Semantics is all about meaning Semantic interoperability is about the preservation of meaning as information is exchanged among software, databases, and people

Words Words in natural language play a critical role in the transmission of meaning from one person to another… and in the sharing of concepts among individuals within speech communities

Ontologies Recent developments in ontologies for knowledge representation also have words and their meanings playing a central role Such ontologies mainly formalize the definitions of and relations among words (terms) for a specific domain

Conceptualizations If an ontology is an explicit specification of a conceptualization, where do the conceptualizations come from??

The Key Role of Words The key role of words and language in semantics is recognized in the ubiquitous semiotic triangle, which relates: signs (symbols, including words) concepts (mental ideas), and referents (in the real world)

“The Meaning Triangle” Sowa, John F., 2000. Ontology, Metadata, and Semiotics. In B. Ganter & G. W. Mineau, eds., Conceptual Structures: Logical, Linguistic, and Computational Issues, Lecture Notes in AI #1867, Springer-Verlag, Berlin, pp. 55-81.

Semantics of what? In contrast to the the Münster group, I am trying to capture the meaning of natural language expressions! Formalizations of the common-sense meanings of words are needed for building ontologies

Semantics of Landscape The natural landscape is a very important domain for human activities, and is particularly important in geospatial intelligence Representation and exchange of information about the landscape in word-centered ontologies such as those alluded to above, for delivery through the Semantic Web, presents a number of significant research challenges that are rare or absent for other domains

Some Major Issues Semantics usually deals with objects but geospatial information is often fields or images Classifications of landscape elements vary across cultures and languages Delimitation of landscape objects may be dependent on categorizations The above issues interact!

Semantics usually deals with objects but geospatial information is often fields or images While the semantic web is word-centered, the great majority of existing geospatial information about natural landscapes is in the form of fields (digital elevation models) or images (mainly from remote sensing) Serving of field-based or image-based geospatial information via the semantic web will require robust computational methods for identifying and delimiting landscape features present implicitly in those fields and images

Classifications of landscape elements vary across cultures and languages For inorganic natural domains such as landforms and water bodies, natural variation usually forms a continuum, and categories for landscape elements thus may have a somewhat arbitrary component, contingent on landscape variation, lifestyle of the people, and linguistic and cultural history

Landscape Objects are Dependent on Classifications For many domains, the entities dealt with are bona fide objects that exist as objects independently of their definitions In contrast, many landscape elements are not bona fide objects, but are parts of a continuous land surface Thus different definitions of landscape element types may lead to a different subdivision of the land into objects

The above issues interact! The same landscape, represented by the same field-like data, may produce quite different delimited objects and object classes according to the definitions of words in different languages Higher-level feature-extraction methods probably will need to be language-specific A word-based ontology of natural landscapes developed on the basis of one natural language (e.g. English) probably will be inadequate for other languages

Prognosis The extent of these issues is not yet known, but several well-documented examples suggest that they will be a substantial impediments to multilingual semantic interoperability for landscape data using the word-centered approaches now common for knowledge-representation ontologies and for the Semantic Web

Conceptualizations Research is needed to document the terms and associated concepts that everyday people use to structure and talk about the common-sense world Ethnographic methods are needed to compile these terms and their definitions, which only then can be formalized à la Gruber!

Conceptualizations of Landscape We do not yet have a complete basis for doing this even for English, although standards such as SDTS and DIGEST, and knowledge bases such as WordNet and CYC, are good starts

Linguistic Relativism Different languages lexicalize different concepts Therefore, an ontology based on the meanings of words in one language generally will not be adequate for representing concepts for other languages

Landscape ‘Concepts’ That Have No Single Word in English “A spot of level ground in the mountains, surrounded by ridges” “A type of hollow in a sandhill, used as camping place, especially in cold weather” “A canyon wall receiving sunlight” “An ‘island’ of land completely surrounded by one or more younger lava flows” “An island of grassland left unburnt after a surrounding wildfire”

Example 1: Categories for Water Bodies

How Many Kinds of Water Bodies?

Conceptual Model for Water Bodies Kinds of bodies of water may be distinguished along several dimensions: Size Flowing or still Salt or fresh water Other aspects of water quality Origin Seasonality of water Seasonality of flow ... Different languages or cultures may give different weights to these factors

French and English Categorize Standing Water Bodies Differently

English

French

Example 1, Summary All ponds are étangs, but not all étangs are ponds The main defining character of an étang is that it has no surface outlet; English does not use that characteristic in category definitions If we were trying to translate an instance of the word étang from French to English, we would need to know what kind of water body we were referring to before we could choose the correct English word Word for word translation is not possible without added information because the concepts do not line up!

Example 2: “Hills” in Yindjibarndi

marnda

marnda bargu burbaa

mountain marnda bargu hill burbaa

mountain marnda bargu hill burbaa Scale difference in the terminological break bargu hill burbaa

(range) (ridge) mountain marnda bargu hill burbaa Scale difference in the terminological break bargu hill burbaa

A Many-to-Many relation (range) (ridge) mountain marnda Scale difference in the terminological break bargu hill burbaa A Many-to-Many relation

Summary: Marnda, Hill, etc. There is a many-to-many relation between {mountain, hill, ridge, range} and {marnda, bargu, burbaa} It would not be possible to correctly translate “marnda” into English or “hill” into Yindjibarndi without knowing the size of the landform being referred to This appears to be a different way to subdivide instances into categories, but based on similar principles

Example3: Watercourses in Yindjibarndi There are no permanent or even seasonal rivers or creeks in Yindjibarndi country. Larger watercourses have running water in them only after cyclones Permanent sources of water include permanent pools along the channels of the Fortescue River, as well as some permanent small springs, and soaks, where water can be obtained by digging

Watercourses and their beds in Yindjibarndi and English According to the dictionaries, Yindjibarndi has the following words: Wundu: “river(bed), gorge” (Anderson) Yijirdi: “small stream, rivulet” (Anderson) Manggurdu: “Fortescue River” (Anderson) Mangkurdu: “flood” (Von Brandenstein)

Creeks and Creek-beds in Yindjibarndi and English We currently believe that the linguists got confused about what was being named There are no permanently-flowing rivers or creeks in Yindjibarndi country Meanings of Yindjibarndi terms were (apparently) shoe-horned into an English-language, permanent stream ontology

“creek” or “creek bed” “creek” “creek bed” English ‘ontology’ and terminology “creek” or “creek bed” “creek” “creek bed”

Yindjibarndi ‘ontology’ and terminology “wundu” “yijirdi[?]” “wundu?”

Which is the feature, the water, or the creek bed?

Rivers and Riverbeds Yindjibarndi Dictionary “Correct” Wundu Yijirdi small stream, rivulet Trickle of water Manggurdu Fortescue River Strong, deep flow

Watercourse Summary The English ‘ontology’ has sizes of watercourses: rivers and creeks (Watercourses have beds) Yindjibarndi ‘ontology’ has two sizes of (dry) fluvial channels: wundu and garga Yindjibarndi ‘ontology’ also has two ‘sizes’ of flow events, strong deep flow (manggurdu) and trickles (yijirdi)

Terms for water features In Yindjibarndi permanent and temporary water features that otherwise appear similar are considered to be different kinds of features English, in contrast, treats permanence of water bodies and water courses as an attribute, and expresses it through adjectives like "temporary", "seasonal", "intermittent", or "ephemeral" Key distinction in English is still vs flowing water It seems (from recent fieldwork) that permanent water in Yindjibarndi is a yinda, whether still or flowing. Thus there is a significant difference between conceptualizations of water features.

Some General Conclusions from Yindjibarndi Study None of the Yindjibarndi terms for landscape features that we have examined in depth so far is exactly equivalent to one single term in English Yindjibarndi terms divide up sub-domains of geographic reality (somewhat) differently than do English terms The results support the basis Ethnophysiography hypothesis: people from different places and cultures apparently use different conceptual categories for geographic features

Study #4: Rating Landform Pictures Against Terms As part of a larger study of geospatial ontology, we asked subjects to rate how well a given landform term was exemplified by a particular picture Each subject was given one term and six photographs

Human Subjects Testing Each subject was given a page with six pictures of mountains, hills, etc., and one term that they were to compare to each photograph Four different pages of photos with 6 photos each were used, and three or four different terms were tested

The Subjects English: 334 subjects in Buffalo, with at least 25 judgments for each of the 16 photographs tested Spanish: 305 subjects in Guatemala City, with at least 13 judgments for each of the 16 photographs tested German: 65 subjects in Münster, with at least 20 judgments for each of the 6 photographs tested

The Photographs

The Terms English: Mountain, volcano, hill Spanish: Montaña, volcán, colina, cerro German: Berg, Vulkan, Hügel

The Ratings “For each photograph, please rate how good an example of a mountain it shows, on a scale from excellent (7) to poor (1)” Average scores were transformed so that unanimous “excellent” became 1.00 and unanimous “poor” became 0.00

Mountain (English) 1.00 0.50 0.00

Excellent Examples of Mountains 1.00 0.50 0.00

Good Examples of Mountains 1.00 0.50 0.00

Borderline Example of a Mountain 1.00 0.50 0.00

Poor Examples of Mountains 1.00 0.50 0.00

Very Poor Examples of Mountains 1.00 0.50 0.00

Mountain (English) 1.00 0.50 0.00

Volcano (English) 0.92 0.58 0.39 0.36 0.10 0.03 1.00 0.50 0.00

Vulkan (German) 0.91 0.33 0.28 0.19 0.00 0.11 1.00 0.50 0.00

Volcán (Spanish) 0.94 0.52 0.44 0.31 0.17 0.12 1.00 0.50 0.00

Volcano 0.94 0.52 0.44 0.31 0.17 0.12 Spanish 0.19 0.91 0.33 0.28 0.00 German 0.11 0.92 0.58 0.39 0.36 0.10 0.03 English 1.00 0.50 0.00

Volcano 0.94 0.52 0.44 0.31 0.17 0.12 Spanish 0.19 0.91 0.33 0.28 0.00 German 0.11 0.92 0.58 0.39 0.36 0.10 0.03 English 1.00 0.50 0.00

Volcano 0.94 0.52 0.44 0.31 0.17 0.12 Spanish 0.19 0.91 0.33 0.28 0.00 German 0.11 0.92 0.58 0.39 0.36 0.10 0.03 English 1.00 0.50 0.00

Volcano 0.94 0.52 0.44 0.31 0.17 0.12 Spanish 0.19 0.91 0.33 0.28 0.00 German 0.11 0.92 0.58 0.39 0.36 0.10 0.03 English 1.00 0.50 0.00

Volcano 0.94 0.52 0.44 0.31 0.17 0.12 Spanish 0.19 0.91 0.33 0.28 0.00 German 0.11 0.92 0.58 0.39 0.36 0.10 0.03 English 1.00 0.50 0.00

Volcano 0.94 0.52 0.44 0.31 0.17 0.12 Spanish 0.19 0.91 0.33 0.28 0.00 German 0.11 0.92 0.58 0.39 0.36 0.10 0.03 English 1.00 0.50 0.00

Volcano 0.94 0.52 0.44 0.31 0.17 0.12 Spanish 0.19 0.91 0.33 0.28 0.00 German 0.11 0.92 0.58 0.39 0.36 0.10 0.03 English 1.00 0.50 0.00

En Sp De En Sp De En Sp Sp De Mountain Volcano Hill

Volcán Agua, Guatemala Mountain Volcano Hill En Sp De En Sp De En Sp

Fuji-san En Sp De En Sp De En Sp Sp De Mountain Volcano Hill

A “Hill” in England Mountain Volcano Hill En Sp De En Sp De En Sp Sp

Conclusions for Study #4 For some images, the terms appear to translate well (i.e., similar ratings in all three languages)—robust cross-linguistic prototypes? For other images, ratings vary substantially across languages More research needed!

Summary and Conclusions Two kinds of research are needed before we can implement the Spatial Semantic Web for Landscape: One is to compile formal definitions of landscape feature types in languages of interest The other is to develop computational methods for delimiting and classifying landscape features from DEMs and images

What is needed? Geospatial information Real world A ‘Middle-ware’ of fundamental landform concepts Concept 4 Concept 2 Concept 5 Concept 1 Concept 3 “mountain” “montaña” “mont” “berg” “ ”

The landform concept modules (with weights and parameters) could be combined to provide formal definitions of the concepts associated with each landscape term A ‘Middle-ware’ of fundamental landform concepts Concept 4 Concept 2 Concept 5 Concept 1 Concept 3 “mountain” “montaña” “mont” “berg” “ ”

Geospatial information Real world A ‘Middle-ware’ of fundamental landform concepts The same landform concept modules can also be ‘executed’ against the DEM to detect and delimit regions that have the attributes embodied in the modules, and thus detect and delimit landform instances

Thanks. (For more information, see) http://www. geog. buffalo This research was supported in part by NSF Grants BCS-9975557 (David Mark and Barry Smith) and BCS-0423075 and BCS-0423023 (David Mark and David Stea)