Colour Words, Bayes and Language Complexity Mike Dowman 27 May, 2005.

Slides:



Advertisements
Similar presentations
A small taste of inferential statistics
Advertisements

1 Events and Their Probabilities Folks often want to know the probability of an event so they can plan their day (or whatever)
Color Theory Review. Color Theory Quiz 1. What is a primary color? a.Any color of the rainbow. b.A color made from mixing two others. c.A color that.
The cultural origins of colour categories Tony Belpaeme Artificial Intelligence Lab Vrije Universiteit Brussel.
Explaining Colour Term Typology with an Evolutionary Model Mike Dowman 24 November, 2005.
Etic From linguistics—sounds people really make toppot.
A (very) brief introduction to multivoxel analysis “stuff” Jo Etzel, Social Brain Lab
Chapter 10 Chi-Square Tests and the F- Distribution 1 Larson/Farber 4th ed.
What is Statistical Modeling
Ai in game programming it university of copenhagen Statistical Learning Methods Marco Loog.
© red ©
Minority Games A Complex Systems Project. Going to a concert… But which night to pick? Friday or Saturday? You want to go on the night with the least.
Cognitive modelling (Cognitive Science MSc.) Fintan Costello
LEARNING FROM OBSERVATIONS Yılmaz KILIÇASLAN. Definition Learning takes place as the agent observes its interactions with the world and its own decision-making.
Modelling language origins and evolution IJCAI-05 Demo THSim Tutorial modelling language evolution Paul Vogt.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Sept 19, 2005.
LEARNING FROM OBSERVATIONS Yılmaz KILIÇASLAN. Definition Learning takes place as the agent observes its interactions with the world and its own decision-making.
Basics of discriminant analysis
7-2 Estimating a Population Proportion
Color appearance of natural objects Thorsten Hansen, Sebastian Walter and Karl R. Gegenfurtner Department of Psychology University of Giessen Germany.
Colour Language 1: Typology, Psycholinguistics, Neurophysiology Mike Dowman Language and Cognition 3 October, 2005.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Innateness of colour categories is a red herring: insights from computational modelling Tony Belpaeme Artificial Intelligence Lab Vrije Universiteit Brussel.
COLOURS.
Hypothesis Testing:.
Colour Theory What’s the point? Colour Wheel By Sir Isaac Newton.
Broca’s Aphasia Paul Broca Language and the Brain First connections drawn:
Key Stage 1 Phonic Talk Wednesday 4 th November. Introduction Who are we? What we do in school What you can do at home.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
An Expression-Induction Model  Ten artificial people were created.  They could learn colour word denotations by observing other artificial people talking.
Learning Color Names from Real-World Images Joost van de Weijer, Cordelia Schmid, Jakob Verbeek Lear Team, INRIA Grenoble, France
Colour Language 2: Explaining Typology Mike Dowman Language and Cognition 5 October, 2005.
Colour CPSC 533C February 3, 2003 Rod McFarland. Ware, Chapter 4 The science of colour vision Colour measurement systems and standards Opponent process.
Where did plants and animals come from? How did I come to be?
Chapter 10 Chi-Square Tests and the F-Distribution
Where does color come from? A ray of light is the source of all color. Without light, color does not exist. Light is broken down into colors of the spectrum.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview.
GOODNESS OF FIT Larson/Farber 4th ed 1 Section 10.1.
Language and Linguistics
Topic 2: Intro to probability CEE 11 Spring 2002 Dr. Amelia Regan These notes draw liberally from the class text, Probability and Statistics for Engineering.
Non-parametric tests (chi-square test) Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Colour Theory Revision and Complementary Colours.
Research Methodology and Methods of Social Inquiry Nov 8, 2011 Assessing Measurement Reliability & Validity.
Probability and Chance Random Experiment An experiment is random if – The outcome depends on chance (we are not sure of the outcome (result)) – We can.
Thinking like a Scientist
Color Hue - “the name” of a wavelength of light…red, blue, etc. When an object reflects a certain wavelength we see the object as having that color.
Welcome to MM570 Psychological Statistics
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
COLORS. QUESTION Does color preference affect your choice?
DO LOCAL MODIFICATION RULES ALLOW EFFICIENT LEARNING ABOUT DISTRIBUTED REPRESENTATIONS ? A. R. Gardner-Medwin THE PRINCIPLE OF LOCAL COMPUTABILITY Neural.
Unit 2 The Nature of Learner Language 1. Errors and errors analysis 2. Developmental patterns 3. Variability in learner language.
Color naming A Computational model of Color Perception and Color Naming, Johann Lammens, Buffalo CS Ph.D. dissertation
Chi-Square Test (χ 2 ) χ – greek symbol “chi”. Chi-Square Test (χ 2 ) When is the Chi-Square Test used? The chi-square test is used to determine whether.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
Sampling Analysis. Statisticians collect information about specific groups through surveys. The entire group of objects or people that you want information.
Favorite Color Mrs. Baird May 21, Favorite Color Survey Results BlueGreenRedYellowOther
Section 10.1 Goodness of Fit © 2012 Pearson Education, Inc. All rights reserved. 1 of 91.
Basic Probability Rules
Watch Pete the Cat here:
Butterfly Maths Each caterpillar must be coloured the correct pattern for it to turn into a butterfly. Work out each problem to know how to colour each.
Chapter 10 Chi-Square Tests and the F-Distribution
Like.
Questions for lesson 3 Perception 11/27/2018 lesson 3.
Basic Probability Rules
What Color is it?.
GCSE Psychology Unit 2: Language, Thought & Communication
Probability and Chance
Presentation transcript:

Colour Words, Bayes and Language Complexity Mike Dowman 27 May, 2005

About me

Colour Term Typology There are clear typological patterns in how languages name colour.  neurophysiology of vision system  or cultural explanation? Constraints on learnable languages or an evolutionary process?

Basic Colour Terms Most studies look at a subset of all colour terms: Terms must be psychologically salient Known by all speakers Meanings are not predictable from the meanings of their parts Don’t name a subset of colours named by another term

Number of Basic Terms English has red, orange, yellow, green, blue, purple, pink, brown, grey, black and white. crimson, blonde, taupe are not basic. All languages have 2 to 11 basic terms Except Russian and Hungarian Some people dispute concept of basic colour term

Prototypes Colour terms have good and marginal examples  prototype categories People disagree about the boundaries of colour word denotations But agree on the best examples – the prototypes Berlin and Kay (1969) found that this was true both within and across languages.

Berlin and Kay (1969) Small set of possible colour term systems 98 Languages in study Only Cantonese, Vietnamese, Western Apache, Hopi, Samal and Papago didn’t fit the hierachy Berlin and Kay’s Implicational Hierarchy. purple pink orange grey white black red green yellow blue brown

World Colour Survey 110 minor languages (Kay, Berlin, Merrifield, 1991; Kay et al 1997; Kay and Maffi, 1999) All surveyed using Munsell arrays Black, white, red, yellow, green and blue seem to be fundamental colours They are more predictable than derived terms (orange, purple, pink, brown and grey)

Evolutionary Trajectories white + red + yellow + black-green-blue white + red + yellow + green + black-blue white-red-yellow + black-green-blue white + red-yellow + black-green-blue white + red + yellow + black + green-blue white + red-yellow + black + green-blue white + red + yellow + black + green + blue white + red + yellow-green-blue + black white + red + yellow-green + blue + black

Derived Terms Brown and purple terms often occur together with green-blue composites Orange and pink terms don’t usually occur unless green and blue are separate But sometimes orange occurs without purple Grey is unpredictable No attested turquoise or lime basic terms

Exceptions and Problems 83% of languages on main line of trajectory 25 languages were in transition between stages 6 languages didn’t fit trajectories at all  Kuku-Yalanji (Australia) has no consistent term for green  Waorani (Ecuador) has a yellow-white term that does not include red  Gunu (Cameroon) contains a black-green-blue composite and a separate blue term

Criticism of Kay Much more variability than Kay suggests – both within and across languages Criteria for distinguishing basic colour terms don’t work Colour is often conflated with other properties: texture, variegation, etc. Colour words can only be understood in relation to the rest of the language Colour words have religious and cultural significance  Saunders (1992), MacLaury (1997), Levinson (2001), MacKeigan (2005)

Neurophysiology and Unique Hues Red and green, yellow and blue are opposite colours De Valois and Jacobs (1968):  There are cells in the retina that respond maximally to either one of the unique hues, black or white.

Psycholinguistics: Rosch/Heider Free choice of colour chips:  People prefer to pick unique hues  True across cultures and languages Better memory for colour chips:  Can pick out chip better in array after 30s  Dani can learn names for unique hues more easily than for other colours But these last two results are disputed

Kay and McDaniel (1978) Universal colour categories Red, yellow, green, blue, black and white:  Derived from cell’s neural responses Purple, pink, brown and grey  Derived via fuzzy intersections Composite categories  Derived via fuzzy unions

Tony Belpaeme (2002) Ten artificial people Colour categories represented with adaptive networks CIE-LAB colour space used (red-green, yellow-blue, light-dark) Sometimes multi-generational, sometimes single generation

Guessing game Speaker tries to find a word that names a topic colour but not a context one If this fails he modifies his colour categories Otherwise word and topic + context shown to the hearer If hearer can distinguish topic from context, word-category association strengthened. Otherwise hearer is shown the correct topic, and adapts his colour category

Emergent Languages Coherent colour categories emerged that were shared by all the agents Colour space divided into a number of regions – each named by a different colour word But some variation between speakers And no explanation of Typology

Belpaeme and Bleys (2005) Colour terms represented using points in the colour space Colours chosen from natural scenes, or at random  Few highly saturated colours Emergent colour categories tend to be clustered at certain points in the colour space Similarity with WCS was greatest when both natural colours were used and communication was simulated

The Speaker makes up a new word to label the colour. Start The hearer hears the word, and remembers the corresponding colour. This example will be used to determine the word to choose, when it is the hearer’s turn to be the speaker. Yes (P=0.001) A speaker is chosen. A hearer is chosen. A colour is chosen. Decide whether speaker will be creative. No (P=0.999) The speaker says the word which they think is most likely to be a correct label for the colour based on all the examples that they have observed so far. My Model

Colour Space red - 7 orange purple blue - 30 green - 26 yellow - 19

Possible Hypotheses high probability hypothesis medium probability hypothesis low probability hypothesis

Equations Bayes’ Rule Probability of an accurate example at colour c within h if hypothesis h is correct Probability of an erroneous example at colour c R c is probability of remember an example at colour c R h is sum of R c for all c in hypothesis h R t is sum of R c for whole of the colour space

Probability of the data Problem – we don’t know which examples are accurate p is the probability for each example that it is accurate e is an example E is the set of all examples Probability for examples outside of hypothesis (must be inaccurate) Probability for examples inside of hypothesis (may be accurate or inaccurate)

Hypothesis Averaging We really want to know the probability that each colour can be denoted by the colour term  So, sum probabilities for all hypotheses that include the colour in their denotation  Doing this for all colours produces fuzzy sets Substituting into Bayes’ rule:

Urdu

Evolutionary Simulations Average lifespan (number of colour examples remembered) set at: 18, 20, 22, 24, 25, 27, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110 or simulation runs in each condition Languages spoken at end analysed Only people over half average lifespan included Only terms for which at least 4 examples had been remembered were considered

Analyzing the Results Speakers didn’t have identical languages  Criteria needed to classify language spoken in each simulation For each person, terms classified as red, yellow, green, blue, purple, orange, lime, turquoise or a composite (e.g. blue-green) Terms must be known by most adults Classification favoured by the most people chosen

Typological Results Percentage of Color Terms of each type in the Simulations and the World Color Survey

Derived Terms 80 purple terms 20 orange terms 0 turquoise terms 4 lime terms

Divergence from Trajectories 1 Blue-Red term 1 Red-Yellow-Green term 3 Green-Blue-Red terms Most emergent systems fitted trajectories: 340 languages fitted trajectories 9 contained unattested color terms 35 had no consistent name for a unique hue 37 had an extra term

Does Increased Salience of Unique Hues Matter?

Unique Hues Create More Regular Colour Term Systems 644 purple terms 374 orange terms 118 lime terms 16 turquoise terms Only 87 of 415 emergent systems fits trajectories

Adding Random Noise

Noise has Little Effect Derived terms: 60.6% purple 26.8% orange 9.9% lime 0.3% turquoise

How Reliable is WCS Data? Would a model that more closely replicated the WCS data be a better model? Field linguists tend to suggest that colours are much more messy than Kay et al suggest WCS is only a sample – not a gold standard Is data massaged to fit theories?

Number of Colour Terms Emerging

Coherence and Complexity What makes languages coherent? Shared communicative goals? A shared conceptual space? Feedback and negotiation? Consideration of others’ knowledge? What makes languages complex? More words aid communication? Many meanings leads to many words?

Taking Colours out of the Model Speaker says a new word to the hearer twice Start Yes (P=0.0001) A speaker is chosen. A hearer is chosen. Decide whether speaker will be creative No (P=0.9999) Speaker says a word he has heard at least twice to the hearer

Emergent Languages Average shared vocabulary always over 97%

Conclusions Level of creativity no noticeable effect (except when very high) Coherent languages emerge even with large number of people (1000)  Coherent languages emerge without feedback  And without consideration of other peoples’ language knowledge Languages are complex because we talk a lot  Not because complex languages help us to communicate

Coming Soon…  Syntax  Learnability  Minimum Description Length (MDL)  Minimum Message Length (MML)  Friday 3 rd June (same time, same place)