Presentation is loading. Please wait.

Presentation is loading. Please wait.

Bootstrapping April 3 2007 William Cohen. Prehistory Karl Friedrich Hieronymus, Freiherr von Münchhausen (11 May 1720 – 22 February 1797) was a German.

Similar presentations


Presentation on theme: "Bootstrapping April 3 2007 William Cohen. Prehistory Karl Friedrich Hieronymus, Freiherr von Münchhausen (11 May 1720 – 22 February 1797) was a German."— Presentation transcript:

1 Bootstrapping April 3 2007 William Cohen

2 Prehistory Karl Friedrich Hieronymus, Freiherr von Münchhausen (11 May 1720 – 22 February 1797) was a German baron who in his youth was sent to serve as page to Anthony Ulrich II, Duke of Brunswick-Lüneburg and later joined the Russian military. He served until 1750, in particular taking part in two campaigns against the Turks. Returning home, Münchhausen supposedly told a number of outrageous tall tales about his adventures. The Baron was born in Bodenwerder and died there as well.11 May172022 February 1797GermanbaronpageAnthony Ulrich II, Duke of Brunswick-LüneburgRussianTurkstall talesBodenwerder According to the stories, as retold by others, the Baron's astounding feats included riding cannonballs, travelling to the Moon, and escaping from a swamp by pulling himself up by his own hair. … In later versions he was using his own boot straps to pull himself out of the sea. [Wikipedia] cannonballsMoon

3 Prehistory “Bob Wilson is desperately trying to finish his doctoral thesis and has locked himself in his room in a marathon attempt to do so. His typewriter jams, and as he unjams it he hears someone say "Don't bother, it's hogwash anyway." The thesis, in fact, deals with time travel. The interloper is a man who seems strangely familiar, and might be recognizable without the two-day growth of beard and the black eye. …” “In computing, bootstrapping refers to a process where a simple system activates another more complicated system that serves the same purpose. It is a solution to the Chicken-and-egg problem of starting a certain system without the system already functioning. The term is most often applied to the process of starting up a computer, in which a mechanism is needed to execute the software program that is responsible for executing software programs …” [Wikipedia]Chicken-and-egg problemexecutesoftware

4 Some more recent history - 1 Idea: write some specific patterns that indicate A is a kind of B: 1.… such NP as NP (“at such schools as CMU, students rarely need extensions”) 2.NP, NP, or other NP (“William, Carlos or other machine learning professors”) 3.NP including NP (“struggling teams including the Pirates”) 4.NP, especially NP (prestigious conferences, especially NIPS) [Coling 1992] Results: 8.6M words of Grolier’s encyclopedia  7067 pattern instances  152 relations Many were not in WordNet.

5 Some history – 2a Idea: exploit “pattern/relation duality”: 1.Start with some seed instances of (author,title) pairs (“Isaac Asimov”, “The Robots of Dawn”) 2.Look for occurrences of these pairs on the web. 3.Generate patterns that match the seeds. - URLprefix, prefix, middle, suffix 4.Extract new (author, title) pairs that match the patterns. 5.Go to 2. [some workshop, 1998] Unlike Hearst, Brin learned the patterns; and learned very high-precision, easy- to-match patterns. Result: 24M web pages + 5 books  199 occurrences  3 patterns  4047 occurrences + 5M pages  3947 occurrences  105 patterns  … 15,257 books * with some manual tweaks

6 Some history – 2b Idea: exploit “pattern/relation duality”: 1.Start with some seed instances of (author,title) pairs (“Isaac Asimov”, “The Robots of Dawn”) 2.Look for occurrences of these pairs on the web. 3.Generate patterns that match the seeds. - URLprefix, prefix, middle, suffix 4.Extract new (author, title) pairs that match the patterns. 5.Go to 2. Result: 24M web pages + 5 books  199 occurrences  3 patterns  4047 occurrences + 5M pages  3947 occurrences  105 patterns  … 15,257 books * with some manual tweaks Instances Patterns Occurrences

7 Some history – 3 [COLT 98]

8 Some history – 3b InstancesPatterns Occurrences Instances/Occurrences Patterns How to filter out “bad” instances, occurrences, patterns?

9 Bootstrapping BM’98 Brin’98 Hearst ‘92 Scalability, surface patterns, use of web crawlers… Learning, semi-supervised learning, dual feature spaces… Deeper linguistic features, free text…

10 Bootstrapping BM’98 Brin’98 Hearst ‘92 Collins & Singer ‘99 Scalability, surface patterns, use of web crawlers… Learning, semi-supervised learning, dual feature spaces… Deeper linguistic features, free text… Boosting-based co-train method using content & context features; context based on Collins’ parser; learn to classify three types of NE

11 Bootstrapping BM’98 Brin’98 Hearst ‘92 Scalability, surface patterns, use of web crawlers… Learning, semi-supervised learning, dual feature spaces… Deeper linguistic features, free text… Collins & Singer ‘99 Riloff & Jones ‘99 Hearst-like patterns, Brin-like bootstrapping (+ “meta-level” bootstrapping) on MUC data

12 Bootstrapping BM’98 Brin’98 Hearst ‘92 Scalability, surface patterns, use of web crawlers… Learning, semi-supervised learning, dual feature spaces… Deeper linguistic features, free text… Collins & Singer ‘99 Riloff & Jones ‘99 Cucerzan & Yarowsky ‘99 EM like co-train method with context & content both defined by character-level tries

13 Bootstrapping BM’98 Brin’98 Hearst ‘92 Scalability, surface patterns, use of web crawlers… Learning, semi-supervised learning, dual feature spaces… Deeper linguistic features, free text… Collins & Singer ‘99 Riloff & Jones ‘99 Cucerzan & Yarowsky ‘99 Etzioni et al 2005 Rosenfeld and Feldman 2006 … … Stevenson & Greenwood 2005 De-emphasize duality, focus on distance between patterns.

14 Stevenson & Greenwood Instances/Occurrences Patterns Pattern- pattern- from is semantic similarity (Wordnet) Flow from pattern- pattern depends on empirical similarity (i.e. overlapping occurrences in corpus)

15 Bootstrapping BM’98 Brin’98 Hearst ‘92 Scalability, surface patterns, use of web crawlers… Learning, semi-supervised learning, dual feature spaces… Deeper linguistic features, free text… Collins & Singer ‘99 Riloff & Jones ‘99 Cucerzan & Yarowsky ‘99 Etzioni et al 2005 Rosenfeld and Feldman 2006 … … Stevenson & Greenwood 2005 Clever idea for learning relation patterns & strong experimental results

16 Rosenfeld & Feldman Instances  Occurrences as before. Vary “positive” occurrences to get near-miss “negative” occurrences, using asymmetry, disjointness, etc. Learn patterns in a (moderately) expressive but easy-to-match language (NPs from OpenNLP).

17 Know It All

18 Architecture Set of predicates to consider + two names for each ~= [H92]

19 Architecture

20 Bootstrapping - 1 1.Submit the queries & apply the rules  initial seeds. 2.Evaluate each seed with each discriminator U: e.g., compute PMI stats like: |hits(“city Boston”)| / |hits(“Boston”)| 3.Take the top seeds from each class and call them POSITIVE and use disjointness, etc to find NEGATIVE seeds. 4.Train a NaiveBayes classifier using thresholded U’s as features.

21 Bootstrapping - 2


Download ppt "Bootstrapping April 3 2007 William Cohen. Prehistory Karl Friedrich Hieronymus, Freiherr von Münchhausen (11 May 1720 – 22 February 1797) was a German."

Similar presentations


Ads by Google