Reading to Learn Q1 review (7/6/05) Peter ClarkMichael Glass Phil HarrisonTom Jenkins John ThompsonRick Wojcik Boeing Phantom Works.

Slides:



Advertisements
Similar presentations
Acids, Bases, and pH. Acids and Bases Acids produce H + ions which react with water to form hydronium (H 3 O + ) ions Bases produce OH - ions.
Advertisements

Acids, Bases, and Salts Chapter 19.
Dr. Namphol Sinkaset Chem 152: Introduction to General Chemistry
Acids and Bases. Acids & Bases These were introduced in Chapter 4 Arrhenius: Acid = any substance that produces H + in soution. Base = any substance that.
Reading to Learn Q4 Review Peter Clark John Thompson Phil Harrison Bill Murray.
Monday, May 2 nd : “A” Day Agenda  Begin chapter 15: Acids & Bases  15.1: “What Are Acids & Bases?” Strong acid, weak acid, strong base, weak base,
Acids and Bases: Theory Arrhenius theory of acids Arrhenius definition of an acid: any compound that contains hydrogen and produces H + (H 3 O + when.
Chapter 18 Acids, Bases, and Salts. Acids Acids Sour - lemons Sour - lemons Feel like water Feel like water React vigorously with metal React vigorously.
CHAPTER 16 ACIDS AND BASES. I. INTRODUCTION A) The Oldest Theory is the Arrhenius Theory 1) You should be familiar with this one from your earlier chemistry.
Acids, Bases & Salts n Acid Properties : n Sour taste, react with metals to produce hydrogen gas, electrolytes, affect indicators (turns blue litmus paper.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Modern Theories of Acids & Bases The Arrhenius and Bronsted-Lowry Theories.
Unit: Chemical Interactions Chapter 8: Solutions When substances dissolve to form solutions, the properties of the mixture change. 8.1: A solution is a.
Chapter 14: Acids and Bases. Initial concepts of Acids and bases First, acids were recognized as substances with a sour taste, but this was a dangerous.
Acids and Bases.
Acids and Bases Chemistry 2013.
 It shouldn't be hard for you to name several common acids, but you might find that listing bases is just a little more difficult.  Here's a partial.
Chapter 19 Acids and Bases.
Chapter 9 Charge-Transfer Reactions: Acids and Bases and Oxidation-Reduction Copyright  The McGraw-Hill Companies, Inc. Permission required for reproduction.
Chemistry 121/122 Acids, Bases and Salts. Properties of Acids Acids, in foods, give a sour or tart taste Aqueous solutions of acids are electrolytes (conduct.
The Chemistry of Acids and Bases
Chapter 18 Acids and Bases Acids Arrhenius Acid – a compound containing hydrogen that ionizes to produce hydrogen ions (H + ) in water Names: Hydrochloric.
Copyright © McGraw-Hill Education. Permission required for reproduction or display Chapter 13: Acids and Bases.
Acids and Bases. Acids are substances that turn blue litmus red, and usually react with metals such as zinc, releasing hydrogen. Examples: hydrochloric.
6.1 – Introduction to Acids and Bases Unit 6 – Acids and Bases.
Friday, April 25 th : “A” Day Monday, April 28 th : “B” Day Agenda  Hand back/go over Ch. 14 Tests  Begin chapter 15: Acids & Bases  15.1: “What.
Acids and Bases The concept of acidic and basic solutions is perhaps one of the most important topics in chemistry. Acids and bases affect the properties.
Acids and Bases.
Properties of Acids Acids have a sour taste They react with “active” metals –i.e. Al, Zn, Fe, but not Ag or Au 2 Al + 6 HCl  AlCl H 2 –Corrosive.
Look at some examples, and see if you can figure out what an acid is… HCl (Hydrochloric acid) HNO 3 (Nitric acid) H 2 SO 4 (Sulfuric acid) H 3 PO 4.
 Knowledge Acquisition  Machine Learning. The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
ACIDS and BASES Unit 10, Chapter 19
The Reading to Learn Project Peter Clark Phil Harrison Tom Jenkins John Thompson Rick Wojcik (Boeing Phantom Works) David Israel (SRI)
Acids and Bases: Introduction Section Objectives Identify the physical and chemical properties of acids and bases Classify solutions as acidic,
Aim: What are acids and bases?. Acids 1.Acids can be strong or weak electrolytes in aqueous solutions. 2.Acids (ex: HCl) react with certain metals to.
What are Acids and Bases ? There are three main theories to describe what acids and bases are In general each theory widens what chemical reactions can.
Chapter 14 Acids and Bases. Chapter 14 Table of Contents Copyright © Cengage Learning. All rights reserved The Nature of Acids and Bases 14.2Acid.
Introduction to Acids and Bases Science 10. Introduction to Acids Do your muscles ever feel sore after a heavy workout? If so, you can blame a buildup.
Acid-Base Theories The “Boyz”. Acid and Base Theories2 Arrhenius Theory of Acids Acid: molecular substances that breaks-ups in aqueous solution into H+
Acquiring and Using World Knowledge using a Restricted Subset of English Peter Clark, Phil Harrison, Tom Jenkins, John Thompson, Rick Wojcik Boeing Phantom.
Acids and Bases.
Acids and Bases Unit 12. Properties of an Acid  Sour taste  Turns litmus paper red (and responds uniquely to other indicators)  Reacts with:  Hydroxide.
11111 Chemistry 132 NT A man is rich in proportion to the number of things he can let alone. Henry David Thoreau.
Acid-Base Equilibria: The Nature of Acids and Bases What makes an Acid an Acid? An acid possess a sour tasteAn acid possess a sour taste An acid dissolves.
Ch # 14 Acids, Bases and Salts. Acid Properties sour taste change the color of litmus from blue to red. react with –metals such as zinc and magnesium.
Acid and Base Equilibria The concept of acidic and basic solutions is perhaps one of the most important topics in chemistry. Acids and bases affect the.
1 Substances are identified as acids based on their properties. Acids taste sour. Acids give the sour taste to many common foods. Acids will change the.
Acids and Bases Chapter 20.
Acids, Bases, and Salts Chapter 18. Properties of Acids and Bases When dissolved in distilled water, they look the same Taste –Acid developed from the.
Unit 6: Acid-Base Equilibria
ACIDS AND BASES Questions may involve any of the following: description of acids and bases in terms of proton transfer calculations involving K w and pH.
15 Acids and Bases Contents 15-1 The Bronsted-Lowry Definitions 15-2 The Ion Product of Water, Kw 15-3 The pH and Other “p” Scales 15-4 Concentrations.
Acid and Base Equilibria
Acids and Bases.
Acids, Bases, & Salts Acid/Base Equilibrium
Acids, Bases, and pH. Acids and Bases Acids produce H + ions Bases produce OH - ions.
16.1 Properties of Acids and Bases
Lesson 1 Arrhenius Theory Bronsted-Lowry Theory ACIDS AND BASES.
Acids and Bases – Acid Strength and K a.
Acids and Bases.
Acids & Bases Chapter 15 & 16. Acids Have a sour taste Affect indicators React with bases to produce salt & water Conduct an electric current Examples.
Chapter 16 Acids and Bases. Characteristics Acids: Sour taste –Lemons, Oranges (citric acid) Bases: Slippery, bitter taste –Soaps Change the color of.
© Houghton Mifflin Harcourt Publishing Company Acids 1.Aqueous solutions of acids have a sour taste. 2.Acids change the color of acid-base indicators.
“K” Chemistry (part 3 of 3) Chapter 15: Acids and Bases.
Acids and Bases. Acid and Base Overview Acids and bases are a group of compounds that combine to form a salt. A salt is an ionic solid made from a metal.
Chapter 1 Lecture Introduction and Review Organic Chemistry, 8 th Edition L. G. Wade, Jr.
Acids and Bases.
Acids and Bases.
Acid-Base Equilibria: The Nature of Acids and Bases
Acids and Bases.
Presentation transcript:

Reading to Learn Q1 review (7/6/05) Peter ClarkMichael Glass Phil HarrisonTom Jenkins John ThompsonRick Wojcik Boeing Phantom Works

Agenda Introduction 1. Textual knowledge to CPL 2. CPL to logic: How CPL is interpreted Processing some CPL (demo) 3. Knowledge Integration Extracting knowledge from text

LbR Framework Corpus Using what you know to get more Knowledge Repository “Worldview” Knowledge Acquisition Loop Robust Reasoning Tasks Introspection Solidification Loop Knowledge Integration

Corpus Using what you know to get more Knowledge Integration Knowledge Repository “Worldview” Knowledge Acquisition Loop Robust Reasoning Tasks Introspection Solidification Loop CPL or Logic LbR Framework

Overview What does a person do? a) Start aready knowing something about a domain + general knowledge b) Read; existing knowledge helps him/her understand the new material c) integrate the new knowledge into pre-existing knowledge d) can now perform new tasks – In its full and unrestricted form, is too difficult to be implemented – BUT: significant, partial approaches are feasible

Reduced Version Select a domain where KB exists (chemistry) Manually reformulate a section of text into controlled English (CPL) Automatically process that controlled English to generate new knowledge Integrate the new knowledge into the KB Rationale: Two key problems in reading to learn: full natural language processing knowledge integration This approach separates them and can focus on (2)

Corpus Using what you know to get more Knowledge Integration Knowledge Repository “Worldview” Knowledge Acquisition Loop Robust Reasoning Tasks Introspection Solidification Loop CPL or Logic Step 1: Unrestricted NL to CPL

Step 1: Possible methods Texts (unrestricted English) Text in restricted English Manual reformulation Machine “translation” Locate subset in simple English Knowledge extraction from corpora

Step 1: Possible methods Texts (unrestricted English) Text in restricted English Manual reformulation Machine “translation” Locate subset in simple English Knowledge extraction from corpora

Reformulating Chemistry Text into CPL John A. Thompson

Selecting the important text Textbook: “Acids have a sour taste (for example, citric acid in lemon juice) and cause certain dyes to change color (for example, litmus turns red on contact with acids). Indeed, the word acid comes from the Latin word acidus, meaning sour or tart.” Selected and reworded as CPL: “Acids have a sour taste.” “Acids cause some dyes to change color.” Judgment calls regarding “what might be on the test” Simplified rewordings – no pronouns or complex sentences, see CPL User Guide

Example 2: Text to CPL Textbook: “Sodium hydroxide is an Arrhenius base. Because NaOH is an ionic compound, it dissociates into Na+ and HO- ions when it dissolves in water, thereby releasing OH- ions into the solution.” CPL: “Sodium hydroxide is an Arrhenius base.” “NaOH is sodium hydroxide.” [inserted background knowledge] “NaOH is an ionic compound.” “NaOH dissolves in water.” “NaOH dissociates in water.” “The dissociating produces Na-plus ions and OH-minus ions.”

Example 3: Text to CPL Textbook: “Some substances can act as an acid in one reaction and as a base in another. For example, H2O is a Bronsted-Lowry base in its reaction with HCl and a Bronsted-Lowry acid in its reaction with NH3. A substance that is capable of acting as either an acid or a base is called amphoteric.” CPL: “Some substances sometimes act as a Bronsted-Lowry acid and sometimes act as a Bronsted-Lowry base.” [required “and”] “These substances are called amphoteric substances.” [vocab] “H2O acts as a Bronsted-Lowry base in a reaction with HCl.” “H20 acts as a Bronsted-Lowry acid in a reaction with NH3.” “Therefore, H2O is an amphoteric substance.” [deduced]

Rewording generics “Acids have a sour taste” is a generic sentence about a class of things – the textbook is full of generics! One interpretation: “Every instance of an acid has a sour taste” But is it “every” instance, or just “typically”? Another interpretation: “If a person is tasting an acid, then the person is experiencing a sour taste” We are planning to do some automatic interpretation of generic sentences in the future Our short-term strategy is to reword every generic sentence to another form, especially to an if-then rule

Rewording generics - 2 Another example: “Acids cause some dyes to change color” One interpretation: “There are some dyes that change color when in contact with any acid” Another interpretation: “For each acid there is some dye that changes color when in contact with the acid” Possible CPL rewording as an if-then rule: “If an acid-sensitive dye is in contact with an acid, then the acid is causing the dye to change color” Bottom line: Generics are a major issue with textbook knowledge

CPL now includes if-then rules Rewriting generic sentences required if-then rules to be added to CPL Examples of if-then rules in CPL: “If HCl is immersed in water, then the HCl is dissolving in the water.” “If HCl is dissolving water, then each molecule of the HCL is reacting with a molecule of the water.” “If an HCl molecule is reacting with a water molecule, then an H-plus ion is transferring from the HCl molecule to the water molecule.”

Connecting verb and noun forms Example: reacts vs. reaction “Hydrogen chloride gas in water reacts with the water” “The reaction produces H-plus ions and Cl-minus ions The CPL writer assumes that the interpreter will make the connection from the verb to the related noun in the next sentence We are building a large set of verb-noun relations to handle these cases

Gross vs. molecular events Chemistry text switches between: gross-level events involving substances reactions between two molecules Example: “When HCl dissolves in water, we find that the HCl molecule transfers an H+ ion (a proton) to a water molecule” In CPL, we interpret “HCl” as the gross-level substance, and “HCl molecule” as the molecular-level individual If-then rules carry the logic from the gross level to the molecular level: “If HCl is dissolving water, then each molecule of the HCL is reacting with a molecule of the water.”

Issues with fuzzy information Examples: “An H-plus ion sometimes reacts with an H2O molecule” “A molecule of a Bronsted-Lowry acid can donate a proton to another substance” “The NH4 is mostly solid particles” “Some substances containing hydrogen are not acids” “An ion of a Bronsted-Lowry acid must have a hydrogen atom.” These are easy to state in CPL, but what logical representation should be produced? This is a universal problem in any kind of language interpretation We are developing solutions as we progress

The textbook teaches by example Example: “A molecule of a Bronsted-Lowry acid can donate a proton to another substance” “An HCl molecule in water donates a proton to an H2O molecule” “Therefore, the HCl molecule acts as a Bronsted-Lowry acid” Text is using an example to teach the student how to make similar deductions Unclear how to capture this in CPL – how to specify where the deductive chain begins? Perhaps we should skip the examples and only enter the general principles being taught?

Teaching by hypothetical situations Example: “[Assume that] H2O is a stronger base than X-minus in Equation 16.9” “[Assume that] X-minus is the conjugate base of HX in Equation 16.9” “H2O extracts the proton from HX in the reaction in Equation 16.9” “The reaction produces H3O-plus and X-minus” Therefore, the equilibrium is on the right side of Equation 16.9” How can CPL make it clear that this is a hypothetical? Note that X stands for a variety of atoms So the hypothetical teaches the student a useful pattern How should we represent and reason about this?

Real world vs. written equations The textbook switches between the real world and the syntax of written chemical equations: “If H2O (the base in the forward reaction) is a stronger base than X- (the conjugate base of HX), then H2O will abstract the proton from HX to produce H30+ and X-. As a result, the equilibrium will lie to the right.” The CPL author must be careful to mention the equation number when referring to its syntax: “In Equation 16.9 H2O is the base in the forward reaction.” “The equilibrium is on the right side of Equation 16.9”

Textbook figures and tables A textbook is not just text! Some figures and diagrams can be re-expressed in text, but others cannot be Tables (such as 16.4, a listing of the relative strengths of some conjugate acid-base pairs) can be converted to many lines of CPL text We are omitting sample & practice exercises from the CPL

Textbook to CPL: Conclusions Authors can learn to restate the essential parts of a textbook in simple CPL sentences Most textbook sentences contain ambiguous “generics” about classes of things For now, we can disambiguate generics by rewriting them as if-then rules in CPL For most CPL sentences, we can process them to get an adequate logical representation (with some user assistance) Special problems involving fuzzy statements and hypothetical examples require further work

Corpus Using what you know to get more Knowledge Integration Knowledge Repository “Worldview” Knowledge Acquisition Loop Robust Reasoning Tasks Introspection Solidification Loop CPL or Logic Step 2: CPL Interpretation

“HCl is immersed in water” Parser & LF Generator Word sense disambiguator Relational disambiguator Coreference identifier Structural reorganizer HClWater Immerse object is-inside-of (_HCl13320 instance_of HCl-Substance) (_Water13321 instance_of Water) (_Immerse13319 instance_of Move-Into) (_Immerse13319 object _HCl13320) (_Immerse13319 is-inside-of _Water13321) World Knowledge Linguistic Knowledge Step 2: CPL Interpretation (overview)

CPL Processing Spots coreferences across sentences Handles nominalizations (“reacts”, “the reaction”) Use of WordNet to coerce text to KB’s ontology Set of heuristics for identifying semantic relations Interprets rules, as well as ground facts Rules are in KM (inference-capable) Can be used in interactive or automatic mode

Integrating Knowledge from Reading into a Knowledge Base Michael Glass Boeing Phantom Works

Corpus Using what you know to get more Knowledge Integration Knowledge Repository “Worldview” Knowledge Acquisition Loop Robust Reasoning Tasks Introspection Solidification Loop CPL or Logic Step 3: Knowledge Integration

The Task After knowledge is input through CPL it must integrated with existing knowledge in the KB. This task carries with it three main problems. Missing Concepts: The new knowledge may contain concepts the KB does not have. Conflicting Concepts: The new knowledge may use concepts in the KB in an incompatible way. Elaboration Requires Updates: The knowledge base may be resistant to new knowledge.

1. Missing Concepts Consider: If HCl is immersed in water, then the HCl is dissolving in the water. The knowledge base does not know the concept “immerse” or “dissolve”. Even if they were added in a superficial way, the knowledge base would have no axioms allowing it to reason about immersion or dissolving. This is the simplest type of mismatch. It is relatively easy to detect. It simply requires adding the concepts to the KB or the user may restrict himself to concepts the KB already has.

1. Missing Concepts: Solutions Add the concepts to the KB. The concepts could be added selectively as needed and (partially) axiomatized. The concepts could be added in bulk in a vacuous way. Defining new concepts through CPL. Restrict the user to the set of concepts in the KB. In many cases a similar concept in the knowledge base may serve just as well.

2. Conflicting Concepts Two significant classes of conflicting concepts: Genuine Conflicting Concepts: A concept in the KB may simply not match what is intended. Naïve Encodings: The KB may expect knowledge structured in a specific way.

2. Conflicting Concepts Consider: If HCl is dissolving water, then each molecule of the HCl is reacting with a molecule of the water. The knowledge base has a concept of a Reaction, but the inputs are required to be Chemicals (aggregates of molecules). NOT Molecules. The reaction concept is well axiomatized, but some of these axioms are not applicable to molecules.

2. Conflicting Concepts This might be considered another case of missing concepts. The KB has concepts for reasoning about reactions at the macroscopic level, but not at the molecular level.

2. Conflicting Concepts Consider: If an HCl molecule is reacting with a water molecule, then an H-plus ion is transferring from the HCl molecule to the water molecule. Transferring is in the KB, but it is axiomatized in a way consistent with a person transferring an object to another person. The object transferred must be in the “possession” of the donor.

2. Naïve Encodings, A Special Case of Conflicting Concepts There can be a mismatch in the form in which a new piece of knowledge is stated and the form in which it is expected. A chemistry textbook might refer to the equilibrium constant of a reaction rather than the equilibrium constant of a equilibrium reaction. The Chemistry knowledge base expects only equilibrium reactions to have equilibrium constants.

2. Naïve Encodings Also the CPL parser may produce KM that is not quite right: Produced by CPL: (a Chemical with (color (*red))) Expected by KM: (a Chemical with (color ((a Color-Value with (value (*red)))))) The difference is purely structural.

2. Conflicting Concepts: Solutions First the conflict must be detected Constraint violations: Certainly if a reaction requires Chemical raw-materials and it is given Molecules instead, there must be a conflict. Resemblance check: If the knowledge looks totally novel, such as a Transfer from one Molecule to another, it may be a conceptual conflict. To fix a naïve encoding an automatic solution may be used to coerce the concepts into an acceptable form. (James Fan’s Loosespeak)

3. Elaboration Requiring Updates Two significant ways a KB may require updates to add knowledge: Closed World Assumption: The KB may implicitly or explicitly assume it already knows everything about a given topic. Unstated Assumptions: The KB designers may have built the KB with some assumptions about the circumstances it will reason about.

3. Updates and the Closed World Assumption Some rules in the KB close off the KB to further elaboration. The else part of an if…then…else may give a default value that prevents other components from concluding anything else.

3. Unstated Assumptions Often a knowledge base will be built with a set of unstated assumptions about how it will be used. These assumptions may lead to assertions in the knowledge base stated as always true, but really true only under certain circumstances. This can result in knowledge that is difficult to extend.

3. Unstated Assumptions: Example Consider a knowledge base meant to deal with chemical reactions at room temperature and standard pressure. This knowledge base might include the assertion that water is a liquid. (every Water has (state (*liquid))) However, if the scope of the knowledge base is extended to consider a range of temperatures, the knowledge base is not simply incomplete, but wrong.

3. Elaboration Requiring Updates: Solutions First the conflict must be detected. The knowledge base could forward chain for a while on concepts related to the new knowledge. Resolving the conflict is difficult in general. It may entail a case of the attribution problem.

Conclusion and Future Directions Missing Concepts Allow additions to ontology through CPL Naïve Encodings Loosespeak interpreter Genuine Conflicting Concepts and Elaboration Requiring Updates Detection through constraint checks, resemblance check and forward chaining. Correction assisted by feedback.

Acquiring Knowledge from Reading Phil Harrison Boeing Phantom Works

Corpus Using what you know to get more Knowledge Integration Knowledge Repository “Worldview” Knowledge Acquisition Loop Robust Reasoning Tasks Introspection Solidification Loop CPL or Logic Step 1: Unrestricted NL to CPL - an additional approach

The Problem Uncontrolled (non CPL) text is difficult for computers to analyze. Advances are needed in all areas of NLP: grammar, WSD, semantic representation, and discourse processing. A method is needed for extracting as much knowledge as possible from text.

Tuple extraction The Parse trees are a source of “head- complement” or “head-modifier” relations: From “The heavy man bought an expensive book”  (S “man” “buy” “book”) (AN “heavy” “man”) (AN “expensive” “book”) “Books can be bought” “Men can be heavy” “Books can be expensive” Even incorrect parses can generate some valid tuples.

Examples from chemistry Acids can be strong or weak. Acids can be vitamins. Acids can release ions. Bases can turn litmus. Carbonates can form CO2. Chlorides can become ions. Concentrations can be measured.

Uses of tuples Extracted tuples can guide parsing. Human review of tuples is necessary. Iteration over a corpus improves accuracy and allows more knowledge extraction. The generic sentences derived from tuples can be integrated into a knowledge base.

Interpretation of generic sentences Generics are characterized by the use of bare plurals, mass terms, or adverbs of quantification. Examples: “Birds fly” “Ice is cold” “John usually drinks a beer” Interpretation of generics is problematic, but a substantial amount of work has been done.

Summary This is not a method for full and general reading. But, can potentially acquire certain types of knowledge very quickly Illustrated in the domain of chemistry

Overall Future Directions CPL extensions CPL processing of generics Appropriate structure of KB for effective knowledge integration Degree of automation possible now, 5 years from now Evaluation