Enabling Domain Experts to Convey Questions to a Machine: A Modified, Template-Based Approach Peter Clark (Boeing Phantom Works) Ken Barker, Bruce Porter.

Slides:



Advertisements
Similar presentations
Do Now:.  TRANSCRIPTION: process that makes an RNA copy of DNA.  RNA is single-stranded, and T is replaced by U (A-U; G-C)  RNA polymerase makes RNA,
Advertisements

A Library of Generic Concepts for Composing Knowledge Bases Ken Barker, Bruce UTAustin Peter
SBI 4U November 14 th, What is the central dogma? 2. Where does translation occur in the cell? 3. Where does transcription occur in the cell?
Cell Division, Genetics, Molecular Biology
DNA Transcription and Translation
From DNA to Proteins Chapter 13.
Transcription & Translation Biology 6(C). Learning Objectives Describe how DNA is used to make protein Explain process of transcription Explain process.
Protein Synthesis Ordinary Level. Lesson Objectives At the end of this lesson you should be able to 1.Outline the steps in protein synthesis 2.Understand.
2.7 DNA Replication, transcription and translation
13.3: RNA and Gene Expression
Protein Synthesis.
TRNA. Transfer RNA (tRNA) is a small molecule, existing as a single- strand that is folded into a clover-leaf shape.
8.4 DNA Transcription 8.5 Translation
How Genes Work. Transcription The information contained in DNA is stored in blocks called genes  the genes code for proteins  the proteins determine.
RNA carries DNA’s instructions.
Chapter 13.2 (Pgs ): Ribosomes and Protein Synthesis
Biology 10.1 How Proteins are Made:
NOTES: Chapter 13 - RNA & Protein Synthesis
FROM DNA TO PROTEIN Transcription – Translation We will use:
Knowledge Entry as the Graphical Assembly of Components Peter Clark, John Thompson (Boeing) Ken Barker, Bruce Porter (Univ Texas at Austin) Vinay Chaudhri,
Protein Synthesis Chapter 13. Protein Synthesis  How does your DNA eventually lead to your different phenotypes (hair color, eye color, etc)
Protein Synthesis 12-3.
AP Biology Discussion Notes Wednesday 01/28/2015.
FROM DNA TO PROTEIN Transcription – Translation. I. Overview Although DNA and the genes on it are responsible for inheritance, the day to day operations.
Gene Expression How is the information in DNA used to determine an organism’s characteristics?
DNA to Proteins 3-4.
SC.912.L.16.5 Protein Synthesis: Transcription and Translation.
Introduction to protein synthesis Some key aspect to learn and understand A Protein is a polymer which consists of monomers (repeating building blocks),
DNA, RNA, and Proteins Section 3 Section 3: RNA and Gene Expression Preview Bellringer Key Ideas An Overview of Gene Expression RNA: A Major Player Transcription:
12-3 RNA and Protein Synthesis
Chapter 18 Gene Expression & Protein Synthesis Chemistry 20.
DNA The Code of Life.
3.A.1 DNA and RNA Part IV: Translation DNA, and in some cases RNA, is the primary source of heritable information. DNA, and in some cases RNA, is the primary.
Knowledge Entry as the Graphical Assembly of Components Peter Clark, John Thompson (Boeing) Ken Barker, Bruce Porter (Univ Texas at Austin) Vinay Chaudhri,
Transcription and Translation How genes are expressed (a.k.a. How proteins are made) Biology.
CHAPTER 13 RNA and Protein Synthesis. Differences between DNA and RNA  Sugar = Deoxyribose  Double stranded  Bases  Cytosine  Guanine  Adenine 
Transcription Objectives: Trace the path of protein synthesis.
DNA determines your unique characteristics. A Review… DNA is the instructions for making proteins. DNA determines the structure and function of the protein.
8.5 Translation KEY CONCEPT Translation converts an mRNA message into a protein.
Welcome to class 1/19/16 – 1/20/16  Turn in Check for understanding (3 of them)  Warm up  Notes on RNA and Transcription process  Complete check.
Protein Synthesis Biology 12. Genes  Proteins Genes: a sequence of nucleotides in DNA that performs a specific function. Each gene contains the instructions.
AP Biology Discussion Notes Wednesday 2/10/2015. Goals for Today Be able to describe how DNA & RNA molecules differ from each other. Be able to name and.
Chapter 13 GENE FUNCTION. A. Comparison of DNA & RNA.
CHAPTER 10 “HOW PROTEINS ARE MADE”. Learning Targets  I will compare the structure of RNA with that of DNA.  I will summarize the process of transcription.
From Genes to Proteins. Transcription in prokaryotes Simple process No nucleus; ribosomes lie close to the DNA in cytoplasm 1 st part of the mRNA attaches.
12-3 RNA and Protein Synthesis Page 300. A. Introduction 1. Chromosomes are a threadlike structure of nucleic acids and protein found in the nucleus of.
Protein Synthesis The Making of Proteins Using the Genetic Information Stored in DNA.
DNA Transcription and Translation Review. There are 3 types of RNA: Messenger RNA (mRNA) Ribosomal RNA (rRNA) Transfer RNA (tRNA)
Ch. 11: DNA Replication, Transcription, & Translation Mrs. Geist Biology, Fall Swansboro High School.
Gene Expression = Protein Synthesis.
Biology 1-1c Protein Synthesis.
An Introduction to Protein Synthesis
Section 3: RNA and Gene Expression
DNA vs RNA.
13.3 RNA & Gene Expression I. An Overview of Gene _____________ A. RNA
From DNA to Proteins Transcription.
Lesson starter Name the four bases found in DNA
Protein Synthesis Standards:
Transcription & Translation.
Protein Synthesis Step 2: Translation
Protein Synthesis Lecture 5
PROTEIN SYNTHESIS = CELL CONTROL
(Transcription & Translation)
Translation and Transcription
Transcription/ Translation Notes 16-17
An Overview of Gene Expression
DNA Transcription and Translation
12-3 RNA & Protein Synthesis
Unit 3: Genetics Part 1: Genetic Informaiton
The Production of Proteins by DNA
Presentation transcript:

Enabling Domain Experts to Convey Questions to a Machine: A Modified, Template-Based Approach Peter Clark (Boeing Phantom Works) Ken Barker, Bruce Porter (Univ Texas at Austin) Vinay Chaudhri, Sunil Mishra, Jerome Thomere (SRI International)

How can End-Users Pose Questions? User needs to express: –domain knowledge –questions posed to that domain knowledge Posing questions: –can be straightforward, e.g., single-task systems: “What disease does this patient have?” –or, can itself be a major “knowledge capture” challenge This talk: –How to pose questions (not how to answer them!) Start-to-Finish Knowledge Capture:

Some Example Questions… 1.When during RNA translation is the movement of a tRNA molecule from the A- to the P-site of a ribosome thought to occur? 2.What are the functions of RNA? 3.What happens to the DNA during RNA transcription? 4.In a cell, what factors affect the rate of protein production? 5.A mutation in DNA generates a UGA stop codon in the middle of the RNA coding for a particular protein. What nucleotide change has probably occurred ?

Some Previous Approaches Just allow one question to be asked –“What disease does this patient have?” –But: inappropriate for multifunctional systems Ask in Natural Language –e.g., for databases “How many employees work for Joe?” –But: lacks sufficient constraint Use Question Templates –e.g., HPKB “What risks/rewards would face/expect in taking hostage citizens of ?” –But: domain-specific

A Modified, Template-Based Approach 1.Complex questions can be factored into –the question scenario (“Imagine that…”) –query to that scenario (“Thus, what is…”) 2.The scenario contains most of the complexity –The “raw query” itself is usually simple 3.The query can be mapped into one of a small number of domain-general templates –grouped around different modeling paradigms Claims:

A Modified, Template-Based Approach  basis for a modified, template-based approach: Full Question = Scenario + Query Capture using graphical tools (Shaken) Capture with a finite set of domain-general templates

Full Question = Scenario + Query Create using a graphical “representation builder” –Select objects from an ontology –Connect them together using small library of relations –Graph converted to ground logic assertions “A DNA virus invades the cell of a multicellular organism”

Huge variety of possible queries –But can be grouped according to reasoning paradigms the KB supports  Catalog of 29 Domain-General Question Types –based on analysis of 339 cell biology questions –have a fill-in-the-blank template –“blanks” are (often complex) objects from the scenario Full Question = Scenario + Query

Paradigms and Some Templates… 1.Lookup & Simple Deductive Reasoning q2 “What is/are the function of RNA?” q4 “Is a ribosome a cytoplasmic organelle?” q6 “How many membranes are in the parts relationship to the ribosome?” 2.Discrete Event Simulation q12 “What happens to the DNA during RNA transcription?” 3.Qualitative Reasoning q25 “In cell protein synthesis, what factors affect the rate of protein production?” q26 “In RNA transcription, what factors might cause the transcription rate to increase?” 4.Analogical/Comparitive Reasoning q29 “What is the difference between procaryotic mRNA and eucaryotic mRNA?”

Question Reformulation Small set of question types  users often must re-cast original question in terms of those types For example… : “Where in a eucaryotic cell does RNA transcription take place?”  “What is/are the site of RNA transcription?” : “When is the sigma factor of bacterial RNA polymerase released with respect to RNA transcription?”  “During RNA transcription, when does the RNA polymerase | release | the sigma factor?”

Posing questions

Posing questions (cont)

Receiving Answers

Receiving Answers (cont)

Evaluation and Lessons Learned Large-scale trials in biology students used system for 4 weeks Their goals: –Encode 11-page subsection on cell biology –Test their representations using a set of 70 questions Qns expressed in English High-school level of difficulty Qns set independently, no knowledge of our templates 18 of the 29 templates implemented at time of trials

Results It works… –All 4 users able to pose most (~80%) of the qns –Answer score (average) = 2.23 (2 = “mostly correct”) –Exposes what the system is able to do …but three major challenges…

Challenges 1.Users had difficulty reformulating their questions to match a template, e.g. (Original) “Where in a eucaryotic cell does RNA transcription take place?”  (Desired) “What is/are the site of RNA transcription?”  (User) “What is RNA transcription?”  Heavy use of a few generic templates:

Challenges Reformulation is not just a rewording task Rather, requires user to view problem in terms of one of the KB’s modeling paradigms Easier for us than for the users

Challenges 2.Users need to be fluent with the graph tool and KB ontology for specifying scenarios Not an problem in this case

Challenges 3.Sometimes, the template approach breaks down Some questions require identifying the scenarios: “What kinds of final products result from mRNA?” Similarly, identifying the right viewpoint/level of detail: e.g., DNA as a line vs. sequence vs. two strands Some topics not covered by templates Uncertainty, causal event structure Diagnosis, abduction Some questions go beyond concepts in the KB “What are the building blocks of proteins?” Can’t specify “impossible objects” “Is possible?”

Summary Conveying questions can itself be a major “knowledge capture” challenge A modified, template-based approach: –Factor full questions into scenarios + templates –Templates are domain-general, and based on modeling paradigms available –Balances flexibility vs. interpretability Results: –A catalog of templates –Approach works! but with significant caveats.