Mining Fine-grained Argument Elements Adam Wyner Department of Computing Science University of Aberdeen 4 November, 2014 CS4025, Computing Science, University.

Slides:



Advertisements
Similar presentations
Visualization Tools, Argumentation Schemes and Expert Opinion Evidence in Law Douglas Walton University of Winnipeg, Canada Thomas F. Gordon Fraunhofer.
Advertisements

Identifying Arguments for Evaluation using an Argument Explorer Jodi Schneider 1, Adam Wyner 2, Katie Atkinson 2, Trevor Bench-Capon 2 1 Digital Enterprise.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Opportunities and Challenges of Textual Big Data for the Humanities Dr. Adam Wyner, Department of Computing Prof. Barbara Fennell, Department of Linguistics.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Intelligent Systems MGMT Summer 2012 Night #7, Part 1.
Argumentation in Artificial Intelligence Henry Prakken Lissabon, Portugal December 11, 2009.
Comments on Natural Language and Argumentation Adam Wyner Department of Computer Science July 13, 2012 London Text Analytics Meetup.
© 2002 by Bryan B. ThompsonPresentation to SWIG, November 1 st, 2002 Critical Thinking On the Web The Cognitive Web Presentation to the Semantic Web Interest.
CS4025: Advanced Information Extraction. Overview CS4025, Department of Computing Science, University of Aberdeen 2 Overview of aspects of IE and General.
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
Expert System Human expert level performance Limited application area Large component of task specific knowledge Knowledge based system Task specific knowledge.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
1 / 26 Supporting Argument in e-Democracy Dan Cartwright, Katie Atkinson, and Trevor Bench-Capon Department of Computer Science, University of Liverpool,
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
PDDL: A Language with a Purpose? Lee McCluskey Department of Computing and Mathematical Sciences, The University of Huddersfield.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Common Core State Standards Professional Learning Module Series
Building Knowledge-Driven DSS and Mining Data
Programming by Example using Least General Generalizations Mohammad Raza, Sumit Gulwani & Natasa Milic-Frayling Microsoft Research.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Organizing Your Legal Analysis
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Michigan Common Core Standards
1. Human – the end-user of a program – the others in the organization Computer – the machine the program runs on – often split between clients & servers.
Jonathan Band Jonathan Band PLLC Google Library Project: Copyright Issues.
Critical and creative thinking Assessment Tool How could schools use the tool? Sharon Foster.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Applying Belief Change to Ontology Evolution PhD Student Computer Science Department University of Crete Giorgos Flouris Research Assistant.
Copyright © 2013 Curt Hill The Zachman Framework What is it all about?
Explanation Facility دكترمحسن كاهاني
1 ECE 453 – CS 447 – SE 465 Software Testing & Quality Assurance Instructor Kostas Kontogiannis.
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
The New English Curriculum September The new programme of study for English is knowledge-based; this means its focus is on knowing facts. It is.
Flexible Text Mining using Interactive Information Extraction David Milward
Slide 1 Propositional Definite Clause Logic: Syntax, Semantics and Bottom-up Proofs Jim Little UBC CS 322 – CSP October 20, 2014.
Book: Bayesian Networks : A practical guide to applications Paper-authors: Luis M. de Campos, Juan M. Fernandez-Luna, Juan F. Huete, Carlos Martine, Alfonso.
Overview of MOT Knowledge representation system : Basic Modeling Editor LexiconGrammarSemantics Pragmatics MOT Editor.
Statistics Canada Statistics Canada Statistique Canada Statistique Canada Disseminating gender statistics: The Canadian experience Heather Dryburgh, Ph.D.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Directions for Hypertext Research: Exploring the Design Space for Interactive Scholarly Communication John J. Leggett & Frank M. Shipman Department of.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
Generic Tasks by Ihab M. Amer Graduate Student Computer Science Dept. AUC, Cairo, Egypt.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ICT-enabled Agricultural Science for Development Scenarios, Opportunities, Issues by ICTs transforming agricultural science, research & technology generation.
1 MedAT: Medical Resources Annotation Tool Monika Žáková *, Olga Štěpánková *, Taťána Maříková * Department of Cybernetics, CTU Prague Institute of Biology.
Corpus-based evaluation of Referring Expression Generation Albert Gatt Ielka van der Sluis Kees van Deemter Department of Computing Science University.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
International Conference on Fuzzy Systems and Knowledge Discovery, p.p ,July 2011.
Formal Specification: a Roadmap Axel van Lamsweerde published on ICSE (International Conference on Software Engineering) Jing Ai 10/28/2003.
Argumentative Writing Grades College and Career Readiness Standards for Writing Text Types and Purposes arguments 1.Write arguments to support a.
Be.wi-ol.de User-friendly ontology design Nikolai Dahlem Universität Oldenburg.
CS223: Software Engineering
Users´ Behavior and Institutional Repositories Jela Steinerová Comenius University Bratislava
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Artificial Intelligence, simulation and modelling.
INTRODUCTION TO APPLIED LINGUISTICS
Chapter 11: Artificial Intelligence
PART IV: The Potential of Algorithmic Machines.
IB Assessments CRITERION!!!.
Henry Prakken & Giovanni Sartor July 16, 2012
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Introduction to Prolog
Outline What is Literature Review? Purpose of Literature Review
Presentation transcript:

Mining Fine-grained Argument Elements Adam Wyner Department of Computing Science University of Aberdeen 4 November, 2014 CS4025, Computing Science, University of Aberdeen

Outline 2 Problem. Pragmatic goal – extract argument elements. Approach: – flexible, interactive, incremental, modular, semi- automatic information extraction using – semantic annotations that are relative to – argumentation schemes and that support – semantic queries. Examples CS4025, Wyner, Computing Science, University of Aberdeen

What is an Argument? CS4025, Wyner, Computing Science, University of Aberdeen 3 A claim that is justified (premises) and may have exceptions (premises). A 'strict argument'. – Premises: Socrates is a man. Every man is mortal. – Claim: Therefore, Socrates is mortal. A 'defeasible argument'. – Premises: Bill is a doctor. Doctors know about heart disease. Bill told Jill that she has heart disease. – Claim: Therefore, Jill has heart disease.

What is an Argument? CS4025, Wyner, Computing Science, University of Aberdeen 4 A 'strict argument'. – Premises: Socrates is a man. Every man is mortal. – Claim: Therefore, Socrates is mortal. Given the premises, must accept the claim. A 'defeasible argument'. – Premises: Bill is a doctor. Doctors know about heart disease. Bill told Jill that she has heart disease. – Claim: Therefore, Jill has heart disease. Perhaps Bill does not have all the facts, or he is biased, or he has a second rate medical degree....

Corpora - Where Arguments Appear CS4025, Wyner, Computing Science, University of Aberdeen 5 Consumer websites. Law: policy (policy-making, voting, legislation), Supreme Court transcripts, case based reasoning, regulations. BBC's Have Your Say and Moral Maze. Medical diagnosis. Making plans. Debatepedia, Wikipedia, meeting annotations Social media: web-forums, Twitter, Facebook,....

Purposes and Goals CS4025, Wyner, Computing Science, University of Aberdeen 6 Classify/cluster texts as arguments for or against a claim. Search for and extract argument patterns and components, e.g. premises and conclusions. Reconstruct a complex network of arguments dispersed across texts. Support decision-making (e.g. inference and Abstract Argumentation). Relate arguments to provenance, to trust, to rankings/weightings, to values,.... Audience can: view static argument, contribute, interact with analytic tools.

Example: Solve Knowledge Acquisition Bottleneck for Reasoning CS4025, Wyner, Computing Science, University of Aberdeen 7 Identify and extract arguments from natural language, transform them into a knowledge base (KB), then transform the KB into a formal representation such that we can reason with inconsistent KBs. Argument Pipeline (Wyner, van Engers, Hunter 2010)

Three Stages Graph – Structured or Instantiated AFs CS4025, Wyner, Computing Science, University of Aberdeen 8 Three Stages - Caminada and Wu 2011 Knowledge Acquisition Bottleneck: time, labour, expertise to construct a KB at scale.

Logic-based Instantiated Argumentation (Besnard and Hunter (2009)) CS4025, Wyner, Computing Science, University of Aberdeen 9 An argument is an ordered pair ; ψ is a subset of a given KB and α is an atomic proposition from the KB; ψ is a minimal set of formulae such that ψ implies α, and ψ does not imply a contradiction. ψ is said to support the claim α. Where p and q are atoms, and where the KB is comprised of p and p→q, then is an argument. We could have a KB from which we can form an argument which supports ¬q,. In addition and with respect to this argument, suppose we can form an undercutter and a rebuttal. KBs (even relatively small ones) generate lots of arguments and attack relationships which can be structured in a tree.

Abstract Argumentation (Dung 1995) CS4025, Wyner, Computing Science, University of Aberdeen 10 Preferred extension: {a, c, d, h, i, k}

Zeroing In CS4025, Wyner, Computing Science, University of Aberdeen Source text Knowledge base & argumentation schemes Generated arguments (abstract or instantiated). 11

Current Tools to Extract and Structure Arguments from Text CS4025, Wyner, Computing Science, University of Aberdeen 12 Rationale, Araucaria, Carneades, IMPACT Project, Legal Apprentice,.... All manual. No NLP.

Argument Fragment for a Camera CS4025, Wyner, Computing Science, University of Aberdeen 13

Pro and Con CS4025, Wyner, Computing Science, University of Aberdeen 14

Comments on Comments CS4025, Wyner, Computing Science, University of Aberdeen 15

LIBER Response to Copyright Consultation CS4025, Wyner, Computing Science, University of Aberdeen 16 - Question 9. Should the law be clarified with respect to whether the scanning of works held in libraries for the purpose of making their content searchable on the Internet goes beyond the scope of current exceptions to copyright? - Yes. - Not all the material digitised by publishers is scanned with OCR (Optical Character Recognition) with the purpose of making the resulting content searchable. If the rights holders will not do this, libraries should be able to offer this service. It would have a transformative effect on research, learning and teaching by opening up a mass of content to users which can be searched using search engines. The interests of copyright holders will not be harmed, because the resulting output will act as marketing material for their materials.

A Problem with the Analysis of the Data CS4025, Wyner, Computing Science, University of Aberdeen 17 The arguments in these texts are: – fragmented and distributed across texts – conflicting/inconsistent statements How to find, link, and summarise the arguments? Contrast to arguments in a biomedical text or a Moral Maze discussion.

Various Argumentation Schemes 18 Patterns of defeasible reasoning: – Argument from position to know: reasoning about knowledge and individual's relation to it. – Argument from practical reasoning: reasoning about what to do. – Argument from trust from direct experience: reasoning about trusting someone. CS4025, Wyner, Computing Science, University of Aberdeen

Argumentation Schemes (Walton 1996) CS4025, Wyner, Computing Science, University of Aberdeen Patterns of presumptive (defeasible) reasoning Normalise the language. Practical Reasoning with values: – Do action (transition) because: Current circumstances - a list of literals. Consequences – a list of literals. Values (promoted, demoted, neutral wrt actions) – a list of terms. Credible Source: – Z is accepted because: X is an expert in domain Y. X stated literal Z Z is about domain Y. 19

Current Practice and Proposal CS4025, Wyner, Computing Science, University of Aberdeen 20 For consumer websites – none. For policy-making – backroom analysis of Green Paper responses. Method? Transparent? Structure? Proposal: Provide a tool for expert analysts (in context and workflow) to extract relevant material, homogenise statements, and construct/reconstruct arguments, e.g. output structured policy statements derived from source. Better understanding of what to look for and how.

Objectives 21 Identify and extract scheme information in and from across text. Explore how schemes appear in text. Address discontinuity/fragmentary presentation within sentences and across texts. Interconnect relevant, related portions of schemes from extracted information. Construct an argumentation graph for reasoning with inconsistency. CS4025, Wyner, Computing Science, University of Aberdeen

Means: A Language Of Schemes 22 Finger-print the scheme. – characteristic terminology of the scheme. – generalise the terminology to cover variation. – distinguish "domain" from "generic" terminology. – semantic annotations as "conceptual covers" over terminology. A scheme of linked schemes. Complex, flexible queries over the annotations. Use general tools to access many functionalities. CS4025, Wyner, Computing Science, University of Aberdeen

Consumer Argumentation Scheme Typed variables in schemes as targets for extraction. Premises: Camera X has property P. Property P promotes value V for agent A. Conclusion: Agent A should Action Camera X. 23 CS4025, Wyner, Computing Science, University of Aberdeen

Analyst’s Goal: Instantiate Premises: The Canon SX220 has good video quality. Good video quality promotes image quality for casual photographers. Conclusion: Casual photographers should buy the Canon SX CS4025, Wyner, Computing Science, University of Aberdeen

Annotating Text Annotate text: – Scheme's typed variables = semantic annotations. – Atomic (context free) or complex annotations (contextualised). – Colour annotations = XML. – Search for and extract text by annotation. 25 CS4025, Wyner, Computing Science, University of Aberdeen

Find Argument Passages 26 CS4025, Wyner, Computing Science, University of Aberdeen

Rhetorical Terminology 27 CS4025, Wyner, Computing Science, University of Aberdeen

Find What Is Being Discussed 28 CS4025, Wyner, Computing Science, University of Aberdeen

Domain Terminology 29 CS4025, Wyner, Computing Science, University of Aberdeen

Find Attacks Between Arguments 30 CS4025, Wyner, Computing Science, University of Aberdeen

Sentiment Terminology 31 CS4025, Wyner, Computing Science, University of Aberdeen

32 CS4025, Wyner, Computing Science, University of Aberdeen

Query For Patterns 33 Create queries on the fly from the "language" of the annotations. Extract strings that match the query. Use the results to identify useful, complex queries. Discontinuities. CS4025, Wyner, Computing Science, University of Aberdeen

An Argument To Buy The Camera Premises: The pictures are perfectly exposed. The pictures are well-focused. No camera shake. Good video quality. Each of these properties promotes image quality. Conclusion: (You, the reader,) should buy the CanonSX CS4025, Wyner, Computing Science, University of Aberdeen

An Argument NOT To Buy The Camera Premises: The colour is poor when using the flash. The images are not crisp when using the flash. The flash causes a shadow. Each of these properties demotes image quality. Conclusion: (You, the reader,) should not buy the CanonSX CS4025, Wyner, Computing Science, University of Aberdeen

Contrary Propositions The colour is poor when using the flash. For good colour, use the colour setting, not the flash. The images are not crisp when using the flash. No need to use flash even in low light. The flash causes a shadow. There is a corrective video about the flash shadow. 36 CS4025, Wyner, Computing Science, University of Aberdeen

Input to Argument Evaluators CS4025, Wyner, Computing Science, University of Aberdeen 37 Preferred extension: {a, c, d, h, i, k}

Additions 38 Verb classes, e.g. propositional attitudes/opinion terms. Fine-grained domain terminology. Adjectival and noun classes for sentiment/opinion. Ontologies. Terminology for contexts, for user classes.... CS4025, Wyner, Computing Science, University of Aberdeen

Argument Exploration 39 Flexible, interactive, incremental, modular, semi- automatic information extraction. Identify and extract scheme information in and from across text. Explore how schemes appear in text. Discontinuity/fragmentation by semantic search within sentences and across texts. Interconnect relevant, related portions of schemes from extracted information. Construct an argumentation graph for reasoning with inconsistency. CS4025, Wyner, Computing Science, University of Aberdeen

Different From 40 CS4025, Wyner, Computing Science, University of Aberdeen Not statistical NLP/Machine learning, though could use some aspects. Want to explain results, (manually) learn from error, and modify to suit (easy with lists and rules). Can be used to develop a gold standard corpora. No fixed syntactic or semantic structures.

Relates to CS4025, Wyner, Computing Science, University of Aberdeen Scheme/template filling for opinion and event mining. Issues of discontinuity and coherence. Structured argumentation. Abstract argument evaluation. Entailment graphs.

Thanks For Your Attention! Questions? Contacts: –Adam 42 CS4025, Wyner, Computing Science, University of Aberdeen