Digging by Debating (DbyD): linking massive datasets to specific arguments UK: Prof Andrew Ravenscroft (University of East London) Dr David Bourget (University.

Slides:



Advertisements
Similar presentations
1 Web Search Environments Web Crawling Metadata using RDF and Dublin Core Dave Beckett Slides:
Advertisements

ELearning Digital Curation For The Classroom A brief overview of current digital tools that can be used to collect, bookmark, sort and organize digital.
Panel: What Changes With Digital? Web Archiving ARL Forum 2009 Tracy Seneca – California Digital Library.
1/1/ A Knowledge-based Approach to Citation Extraction Min-Yuh Day 1,2, Tzong-Han Tsai 1,3, Cheng-Lung Sung 1, Cheng-Wei Lee 1, Shih-Hung Wu 4, Chorng-Shyong.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
SIG2: Ontology Language Standards WebOnt Briefing Ian Horrocks University of Manchester, UK.
Mapping Studies – Why and How Andy Burn. Resources The idea of employing evidence-based practices in software engineering was proposed in (Kitchenham.
Sentiment Analysis An Overview of Concepts and Selected Techniques.
Machine Learning and the Semantic Web
Information and Business Work
NetworkedPlanet Networked Information – Networked Knowledge Topic Maps & Web 3.0 © 2007 Networked Planet Limited. Web 3.0 Technology Platform to enable.
Understanding Goals Character Traits Literary Themes Collaboration On-going Assessments Informal & Formal Peer Feedback Teacher Feedback Self Reflection.
A Probabilistic Framework for Information Integration and Retrieval on the Semantic Web by Livia Predoiu, Heiner Stuckenschmidt Institute of Computer Science,
Erasmus University Rotterdam Frederik HogenboomEconometric Institute School of Economics Flavius Frasincar.
Measuring Scholarly Communication on the Web Mike Thelwall Statistical Cybermetrics Research Group University of Wolverhampton, UK Bibliometric Analysis.
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
CS300 Planning and Executing a Project Terry Hinton Helen Treharne.
4. Interaction Design Overview 4.1. Ergonomics 4.2. Designing complex interactive systems Situated design Collaborative design: a multidisciplinary.
Is Noesis Noetic and Why Does this Matter? Anthony F. Beavers, Ph.D. Philosophy / Cognitive Science The University of Evansville.
Overview of Web Data Mining and Applications Part I
19 April, 2017 Knowledge and image processing algorithms for real-life applications. Dr. Maria Athelogou Principal Scientist & Scientific Liaison Manager.
Digging by Debating (DbyD):
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Copyright R. Weber INFO 629 Concepts in Artificial Intelligence Fall 2004 Professor: Dr. Rosina Weber.
Artificial Intelligence What’s Possible, What’s Not, How Do We Move Forward? Adam Cheyer Co-Founder, VP Engineering Siri Inc.
Contents:  1 – Introduction to the subject of web mining and techniques  2 – Overview of research conducted (both theory and practical)  3 – Software.
The Future Of AI What’s Possible, What’s Not, How Do We Get There? Adam Cheyer Co-Founder, VP Engineering Siri Inc.
University of Malta CSA3080: Lecture 9 © Chris Staff 1 of 13 CSA3080: Adaptive Hypertext Systems I Dr. Christopher Staff Department.
Welcome to Turton High School.
Artificial Intelligence: Its Roots and Scope
Computational Scientometrics Studying science by scientific means Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information.
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
References: [1] [2] [3] Acknowledgments:
CDL-Flex Empirical Research
Interviewing and Deception Detection Techniques for Rapid Screening and Credibility Assessment Dr. Jay F. Nunamaker, Jr. Dr. Judee K. Burgoon.
©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)
Case Study Summary Link Translation entered a partner agreement with Autodesk to provide translation solutions integrating human and machine translation.
CLARIN work packages. Conference Place yyyy-mm-dd
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Data Mining for Web Intelligence Presentation by Julia Erdman.
PIER Research Methods Protocol Analysis Module Hua Ai Language Technologies Institute/ PSLC.
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
It’s all semantics! The premises and promises of the semantic web. Tony Ross Centre for Digital Library Research, University of Strathclyde
Majid Sazvar Knowledge Engineering Research Group Ferdowsi University of Mashhad Semantic Web Reasoning.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Intelligent Database Systems Lab Advisor : Dr.Hsu Graduate : Keng-Wei Chang Author : Lian Yan and David J. Miller 國立雲林科技大學 National Yunlin University of.
© The ATHENA Consortium. EM1 - Enterprise Modelling as a way to achieve Interoperability Module 3 - What interoperability problems does Enterprise.
Data for secondary analysis: the experience of the UK Data Archive Hilary Beedham UK Data Archive.
Course Overview  What is AI?  What are the Major Challenges?  What are the Main Techniques?  Where are we failing, and why?  Step back and look at.
Ask a Librarian: The Role of Librarians in the Music Information Retrieval Community Jenn Riley, Indiana University Constance A. Mayer, University of Maryland.
User Interface Design for a Large-Scale Computer Science Research Digital Library Min-Yen Kan Department of Computer Science National University of Singapore.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
Geospatial Semantic Web Interim (2.5 months) Progress Review for Phase 1 Dr. Bhavani Thuraisingham (Computer Science) Dr. Latifur Khan (Computer Science)
The Chinese Room Argument Part II Joe Lau Philosophy HKU.
Chapter 6. Inference beyond the index 2007 년 1 월 30 일 부산대학교 인공지능연구실 김민호 Text : FINDING OUT ABOUT Page. 182 ~ 251.
Katy Börner Teaching & Research Teaching & Research Katy Börner
The Impact of the Social Sciences Jane
CS276B Text Information Retrieval, Mining, and Exploitation Practical 1 Jan 14, 2003.
Ping-Tsun Chang Intelligent Systems Laboratory NTU/CSIE Using Support Vector Machine for Integrating Catalogs.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Modelling a team-based astronomy task using LAMS James Dalziel Professor of Learning Technology, and Director, Macquarie E-Learning Centre Of Excellence.
Brief Intro to Machine Learning CS539
School of Computer Science & Engineering
Done Done Course Overview What is AI? What are the Major Challenges?
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Topic 2 Computational Approaches to Argumentation Mining in Natural Language Text 11/23/2018 Huy V. Nguyen.
Overview of Machine Learning
How To Extend the Training Data
Presentation transcript:

Digging by Debating (DbyD): linking massive datasets to specific arguments UK: Prof Andrew Ravenscroft (University of East London) Dr David Bourget (University of London) Prof Chris Reed (University of Dundee) US: Prof Colin Allen & Prof Katy Borner (Indiana University) …all together

DbyD Overview 1. The Problem 2. InterDebates: Making a shovel 3. Work and progress 4. Challenges, issues and solutions

Map argument dialogue and argumentative search semantics against argument structures implicit within and across texts (Philosophy domain) …harmonise human and machine semantics …extract and interrogate argument relations …deeper dialectic understanding of subjects …find new relations within and across texts (e.g. penetration of science and philosophy)

InterDebates: Making a shovel Make a shovel through integrating, adapting and extending existing technologies 1. Visualisation/Science Mapping: SCiVis (Indiana) 2. Data: Hathi Trust, PhilPapers, SEP, InPho (Indiana & London) 3. Interfaces: PhilPapers, Dialogue Games/InterLoc (UK) 4. Argument structure and analysis: Dialogue Games, Araucaria, AIF (London, Dundee) …how marshall these to interrogate massive document collections in argument-driven ways? …quite ambitious!

,

Citation networks: InPho, SEP

Digital Dialogue Games and InterLoc

OVA: Online Argument Analysis

Work and progress Small-scale studies using argument analysis and representation techniques applied to exemplar texts (in Animal Intelligence) AI simulation of argument mining process Argument mining literature review  …Key Insights and Implications for ongoing work

Marking up and mapping example texts Small-scale studies using OVA to map arguments in example texts Animal Psychology from Hathi Trust Collection Philosophers: Perform their own markup of important arguments in key texts Externalise the ‘algorithm’ they used to perform the markup Can OVA represent the arguments in a way that Philosophers want? Can we automate their algorithms or approximate them using combination of statistical, syntactic and semantic means?

AI Simulation of Argument Mining A hand simulation of a feasible machine analysis of an example text (a 10 page extract from Comparative Psychology) Uses syntactic parsing and rewrite rules into pre-formed argument patterns - results were mixed and fragmentary In example text obvious premises to the argument often missing - intelligent reader assumed by author Argument results are fragmentary - needs linking up - but require semantics of the argument to do this What arguments are important? What can be ignored? Cannot proceed further without sophisticated semantic layer to interpret results Would need to create 'ontology of argument' plus domain knowledge (to decide importance) - a big task

Argument Mining Literature Review Extracting meaningful argument information from natural text remains a major challenge facing computer science and AI Key works on argument mining are: Moens et al., 2007; Palau and Moens, 2009; Mochales and Moens, 2011; Feng and Hirst, 2011; Mainly classification using maximum entropy model and support vector machine % accuracy in legal cases Also considered works on opinion mining (e.g. Pang et al., 2002), citation mining (e.g. Teufel et al., 2006), and argumentative zoning (e.g. Teufel et al., 2009) Extra information, for example on argument scheme features, can improve accuracy

Implications for the Workplan May be possible to auto-identify argumentative blocks of text, using statistical / training techniques e.g Argumentative Zoning Also possible to auto-select only those blocks dealing with pre-specified topics thought to be important No way to reliably auto-extract 'core' arguments from texts without a sophisticated semantic layer (a large task!) Therefore use human experts to extract 'core' information - provide a tool that helps them be very efficient at it A tool could help with -  locating key blocks of argument in a text  indicating author's 'position' with regard to cited texts  associating 'core' argument(s) with the text We will need volunteers to process lots of texts to provide training sets for machine learning

Challenges, issues and solutions Ambitious problem + complicated international collaboration = hard work! Managing the collaboration How perform a joint project with independent ‘halves ‘ that have different funding and project management models? How hold regular working meetings to maintain national and international co- operation and joint working with limited cross-over in typical working hours? Consortium agreement How procure a consortium agreement that covers a joint project but is only ‘legally binding’ on one half? How articulate data access issues that are evolving as the project progresses? How to specify the joint working in a way that captures joint responsibility and also ‘legal’ independence? Data Access and Use How dig into data that is freely available to one partner (the US) but restricted to the other (UK)? e.g. Hathi Trust Research Collection (in Google Books)

Challenges, issues and solutions Managing the collaboration How perform a joint project with independent ‘halves ‘ that have different funding and project management models? How hold regular working meetings to maintain national and international co- operation and joint working with limited cross-over in typical working hours? Extended project planning phase (WP1 ) Continued problematisation and scoping phase, the outcome of which is a revised joint workplan (WP2) Had two ‘opportunistic’ f2f meetings in Europe, during the first we agreed a skype meeting regime for the life of the project: - UK meetings every second and last Tuesday at UK/US meetings every first and third Tuesday 1400/0900

Challenges, issues and solutions Consortium agreement How procure a consortium agreement that covers a joint project but is only ‘legally binding’ on one half? How articulate data access issues that are evolving as the project progresses? How to specify the joint working in a way that captures joint responsibility and also ‘legal’ independence? UK only consortium agreement, following JISC prototype, plus link to related documents covering: - UK/US joint working - Data access protocols …UK finance etc., covered by consortium agreement with joint working and data access covered by separate but related documents

Challenges, issues and solutions Data Access and Use How dig into data that is freely available to one partner (the US) but restricted to the other (UK)? e.g. Hathi Trust Research Collection (in Google Books)? 2 phase solution: 1. Immediate work-around – limited access to Data at Indiana via specific computers/ip addresses at UK site 2. UK partners negotiate equivalent data access directly with HTRC with help from US partners

Contacts Summary of progress: " Though this be madness, yet there is method in it“ Hamlet