ANALYSIS OF INTER-ANNOTATOR AGREEMENT (TEXT MINING & REG. ANNOTATION) RegCreative Jamboree, Friday, December, 1st, (2006) MARTIN KRALLINGER, 2006 TEXT.

Slides:



Advertisements
Similar presentations
1 OOA-HR Workshop, 11 October 2006 Semantic Metadata Extraction using GATE Diana Maynard Natural Language Processing Group University of Sheffield, UK.
Advertisements

Standardized Scales.
Annotation standards in ORegAnno (Draft) Obi Griffith The RegCreative Jamboree Nov 29, 2006 Ghent, Belgium.
PREDetector : Prokaryotic Regulatory Element Detector Samuel Hiard 1, Sébastien Rigali 2, Séverine Colson 2, Raphaël Marée 1 and Louis Wehenkel 1 1 Bioinformatics.
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
The Research Consumer Evaluates Measurement Reliability and Validity
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
LESSONS FROM THE BIOCREATIVE PROTEIN- PROTEIN INTERACTION (PPI) TASK RegCreative Jamboree, Friday, December, 1st, (2006) MARTIN KRALLINGER, 2006 LESSONS.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
A New Suffix Tree Similarity Measure for Document Clustering Hung Chim, Xiaotie Deng City University of Hong Kong WWW 2007 Session: Similarity Search April.
Sunita Sarawagi.  Enables richer forms of queries  Facilitates source integration and queries spanning sources “Information Extraction refers to the.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Methodologies for Evaluating Dialog Structure Annotation Ananlada Chotimongkol Presented at Dialogs on Dialogs Reading Group 27 January 2006.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Empirically Assessing End User Software Engineering Techniques Gregg Rothermel Department of Computer Science and Engineering University of Nebraska --
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
Quality-driven Integration of Heterogeneous Information System by Felix Naumann, et al. (VLDB1999) 17 Feb 2006 Presented by Heasoo Hwang.
A Framework for Named Entity Recognition in the Open Domain Richard Evans Research Group in Computational Linguistics University of Wolverhampton UK
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
New Hampshire Enhanced Assessment Initiative: Technical Documentation for Alternate Assessments Alignment Inclusive Assessment Seminar Brian Gong Claudia.
Mining the Medical Literature Chirag Bhatt October 14 th, 2004.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
Cis-Regulatory/ Text Mining Interface Discussion.
Information Retrieval in Practice
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
C OLLECTIVE ANNOTATION OF WIKIPEDIA ENTITIES IN WEB TEXT - Presented by Avinash S Bharadwaj ( )
1 Statistical NLP: Lecture 10 Lexical Acquisition.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
MEASUREMENT CHARACTERISTICS Error & Confidence Reliability, Validity, & Usability.
Modeling Documents by Combining Semantic Concepts with Unsupervised Statistical Learning Author: Chaitanya Chemudugunta America Holloway Padhraic Smyth.
Learner corpus analysis and error annotation Xiaofei Lu CALPER 2010 Summer Workshop July 13, 2010.
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
Incident Threading for News Passages (CIKM 09) Speaker: Yi-lin,Hsu Advisor: Dr. Koh, Jia-ling. Date:2010/06/14.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
Latent Semantic Analysis Hongning Wang Recap: vector space model Represent both doc and query by concept vectors – Each concept defines one dimension.
Bioinformatics: Theory and Practice – Striking a Balance (a plea for teaching, as well as doing, Bioinformatics) Practice (Molecular Biology) Theory: Central.
Recognizing Names in Biomedical Texts: a Machine Learning Approach GuoDong Zhou 1,*, Jie Zhang 1,2, Jian Su 1, Dan Shen 1,2 and ChewLim Tan 2 1 Institute.
Relevance Detection Approach to Gene Annotation Aid to automatic annotation of databases Annotation flow –Extraction of molecular function of a gene from.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Selecting a Sample. Sampling Select participants for study Select participants for study Must represent a larger group Must represent a larger group Picked.
Developing a Review Protocol. 1. Title Registration 2. Protocol 3. Complete Review Components of the C2 Review Process.
Presented By Dr / Said Said Elshama  Distinguish between validity and reliability.  Describe different evidences of validity.  Describe methods of.
Extracting Keyphrases to Represent Relations in Social Networks from Web Junichiro Mori and Mitsuru Ishizuka Universiry of Tokyo Yutaka Matsuo National.
A Repetition Based Measure for Verification of Text Collections and for Text Categorization Dmitry V.Khmelev Department of Mathematics, University of Toronto.
Mining the Biomedical Research Literature Ken Baclawski.
Bioinformatics and Computational Biology
1 Masters Thesis Presentation By Debotosh Dey AUTOMATIC CONSTRUCTION OF HASHTAGS HIERARCHIES UNIVERSITAT ROVIRA I VIRGILI Tarragona, June 2015 Supervised.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
UWMS Data Mining Workshop Content Analysis: Automated Summarizing Prof. Marti Hearst SIMS 202, Lecture 16.
Opportunities for Text Mining in Bioinformatics (CS591-CXZ Text Data Mining Seminar) Dec. 8, 2004 ChengXiang Zhai Department of Computer Science University.
Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation Bioinformatics, July 2003 P.W.Load,
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
Word Sense and Subjectivity (Coling/ACL 2006) Janyce Wiebe Rada Mihalcea University of Pittsburgh University of North Texas Acknowledgements: This slide.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
BioCreAtIvE Critical Assessment for Information Extraction in Biology Granada, Spain, March28-March 31, 2004 Task 2: Functional annotation of gene products.
Automatic Writing Evaluation
You Can’t Afford to be Late!
Concept Grounding to Multiple Knowledge Bases via Indirect Supervision
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Test Validity.
Erasmus University Rotterdam
Extracting Semantic Concept Relations
Semantic Similarity Methods in WordNet and their Application to Information Retrieval on the Web Yizhe Ge.
Ass. Prof. Dr. Mogeeb Mosleh
Statistical NLP: Lecture 10
Presentation transcript:

ANALYSIS OF INTER-ANNOTATOR AGREEMENT (TEXT MINING & REG. ANNOTATION) RegCreative Jamboree, Friday, December, 1st, (2006) MARTIN KRALLINGER, 2006 TEXT MINING & REG. ANNOTATION

MAIN ASPECTS  Explore annotation overlap  Discuss variability in annotation  Text mining and regulatory element annotation: needs, limits, tasks MARTIN KRALLINGER, 2006 TEXT MINING & REG. ANNOTATION

SOCIOLOGY OF GENOME ANNOTATION (Lincoln Stein 2001) MARTIN KRALLINGER, 2006  Models of annotation  Museum model: small group of specialized curators  Jamboree model: a group of biologists and bioinformaticians come together for a short intensive annotation workshop  Cottage industry: decentralized effort of annotators among the recruited community  Factory model: highly automated methods (Elsik et al, 2006) TEXT MINING & REG. ANNOTATION

MARTIN KRALLINGER, 2006 WHY PRE-JAMBOREE QUEUE?  Get familiar with annotation system (before jamboree)!  Understand content and annotation strategy of Oreganno  Detect aspects which require improvements such as incompleteness, ambiguity or wrong structures in annotation strategy, guidelines or documentation -> active Feedback (Questionnaire and wiki)  Assess consistency of the current annotation procedures  Explore which aspects affect annotation agreement  Estimate difficulty of task (alternative interpretation, uncertainty, etc,..) TEXT MINING & REG. ANNOTATION

SIMILARITY MEASURES MARTIN KRALLINGER, 2006  Similarity calculation popular subject in computer science  Different entities considered:  Feature vectors: Alignment, Cosine, Dice, Euclidean, …  Strings or sequences of strings (text): averaged String Matching, TFIDF  Sets: Jaccard, Loss of Information, Resembalance  Sequences: Levensthein Edit Distance  Trees: Bottom-up/Top-down Maximum Common Subtree, Tree Edit Distance  Graphs: Conceptual Similarity, Graph Isomorphism, Subgraph Isomorphism, Maximum Common Subgraph Isomorphism, Graph Isomorphism Covering, Shortest Path  Information theory: Jiang & Conrath, Lin, Resnik  Bioinformatics: sequence similarity, structural similarity, similarity of gene expression  Here similarity between human annotations ( refer to SimPack project examples) TEXT MINING & REG. ANNOTATION

MEASUREMENT OF OBSERVER AGREEMENT MARTIN KRALLINGER, 2006  Assumption when independent annotators agree they are correct ?!  Statistical agreement measures for categorical data  Overall proportion of agreement  Pairwise comparison; Cohen’s kappa; Pearson Chi-square  Weighted kappa for multiple categories  High accuracy implies high agreement  Kappa sometimes is inconsistent with accuracy measured as AROC Measurement of Observer agreement Kundel and Polansky, Statistical concepts Series (2003) Kappa coefficient TEXT MINING & REG. ANNOTATION

ANNOTATOR AGREEMENT FOR WSD MARTIN KRALLINGER, 2006 A case Study on Inter-Annotator Agreement for Word Sense Disambiguation, Ng et al  Word Sense Disambiguation (WSD) a central problem in NLP  WSD: discerning the meaning of a word in context  Two human annotators may disagree in their sense assignment  Agreement of human annotators often the baseline for evaluation of automated approaches  Case study using more than 30,000 instances of the most frequently occurring nouns and verbs in English  Sense tagged word in sentences manually by two groups of annotator to WordNet  Used the Kappa score to measure inter-annotator agreement considering effect of chance agreement  Difficult to achieve high agreement when they have to assign refined sense tags  Importance of example sentences for the usage of each word sense TEXT MINING & REG. ANNOTATION

AGREEMENT OF SPEECH CORPORA MARTIN KRALLINGER, 2006  Phonetically annotated speech corpora  Quality of manual annotations affected by:  Implicit incoherence: labeling incoherent due to human variability in perceptual capacities and other factors  Lack of consensus on coding schema: manual annotations reflect the variability of the interpretation and application of the coding schema by the annotators  Annotator characteristics: individual characteristics of coders such as familiarity with the material, amount of former training, motivation, interest and fatigue induced errors Measuring the reliability of Manual annotations of Speech corpora, Gut and Bayerl TEXT MINING & REG. ANNOTATION

CHALLENGES FOR OREGANNO ANNOTATION MARTIN KRALLINGER, 2006  Complexity of gene regulation  Need of ontologies and lexical resources  Deep inference of domain expert curators  Spatial, temporal, experimental conditions  Range of entity types: genes, regulatory sequences, proteins  Gene family and individual gene member distinction  TF binding site sequence extraction and mapping to genome  TF mapping to normalized database entries (NCBI, Ensembl)  Archeology-like annotation: annotation of old papers  BUT GENE REGULATION IS ONE OF THE MAIN BIOLOGICAL INFORMATION (ANNOTATION) ASPECTS! TEXT MINING & REG. ANNOTATION

SOURCES FOR ANNOTATION VARIABILITY (1) MARTIN KRALLINGER, 2006  Curator background (biologist, bioinformatician,...)  Familiarity with the annotation system  Number of previously annotated papers or proteins  Prior knowledge on the regulated gene or TF  Prior knowledge (experience) on the experimental types  Sub-domain knowledge (e.g. developmental biology or OS)  Publication date (reflect the state of knowledge) TEXT MINING & REG. ANNOTATION

SOURCES FOR ANNOTATION VARIABILITY (2) MARTIN KRALLINGER, 2006  Nr. of papers annotated the same day (fatigue effect)  Unclear or partial documentation of certain annotation aspects  Annotation type (ontology of annotation types?, CV?)  Nr. of pages, figures, tables, references,…  Consultation of additional resources (material, databases, web)  Different degrees of granularity in annotation  Differences in the recall of manually extracted annotations (all ?)  Sequence (paper/database, strand, typos, length) TEXT MINING & REG. ANNOTATION

REGCREATIVE CASE STUDY: PREJAMBOREE (1) MARTIN KRALLINGER, 2006  Relatively few articles -> only exploratory examination  Annotation type: 9/11 ( , : RR vs. TFBS)  Considerable difference in average nr. of annotations/paper  Some only extracted a single annotation others basically every annotation mentioned in the paper  Almost perfect agreement in organism source (1 case of human and mouse disagreement), but genes correct!  Very high agreement on the gene names, only few user defined cases (which are difficult to evaluate) TEXT MINING & REG. ANNOTATION

REGCREATIVE CASE STUDY: PREJAMBOREE (2) MARTIN KRALLINGER, 2006  Certain disagreement in TF names, many are user defined!  Evidence class: high agreement many Transcription regulator site, and unknown  Evidence type: high agreement, some more complete than others, (again, some annotate all the types others only some of them)  Evidence sub-type: similar to evidence types, but in general a little lower agreement than for the evidence type. TEXT MINING & REG. ANNOTATION

Transcription names factor: PREJAMBOREE MARTIN KRALLINGER, 2006 TEXT MINING & REG. ANNOTATION

MARTIN KRALLINGER, 2006 Example case 1: TF annotation variance TEXT MINING & REG. ANNOTATION Curator B UNKNOWN USER DEFINED Curator B AP-1 USER DEFINED Curator A c-Rel/p65 heterodimer USER DEFINED Curator A UNKNOWN USER DEFINED A B

MARTIN KRALLINGER, 2006 Example case 2: TF annotation variance TEXT MINING & REG. ANNOTATION Curator A Tcf1 NCBI Curator B Tcf1 NCBI Curator B C\EBP family USER DEFINED Curator B C\EBP and NF-1 USER DEFINED Curator B Tcf1 NCBI Curator B UNKNOWN USER DEFINED

MARTIN KRALLINGER, 2006 Example case: difference in evidence types A B A B Curator A REGULATORY REGION Curator B TRANSCRIPTION FACTOR BINDING SITE Curator A REGULATORY REGION Curator B TRANSCRIPTION FACTOR BINDING SITE TEXT MINING & REG. ANNOTATION

MARTIN KRALLINGER, Curator A TRANSCRIPTION FACTOR BINDING SITE Col1a2 UNKNOWN TCCAAACTTGGCAAGGGCGAGA CLASS:OREGEC00001 TYPE: OREGET00003 SUBTYPE:OREGES00015 CLASS:OREGEC00001 TYPE: OREGET00001 SUBTYPE:OREGES00003 Curator B 1->TRANSCRIPTION FACTOR BINDING SITE Col1a2 Nfia TTCCAAACTTGGCAAGGGCGAGAGAGGGCGA CLASS:OREGEC00001 TYPE: OREGET00003 SUBTYPE:OREGES00033 CLASS:OREGEC00001 TYPE: OREGET00003 SUBTYPE:OREGES00015 CLASS:OREGEC00002 TYPE: OREGET00001 SUBTYPE:OREGES00003 Different amount of annotation extracted A B A B TEXT MINING & REG. ANNOTATION

REGCREATIVE CASE STUDY: JAMBOREE MARTIN KRALLINGER, 2006  Intensive annotation strategy: face to face with other curators and expert annotators  Get direct feedback and provide suggestions  Promote integration of additional aspects in the annotation structure as well as annotated information types  Populate the database with new annotation records  Explore efficient curation training strategies  Create Gold Standard collection of annotation records, maybe useful to allow example-based annotation training/evaluation  Explore demands of biologists / curators to text mining community - > where it would be useful TEXT MINING & REG. ANNOTATION

REGCREATIVE CASE STUDY: POST-JAMBOREE MARTIN KRALLINGER, 2006  Monitor improvements in the annotation consistency  Allow consistent community-based annotation  Promote integration of additional aspects in the annotation structure as well as annotated information types  Increase efficiency in populating the database  Construction for text mining training collection TEXT MINING & REG. ANNOTATION

ANNOTATION CONSISTENCY MARTIN KRALLINGER, 2006 TEXT MINING & REG. ANNOTATION For selection as relevant paper for curation For the evidence class For the evidence types For the evidence subtypes For the regulated genes For the transcription factors For cell types How to structure comments Other aspects:...

TEXT MINING TASKS FOR GENE REGULATION EXTRACTION MARTIN KRALLINGER, 2006 TEXT MINING & REG. ANNOTATION  Detection of relevant articles: abstracts or full text  Extraction of ranked list of regulated genes: mention or normalized gene (database entries)  Extraction of ranked list if TF  Extraction of ranked list of evidence type IDs together with name and text passage (sentence)  Extraction of ranked associations between these genes and TF  Extraction of associations to other controlled vocabularies or ontologies