1 CBioC: Collaborative Bio- Curation Chitta Baral Department of Computer Science and Engineering Arizona State University.

Slides:



Advertisements
Similar presentations
Project Supervisor: Dr. Sanath Jayasena Project Coordinator: Mr. Shantha Fernando Athukorala A.U.B Dissanayake C.P. Kumara M.G.C.P. Priyadarshana G.V.J.
Advertisements

Terrapin Trader Transformation by Oliver Stohr - Olga Kuznetsova Tyler Cordrey - Brett Holbert December 9, 2008.
1 The PageRank Citation Ranking: Bring Order to the web Lawrence Page, Sergey Brin, Rajeev Motwani and Terry Winograd Presented by Fei Li.
Cloud platforms Lead to Open and Universal access for people with Disabilities and for All WP Federating repositories of Solutions.
CBioC: Massive Collaborative Curation of Biomedical Literature Chitta Baral, Hasan Davulcu, Anthony Gitter, Graciela Gonzalez, Geeta Joshi-Tope, Mutsumi.
DESIGN AND IMPLEMENTATION OF SOFTWARE COMPONENTS FOR A REMOTE LABORATORY J. Fernandez, J. Crespo, R. Barber, J. Carretero University Carlos III of Madrid.
How to locate an online journal article within PubMed How to register an account for the library interlibrary loan system How to submit a journal article.
Dialogue – Driven Intranet Search Suma Adindla School of Computer Science & Electronic Engineering 8th LANGUAGE & COMPUTATION DAY 2009.
CADDLAB Medical Imaging on Remote Compute Servers.
CBioC: Massive Collaborative Curation of Biomedical Literature Future Directions.
Paper Outline By: Antonis Voutsinos Design of Interactive Content.
Jun Peng Stanford University – Department of Civil and Environmental Engineering Nov 17, 2000 DISSERTATION PROPOSAL A Software Framework for Collaborative.
What is adaptive web technology?  There is an increasingly large demand for software systems which are able to operate effectively in dynamic environments.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 16 Slide 1 User interface design.
5. Presentation of experimental results 5.5. Original contribution (paper) - the main outcome of scientific activities - together with patents, they can.
Špindlerův Mlýn, Czech Republic, SOFSEM Semantically-aided Data-aware Service Workflow Composition Ondrej Habala, Marek Paralič,
WIKI IN EDUCATION Giti Javidi. W HAT IS WIKI ? A Wiki can be thought of as a combination of a Web site and a Word document. At its simplest, it can be.
Valma Technical Aspects
Dr. Tom WayCSC What is Software Engineering? CSC 4700 Software Engineering Lecture 1.
Fall, Privacy&Security - Virginia Tech – Computer Science Click to edit Master title style Design Extensions to Google+ CS6204 Privacy and Security.
Framework for Automated Builds Natalia Ratnikova CHEP’03.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
What is SMEcollaborate Primarily developed for Small and Medium Companies who wish to collaborate together. It is a:- A resource center for collaborating.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
A Web/Grid Services Approach for a Virtual Research Environment Implementation Y. W. Sim, C. Wang, L. A. Carr, H. C. Davies, L. Gilbert, S. Grange, D.
Waseda Univ Nakajima Lab Interaction Group Computer-supported knowledge sharing in co-located environments Yasufumi Hirakawa, Harumi Mase, Eiji Tokunaga.
Service Computation 2010November 21-26, Lisbon.
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
WAD Web application for managing the indicators of the research activity in a university department.
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
Research Resources Eugene Tseytlin Department of Biomedical Informatics University of Pittsburgh.
Dr Jamal Roudaki Faculty of Commerce Lincoln University New Zealand.
Okalo Daniel Ikhena Dr. V. Z. Këpuska December 7, 2007.
Internet Services Introduction Expertise is a collaborative tool for knowledge sharing, interacting and group working that can be adapted to the needs.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
CMPS 435 F08 These slides are designed to accompany Web Engineering: A Practitioner’s Approach (McGraw-Hill 2008) by Roger Pressman and David Lowe, copyright.
5.5. Original contribution (paper) - the main outcome of scientific activities - together with patents, they can not be combined together at one time -
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
Employing Wikis for online collaboration in the e-learning environment: Case study 1 Raitman, R., Augar, N. & Zhou, W. (2005). Employing Wikis for online.
Cloud platforms Lead to Open and Universal access for people with Disabilities and for All WP Federating repositories of Solutions.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Software Engineering for Business Information Systems (sebis) Department of Informatics Technische Universität München, Germany wwwmatthes.in.tum.de A.
D R A T D R A T ABSTRACT Every semester each department at Iowa State University has to assign its faculty members and teaching assistants (TAs) to the.
Thomas Kern | The system documentation as binding agent for and in between internal and external customers April 24th, 2009 | Page 1 The system documentation.
RE-ENGINEERING AND DOMAIN ANALYSIS BY- NISHANTH TIRUVAIPATI.
© SERG Reverse Engineering (REportal) REportal: Reverse Engineering Portal (reportal.cs.drexel.edu)
Requirements Engineering Requirements Management Lecture-25.
Agents for Case-based software reuse Stein Inge Morisbak Web:
UK Interest & Input to the Factories of the Future Horizon 2020 Roadmap. © ActionPlant 2011.
Eurostat Report on SDMX Reference Infrastructure User Group 1 st meeting in Luxembourg Sept 2012 Item 5.2 of the agenda November 2012IT Director's.
Metadata Development in the Earth System Curator Spanning the Gap Between Models and Datasets Rocky Dunlap, Georgia Tech 5 th GO-ESSP Community Meeting.
Web application component mapping Noé Fernández. The Problem 19/08/2014Noé Fernández › Dozens of s/day › Lack of information  Users don’t know what.
OpenACS and.LRN Conference 2008 Automatic Limited-Choice and Completion Test Creation, Assessment and Feedback in modern Learning Processes Institute for.
Reference Management Module I: Introduction By Rehema Chande-Mallya(PhD)
After this course you will be able to:
<Student’s name>
Lecture 1 What is Software Engineering? CSC 4700 Software Engineering
Chapter 18 Maintaining Information Systems
Major ILS disciplines What does iSchools like SILS study?
Architecture Components
Publishing software and data
LCG Monte-Carlo Events Data Base: current status and plans
Designing Software for Ease of Extension and Contraction
CS 321: Human-Computer Interaction Design
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
Performance and Scalability Issues of Multimedia Digital Library
5. Presenting a scientific work
Name: NAMUNJI JOSHUA MUNDIA PROGRAM: BSC SYSTEMS ENGINEERING
Presentation transcript:

1 CBioC: Collaborative Bio- Curation Chitta Baral Department of Computer Science and Engineering Arizona State University

2 Agenda Introduction Using the C-BioCurator System  Overall Architecture  Installation  User Authentication  User Interaction  Text extraction systems  Existing databases System Implementation Conclusion and Future Work

3 Introduction Motivation  Our goal in this paper is to help get information nuggets of articles and abstracts and store in a database.  The challenge is that the number of articles are huge and they keep growing, and need to process natural language.  The two existing approaches human curation and use of automatic information extraction systems They are not able to meet the challenge, as the first is expensive, while the second is error-prone.

4 Introduction (cont’d) Approach: We propose a solution that is inexpensive, and that scales up.  Our approach takes advantage of automatic information extraction methods as a starting point, Based on the premise that if there are a lot of articles, then there must be a lot of readers and authors of these articles.  We provide a mechanism by which the readers of the articles can participate and collaborate in the curation of information.  We refer to our approach as “Collaborative Curation''.

5 Introduction (cont’d) Results:  We report on our system CBioC (short for Collaborative Bio-Curator) which facilitates collaborative curation. Availability:  A prototype of the web interaction version is currently available at

6 Using the C-BioCurator System Overall Architecture:  The two main components of our CBioC system are (i) the CBioC interface and (ii) the CBioC database.  The user interacts with the CBioC system through the CBioC interface, and  The curated or extracted data (from the abstracts and texts of the articles) together with the user interaction with respect to these data is stored in the CBioC database.

7 Using the C-BioCurator System (cont’d)

8 Installation and Invocation  A researcher need to download our system and install it in her computer.  Whenever the researcher accesses a web page from where she can access an article or an abstract, the CBioC system wakes up and creates an interaction frame.

9 With Web Band Version

10 Without Web Band Version

11 Using the C-BioCurator System (cont’d) User authentication  The authentication is necessary as different kinds of user are allowed different levels of interaction by our system. For example, anonymous (non-registered) users are only allowed browsing ability, and are not allowed to leave any impression (such as adding facts or voting) for the future.

12

13 Using the C-BioCurator System (cont’d) User Interaction  Past the user authentication, the CBioC uses the pubmed ID passed to search the database regarding any data about that article.  If it finds such data, it then displays them in the interaction frame, taking into account the researcher’s preferences.  It allows registered researchers to vote for the correctness of individual data tuples.

14 Using the C-BioCurator System (cont’d) Text extraction systems  We periodically run (off-line) the best available automated text extraction systems on the pubmed abstracts and store the results in the CBioC database.  If no information regarding a particular abstract is found in the CBioC database, then the information extraction systems will be run (on- line) on that abstract and the results will be displayed.

15 Using the C-BioCurator System (cont’d) Existing databases  Protein Interaction (Extracted, Exchanged (e.g., BIND))  Reference  User account  Voting

16 Implementation

17 Conclusion and Future Work we have presented a vision  that overcomes and suggests a solution to the seemingly insurmountable problem of being able to curate information nuggets from the extremely large and fast growing body of bio-medical literature. We have developed a prototype implementing our solution, and will be improved continuously. We believe that our proposed solution could really have a big impact on Bio-medical research, and hence this paper.

18 Conclusion and Future Work (cont’d) Our approach of using mass collaboration to curate bio-medical texts can be further generalized to the web as a whole (or other document repositories)  where a group of people interested in a group of documents can collaborate to extract the knowledge buried in those documents, and  simultaneously using automated extracted systems as a first step.  We refer to this as collaborative meta-web, and are working on expanding it to many other domains.