1 Web 2.0 and Grids for Scholarly Research Peking University July 27 2006 Geoffrey Fox Computer Science, Informatics, Physics Pervasive Technology Laboratories.

Presentation transcript:

1 Web 2.0 and Grids for Scholarly Research. Peking University, July 27 2006. Geoffrey Fox, Computer Science, Informatics, Physics, Pervasive Technology Laboratories, Indiana University, Bloomington IN

2 Application Drivers. Science Informatics for document analysis, as in the case of chemistry, which has very precise naming rules for compounds that allow accurate searches in documents. Suggesting how to tag scientific documents, either while writing them or after the fact. The journal web site of the future, as illustrated by Nature building the social bookmarking tool Connotea. Conference support tools, which can benefit from features needed by journals. Together these give document-enhanced Cyberinfrastructure (CI).

3 Community Tools. List-serves are the oldest and best used. Kazaa, instant messengers, Skype, Napster and BitTorrent support P2P collaboration: text, audio-video conferencing and files. del.icio.us, Connotea, Citeulike, Bibsonomy and Biolicious manage shared bookmarks. MySpace, Bebo, Hotornot, Facebook and similar sites allow you to create (upload) community resources and share them; Friendster and LinkedIn create networks. Writely, wikis and blogs are powerful specialized shared document systems. ConferenceXP and WebEx share general applications. Google Scholar tells you who has cited your papers, while publisher sites tell you about co-authors; Windows Live Academic Search has similar goals. Note that sharing resources creates (implicit) communities. Social network tools study graphs both to define communities and to extract their properties.

4 How to use Web 2.0 Community tools in CI. Nearly all of them have “profiles”, “users”, “groups”, “friends” etc.; these need to be integrated. P2P file sharing: maybe this is useful for sharing files in research groups (virtual organizations). Will modify Maze, a popular Chinese social P2P system with 2.5 million users (http://maze.pku.edu.cn). BitTorrent: more popular than FTP, so why not use it for higher-performance, fault-tolerant cached file sharing? MySpace etc.: could consider a MyGridSpace or MyScienceSpace that supports a similar document sharing model, with users uploading pictures, papers and even data/services of interest; uploaded material could be included in workflows. Social bookmarking and linking: discussed later.

5 [Architecture diagram: Document-enhanced Cyberinfrastructure] Diagram labels: Existing User Interface; Existing Document-based Research Tools (Google Scholar, Manuscript Central, Science.gov, Windows Live Academic Search, Citeseer, CMT Conference Management); Web service Wrappers; New Document-enhanced Research Tools; Integration/Enhancement User Interface; Community Tools (CiteULike, Connotea, Del.icio.us, Bibsonomy, Biolicious); Generic Document Tools; MyResearch Database; Bibliographic Database; Export (RSS, Bibtex, Endnote etc.); PubChem; PubMed; Traditional Cyberinfrastructure.

6 Strategy. It doesn’t seem useful to build the 251st community tool. In fact, a major barrier to the use of existing tools is the question of what happens when a better tool comes along and/or the chosen tool disappears (unsupported or removed from the Web). So assume we use existing tools but wrap them all as web services, so that information can be transferred to new tools and integrated between tools. This needs some “glue” logic, a “unification” database and a minimal user interface. Bookmarking tools: del.icio.us, Connotea, CiteULike (includes plug-ins to major publisher sites). Document tools: Google Scholar, Windows Live, Citeseer tools, OSCAR3 for Chemistry, Science.gov (later). Journals: Manuscript Central. Conferences: CMT from Microsoft or ?
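The wrap-and-unify strategy can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `Bookmark` record, the `InMemoryService` stand-in (a real wrapper would call each tool's web interface) and the `UnificationStore` class are hypothetical names, not part of any of the tools listed.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class Bookmark:
    """Tool-independent bookmark record (hypothetical schema)."""
    url: str
    title: str = ""
    tags: Set[str] = field(default_factory=set)


class InMemoryService:
    """Stand-in for one wrapped tool; a real wrapper would query the tool itself."""

    def __init__(self, name: str, data: Dict[str, List[Bookmark]]):
        self.name = name
        self._data = data

    def fetch(self, user: str) -> List[Bookmark]:
        return list(self._data.get(user, []))


class UnificationStore:
    """The "unification" database: merges records from several tools, keyed by URL."""

    def __init__(self) -> None:
        self.records: Dict[str, Bookmark] = {}
        self.sources: Dict[str, Set[str]] = {}

    def ingest(self, service: InMemoryService, user: str) -> None:
        for bm in service.fetch(user):
            merged = self.records.setdefault(bm.url, Bookmark(bm.url, bm.title))
            merged.title = merged.title or bm.title  # keep the first non-empty title
            merged.tags |= bm.tags                   # union of tags across tools
            self.sources.setdefault(bm.url, set()).add(service.name)


# Two tools hold the same paper with different tags.
delicious = InMemoryService(
    "del.icio.us", {"alice": [Bookmark("http://example.org/p1", "Paper 1", {"grid"})]}
)
connotea = InMemoryService(
    "Connotea", {"alice": [Bookmark("http://example.org/p1", "", {"web2.0"})]}
)
store = UnificationStore()
store.ingest(delicious, "alice")
store.ingest(connotea, "alice")
```

Because every tool is reduced to the same `fetch` interface, a tool that disappears can be replaced by writing one new wrapper, while the merged records stay in the store.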

7 Delicious Semantic Web/Grid. Purchased by Yahoo for ~$30M (Nature). Associate metadata with bookmarks specified by URLs or DOIs (Digital Object Identifiers). Users add comments and keywords (called tags). Users are linked together into groups (communities). Information such as title and authors is extracted automatically from some sites (PubMed, ACM, IEEE, Wiley etc.). CiteULike holds additional BibTeX-like information. This is perhaps the de facto Semantic Web, remarkable for its simplicity.
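As a rough illustration of this data model, the sketch below stores annotations as (user, resource, tags, comment) records and answers two simple questions: which resources carry a tag, and which other users bookmarked the same resources. The class and field names are hypothetical, and the DOI shown is a made-up placeholder; none of this reflects the internal design of del.icio.us, Connotea or CiteULike.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import FrozenSet, Set


@dataclass(frozen=True)
class Annotation:
    user: str
    resource: str          # a URL or a DOI string
    tags: FrozenSet[str]
    comment: str = ""


class TagIndex:
    """Flat tag index plus a resource -> users map."""

    def __init__(self) -> None:
        self._by_tag = defaultdict(set)
        self._users_by_resource = defaultdict(set)

    def add(self, a: Annotation) -> None:
        for tag in a.tags:
            self._by_tag[tag.lower()].add(a.resource)  # tags compared case-insensitively
        self._users_by_resource[a.resource].add(a.user)

    def resources_for(self, tag: str) -> Set[str]:
        return set(self._by_tag.get(tag.lower(), set()))

    def co_bookmarkers(self, user: str) -> Set[str]:
        """Users who share at least one bookmarked resource with `user`."""
        others: Set[str] = set()
        for users in self._users_by_resource.values():
            if user in users:
                others |= users
        others.discard(user)
        return others


idx = TagIndex()
idx.add(Annotation("alice", "doi:10.1234/example", frozenset({"Grid", "web2.0"}), "nice overview"))
idx.add(Annotation("bob", "doi:10.1234/example", frozenset({"grid"})))
idx.add(Annotation("carol", "http://example.org/x", frozenset({"chemistry"})))
```

Here `idx.co_bookmarkers("alice")` finds bob through the shared resource, which is one simple way an implicit community can emerge from shared bookmarks.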

8 Connotea

9 Connotea queried by SERVOGrid

10 Document-enhanced Cyberinfrastructure, aka Semantic Scholar Grid I. Citeseer and Google Scholar scour the Internet and analyze documents for incidental metadata: the title, author and institution of documents, and citations with their own metadata, allowing one to match them to other documents. Science.gov extracts metadata from many US Government databases. These capabilities are sure to become more powerful and to be extended: give a “Citation Index” in real time; tell you all authors of all papers that cite a paper that cites you, etc. (note it’s a small world, so don’t go too far in link analysis); tell you all citations of all papers in a workshop.
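The “authors of all papers that cite a paper that cites you” query is a bounded walk over the citation graph. The sketch below is one possible way to express it, assuming the graph is already available as two plain dictionaries (`cited_by` and `authors`, both hypothetical); the depth limit reflects the small-world caution above.

```python
from collections import deque
from typing import Dict, Iterable, Set


def citing_authors(
    cited_by: Dict[str, Iterable[str]],
    authors: Dict[str, Iterable[str]],
    start: str,
    max_depth: int = 2,
) -> Set[str]:
    """Collect authors of papers reachable from `start` by following
    'is cited by' links at most `max_depth` times (breadth-first)."""
    seen = {start}
    queue = deque([(start, 0)])
    found: Set[str] = set()
    while queue:
        paper, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not expand further: the graph is a small world
        for citing in cited_by.get(paper, ()):
            if citing not in seen:
                seen.add(citing)
                found |= set(authors.get(citing, ()))
                queue.append((citing, depth + 1))
    return found


# Toy graph: A cites "mine", B cites A, C cites B.
cited_by = {"mine": ["A"], "A": ["B"], "B": ["C"]}
authors = {"A": ["Ann"], "B": ["Bob"], "C": ["Cy"]}
two_hops = citing_authors(cited_by, authors, "mine", max_depth=2)
one_hop = citing_authors(cited_by, authors, "mine", max_depth=1)
```

With `max_depth=2` the walk stops before reaching paper C, so Cy is not included.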

11 Document-enhanced Cyberinfrastructure, aka Semantic Scholar Grid II. It is natural to develop core document services such as those used in Citeseer/Google Scholar but applied to “your” documents of interest that may not have been processed yet, perhaps because they were just submitted to a conference. These tools can help form useful lists, such as the authors of all cited papers or of all papers submitted to a journal. OSCAR2/3 (from Peter Murray-Rust’s group at Cambridge) augments the application-independent “core” metadata (title, authors, institutions, citations) with a list of all chemical terms. This tool is a service that can be applied to “your” document or to a set of documents harvested in some fashion. Other fields have natural application-specific metadata, and OSCAR-like tools can be developed for them. Such high-value tools could appear on “publisher” sites of the future (or else publishers will disappear).

12 OSCAR3 Service from Cambridge UK. Oscar3 is a tool for shallow, chemistry-specific natural language parsing of chemical documents (i.e. journal articles). It identifies (or attempts to identify): chemical names (singular nouns, plurals, verbs etc., also formulae and acronyms); chemical data (spectra, melting/boiling point, yield etc. in experimental sections); and other entities (things like N(5)-C(3) and so on). It uses SMILES, InChI and CML. There is a larger effort, SciBorg, in this area.

13 OSCAR2 Chemistry Document analysis. It detects “magic” chemical strings in text and then: stores them as metadata associated with the document; queries ChemInformatics repositories to tell you lots of information about the identified compounds; and tells you which other documents contain this compound.
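The detect, store and cross-reference steps can be sketched as follows. This is not how OSCAR works: the real tool does chemistry-specific language parsing, whereas this toy version only matches a tiny hard-coded list of compound names (`KNOWN_COMPOUNDS`, an assumption) and omits the repository query step. It is meant only to show the shape of the metadata and the inverted index.

```python
import re
from collections import defaultdict
from typing import Dict, Set

# Toy dictionary standing in for real chemical name recognition (assumption).
KNOWN_COMPOUNDS = ("benzene", "ethanol", "acetone")
PATTERN = re.compile(
    r"\b(" + "|".join(map(re.escape, KNOWN_COMPOUNDS)) + r")\b", re.IGNORECASE
)


def extract_compounds(text: str) -> Set[str]:
    """Return the known compound names mentioned in `text`, lower-cased."""
    return {m.group(1).lower() for m in PATTERN.finditer(text)}


class CompoundIndex:
    def __init__(self) -> None:
        self.metadata: Dict[str, Set[str]] = {}  # document -> compounds found in it
        self._docs = defaultdict(set)            # compound -> documents mentioning it

    def add_document(self, doc_id: str, text: str) -> None:
        compounds = extract_compounds(text)
        self.metadata[doc_id] = compounds
        for compound in compounds:
            self._docs[compound].add(doc_id)

    def documents_sharing(self, doc_id: str) -> Dict[str, Set[str]]:
        """For each compound in `doc_id`, list the other documents that mention it."""
        result: Dict[str, Set[str]] = {}
        for compound in self.metadata.get(doc_id, set()):
            others = self._docs[compound] - {doc_id}
            if others:
                result[compound] = others
        return result


index = CompoundIndex()
index.add_document("d1", "Benzene was dissolved in ethanol.")
index.add_document("d2", "Ethanol and acetone were mixed.")
index.add_document("d3", "No chemistry here.")
```

Calling `index.documents_sharing("d1")` reports that d2 also mentions ethanol, which corresponds to the “which other documents contain this compound” step.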

14 Clustering Documents from chemical properties

15 Provenance and Delicious CI. We can use a del.icio.us-style interface to annotate application data with (extra) provenance and user comments of any type (describing the quality of the data, or a keyword relating different data, etc.). All data should be labeled by a URI to enable this. One has in addition Citeseer/OSCAR metadata. Current major tagging systems support a flat list of tags, without name=value (RDF triple) or schema organization. There is a tradeoff between features and pervasive deployment. Some extra features are easy to add as a custom service. Features not supported by del.icio.us can be uploaded as comments.
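One way to carry name=value provenance through a tool that only offers flat tags and a free-text comment is to append a machine-readable block to the comment. The sketch below is an assumption about how that could be done; the `#meta:` marker and both function names are hypothetical and are not a del.icio.us convention.

```python
import json
from typing import Dict, Tuple

MARKER = "#meta:"  # hypothetical marker that separates free text from structured data


def encode_comment(free_text: str, properties: Dict[str, str]) -> str:
    """Append name=value properties to a comment as a single JSON line."""
    return f"{free_text}\n{MARKER}{json.dumps(properties, sort_keys=True)}"


def decode_comment(comment: str) -> Tuple[str, Dict[str, str]]:
    """Split a comment back into free text and properties.

    json.dumps escapes newlines, so the last "\\n#meta:" is always the separator.
    """
    text, sep, payload = comment.rpartition("\n" + MARKER)
    if not sep:
        return comment, {}  # plain comment with no structured block
    return text, json.loads(payload)


props = {"instrument": "NMR", "quality": "high", "derived_from": "http://example.org/data/42"}
stored = encode_comment("Good quality run", props)
text_back, props_back = decode_comment(stored)
```

The tagging tool sees only an ordinary comment, so pervasive deployment is preserved, while a custom service that knows the marker can recover the structured provenance.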

16 Current Status. Google Scholar, Windows Live Academic Search, del.icio.us, Connotea, CiteULike and OSCAR3 are Web Services. Debugging on 500 presentations and papers from my CGL research group. Experiment with GGF presentations, a broad collection of Chemical Informatics resources (explore the science document CI link) and the Concurrency&Computation: Practice&Experience web site (? business model for journals).