" GTPS" is acronym of Gene Trek in Procaryote Space. Various complete genomes of eubacteria and archaea have been registered in the International Nucleot.

Presentation transcript:

"GTPS" is an acronym of Gene Trek in Procaryote Space. Various complete genomes of eubacteria and archaea have been registered in the International Nucleotide Sequence Databases (INSD) of DDBJ/EMBL/GenBank. The annotation and sequence data are available from GIB (Genome Information Broker;

Semantically Annotated RESTful Services for Large-scale Metabolomics Data Analysis
Ashwin Manjunatha, Paul Anderson, Satya Sahoo, Ajith Ranabahu, Michael Raymer, Amit Sheth
Kno.e.sis Center, Wright State University

Poster sections:
1. Introduction
2. What is the problem?
3. How about Scalability?
4. Annotation and SA-REST: adding metadata to point to richer models
5. Advantages of Annotation
6. What is the bottom line for the Biologist?
7. Tools
8. References

What is the problem?
Large data sets.
Standard post-instrumental processing: quantification of spectral features, normalization, scaling, and multivariate statistical modeling.
All of these are computationally intensive processes.
A variety of algorithms exists for each step.
A robust and flexible analysis platform is needed.

A common solution for flexibility
Move to a service-based architecture!
Provide Web Services for each algorithm.
Assemble workflows as required!
Taverna: an open-source family of tools for designing and executing workflows.

Metabolomics
The term metabolomics is defined as a comprehensive analysis in which metabolites of a biological system are identified and quantified. Any technique that can quantify metabolites can be used for metabolomics, but there are two primary techniques seen in the literature: nuclear magnetic resonance (NMR) and mass spectrometry with a prior on-line separation step such as high performance liquid chromatography (HPLC) or gas chromatography (GC). Neither technique is strictly superior; each has its own advantages and disadvantages.
Existing applications include the identification of biomarkers associated with responses to toxins and pathophysiologic changes, sample classification based on the type of toxic exposure, large-scale human studies, clinical diagnosis, and the study of genetic disorders.

Hadoop
An open-source software framework for reliable, scalable, distributed computing.
Uses the map-reduce computational paradigm.
Runs on computing clouds.

Computing Cloud
Shared hardware resources, software and information are provided to computers and other devices on demand.
Many vendors.

How about Scalability?
Use Apache Hadoop on computing clouds to run processes in parallel. This is applicable to many common mathematical operations such as summing and averaging.

Faceted Search
A technique for accessing a collection of information represented using a faceted classification, allowing users to explore by filtering the available information. When documents are annotated with richer models, the indexing software can easily create faceted indexes to support a fine-grained search. Even the regular keyword search can be improved.
1. Query by concept, not by keyword. Search for NCI:FASTA instead of just FASTA. This yields documents that indicate the term FASTA as defined by the NCI Thesaurus.
2. Filter by multiple facets. Issue queries indicating many facets, say type: soap binding:java include:NCI:FASTA, to look for service descriptions that are SOAP services with Java bindings and that mention NCI:FASTA.

Semi-Automated Composition
When service interface documents are annotated, service compositions can be done more intelligently.
1. A composition tool can warn the creator of incompatible connections: output of Service A cannot be input to Service B!
2. Supplement transformations by suggesting matching elements: create transformations, or indicate the difficulty of the transformation to the human (see Mediatability [1]).

Tools
Firefox Plug-in
Annotate web pages inside the browser and submit them to the index.
Indexing/Search framework
1. Built using the technology made for faceted classification of Web APIs [2].
2. Multiple Apache Lucene indexes in the back-end.

References
1. K. Gomadam, A. Ranabahu, L. Ramaswamy, A. P. Sheth. Mediatability: Estimating the Degree of Human Involvement in XML Schema Mediation.
2. K. Gomadam, A. Ranabahu, M. Nagarajan, A. P. Sheth, K. Verma. A Faceted Classification Based Approach to Search and Rank Web APIs.
3. SA-REST: Semantic Annotation of Web Resources. W3C member submission by Wright State University.

What is the bottom line for the Biologist?
Better Search for Biological Web Services
Services can be searched with more precise terms and concepts. Search by ontology concept and add facets for precise filtering.
Convenience in Creating Workflows
Find and mash services together with ease. The tools can suggest the degree of match and also create data mappings. Workflows can be built graphically and then executed with a point and click. There is no need to download, install and configure a number of applications.
Faster processing and result generation
The backend services can be cloud based, providing results much faster than any single computer.
No need for heavy in-house computing facilities
Use services that are hosted on clouds and avoid the equipment costs and the hassle of hardware maintenance. The pay-per-use pricing model is convenient for sporadic usage.

SA-REST
W3C member submission on Semantic Annotation of RESTful services [3].
Three basic properties:
domain-rel: marks the top-level domain of a document, e.g. Nucleotides
sem-rel: marks the domain of a linked document
sem-class: marks the meaning of a selected word

Toxicology
Toxicology is the branch of pharmacology that deals with poisons and their effects on plant, animal and human life.

NMR
Nuclear magnetic resonance (NMR) spectroscopy is an experimental technique that exploits the properties of an atom's nucleus. It can be used to obtain information about the concentration and structure of molecules. NMR studies magnetic nuclei by applying a static magnetic field followed by a second, oscillating magnetic field. Specifically, only nuclei with an odd number of protons or neutrons can be measured using NMR; the two most common atoms studied are ¹H and ¹³C.

Figure labels: NMR Spectrometer; NMR Ontology; Web page. Annotation links the ontology concept with a term.
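
The scalability note above says that map-reduce suits operations such as summing and averaging. The following is a minimal single-process sketch of that idea in plain Python, not Hadoop code and not the authors' implementation. The record layout (sample id, intensity) and all function names are assumptions made for illustration.

```python
from collections import defaultdict
from functools import reduce


def map_phase(records):
    """Emit (key, (partial_sum, count)) for every (sample_id, intensity) record."""
    for sample_id, intensity in records:
        yield sample_id, (intensity, 1)


def shuffle(pairs):
    """Group the emitted values by key, as the framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Combine the partial (sum, count) pairs per key and turn them into a mean."""
    means = {}
    for key, values in grouped.items():
        total, count = reduce(lambda a, b: (a[0] + b[0], a[1] + b[1]), values)
        means[key] = total / count
    return means


def mean_by_key(records):
    """Run the three phases in sequence on an in-memory list of records."""
    return reduce_phase(shuffle(map_phase(records)))
```

Carrying (sum, count) pairs instead of raw values keeps the reduce step associative, which is what allows partial results from different workers to be merged in any order.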
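
The faceted query from the Faceted Search section (type, binding and include facets) can be illustrated with a small in-memory filter. This is only a sketch under stated assumptions: the poster's index is built on Apache Lucene, whereas here the documents are plain dictionaries, and the field names "id" and "concepts" as well as the query parsing are hypothetical.

```python
def parse_query(query):
    """Split 'facet:value' tokens; only the first colon separates facet from value,
    so 'include:NCI:FASTA' becomes ('include', 'NCI:FASTA')."""
    facets = []
    for token in query.split():
        name, _, value = token.partition(":")
        facets.append((name, value))
    return facets


def matches(doc, facets):
    """A document matches only if every facet in the query is satisfied."""
    for name, value in facets:
        if name == "include":
            # 'include' tests membership in the set of annotated ontology concepts.
            if value not in doc.get("concepts", set()):
                return False
        elif doc.get(name) != value:
            return False
    return True


def search(docs, query):
    """Return the ids of all documents that satisfy every facet of the query."""
    facets = parse_query(query)
    return [doc["id"] for doc in docs if matches(doc, facets)]
```

Querying by the concept NCI:FASTA rather than the keyword FASTA means a document that merely contains the word, without the annotation, is not returned.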
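
The composition warning described under Semi-Automated Composition ("output of Service A cannot be input to Service B") can be sketched as a check over annotated service descriptions. The Service type, its fields and the concept names in the usage note are hypothetical; a real tool would compare ontology concepts with more than simple equality.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Service:
    name: str
    input_concept: str   # ontology concept annotated on the service input
    output_concept: str  # ontology concept annotated on the service output


def check_connection(producer, consumer):
    """Return a warning string if the producer's output cannot feed the consumer."""
    if producer.output_concept == consumer.input_concept:
        return None
    return (
        f"Output of {producer.name} ({producer.output_concept}) "
        f"cannot be input to {consumer.name} ({consumer.input_concept})"
    )


def check_workflow(services):
    """Check every adjacent pair in a linear workflow and collect the warnings."""
    warnings = []
    for producer, consumer in zip(services, services[1:]):
        warning = check_connection(producer, consumer)
        if warning is not None:
            warnings.append(warning)
    return warnings
```

For example, chaining a service that outputs a hypothetical concept nmr:Spectrum into one that expects nmr:NormalizedSpectrum would produce one warning, while a matching pair produces none.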
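
To show how an indexer might pick up the three SA-REST properties listed above, here is a minimal sketch using Python's standard html.parser module. It assumes that the property name appears in an element's class or rel attribute and that the concept URI is carried in the title attribute; that encoding, and the example.org URIs in the usage note, are assumptions for illustration and should be checked against the SA-REST submission [3].

```python
from html.parser import HTMLParser

SA_REST_PROPERTIES = {"domain-rel", "sem-rel", "sem-class"}


class AnnotationCollector(HTMLParser):
    """Collect (property, concept URI) pairs from annotated start tags."""

    def __init__(self):
        super().__init__()
        self.annotations = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        # Assumption: the property name sits in 'class' or 'rel',
        # and the ontology concept URI sits in 'title'.
        tokens = (attr.get("class") or "").split() + (attr.get("rel") or "").split()
        for token in tokens:
            if token in SA_REST_PROPERTIES:
                self.annotations.append((token, attr.get("title")))


def extract_annotations(html):
    """Parse an HTML string and return its SA-REST style annotations in document order."""
    collector = AnnotationCollector()
    collector.feed(html)
    collector.close()
    return collector.annotations
```

Calling extract_annotations on a page whose paragraph carries class="domain-rel" and title="http://example.org/onto#Nucleotides" would return that pair, which an indexer could then store as a facet value.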